Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, widely used for big data and machine learning workloads. Coursera's Apache Spark catalog covers the fundamentals of Spark's distributed computing model, its data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also work with Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning. Together, these skills strengthen your data science toolkit and help you solve complex computational problems across industries.
36 credentials
1 online degree
75 courses

Explore the Apache Spark Course Catalog

  • Status: Free Trial

    Skills you'll gain: Dataflow, Data Pipelines, Real Time Data, Feature Engineering, PySpark, Cloud Storage, Data Import/Export, Apache Spark, Data Maintenance, Google Cloud Platform, Apache Hadoop, Dashboard, Data Lakes, Tensorflow, Big Data, Cloud Services, Data Storage, MLOps (Machine Learning Operations), Data Analysis, Data Warehousing

  • Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, Apache Hadoop, Data Pipelines, Big Data, Data Processing, Distributed Computing, Data Analysis Expressions (DAX), Data Integration, Data Transformation, SQL, Data Manipulation, Data Cleansing

  • Status: Free Trial

    Skills you'll gain: PySpark, Data Pipelines, Apache Spark, Data Processing, Real Time Data, Data Visualization, Natural Language Processing, Distributed Computing, Text Mining, Data Transformation, Deep Learning, Performance Tuning

  • Status: New, Free Trial

    Skills you'll gain: Apache Hadoop, Apache Hive, Apache Spark, Big Data, Data Pipelines, Data Import/Export, Data Integration, Data Processing, Relational Databases, File Systems, Command-Line Interface, Configuration Management, Software Installation

  • Status: New, Free Trial

    Skills you'll gain: Apache Spark, Generative AI, LLM Application, Large Language Modeling, Predictive Modeling, Matplotlib, Keras (Neural Network Library), Generative Model Architectures, Deep Learning, ChatGPT, OpenAI, Generative AI Agents, Tensorflow, Seaborn, A/B Testing, Statistical Modeling, Data Visualization, Regression Analysis, Big Data, Machine Learning

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Apache Hadoop, Real Time Data, Apache Hive, Apache Kafka, Big Data, Distributed Computing, Data Processing, Databases, MongoDB, NoSQL, System Design and Implementation, SQL, Scalability

  • Status: New, Free Trial

    Skills you'll gain: Apache Spark, Apache Hadoop, Data Lakes, Big Data, Linux Commands, Linux, File Systems, Data Management, Command-Line Interface, Data Processing, Software Installation, Distributed Computing, System Configuration

  • Status: Free Trial

    Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Snowflake Schema, Data Storytelling, Site Reliability Engineering, Docker (Software), Databricks, Containerization, Interactive Data Visualization, Plotly, Data Pipelines, Matplotlib, Kubernetes, Dashboard, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming

  • Status: Free Trial

    Skills you'll gain: NoSQL, Data Warehousing, SQL, Apache Hadoop, ETL (Extract, Transform, Load), Apache Airflow, Data Security, Linux Commands, Data Migration, Database Design, Data Governance, MySQL, Database Administration, Apache Spark, Data Pipelines, Apache Kafka, Database Management, Bash (Scripting Language), Data Store, Data Architecture

  • Status: New, Free Trial

    University of California, Davis

    Skills you'll gain: Data Governance, Presentations, SQL, Apache Spark, Distributed Computing, Descriptive Statistics, Data Lakes, Data Storytelling, Peer Review, Exploratory Data Analysis, Data Quality, Data Pipelines, Databricks, JSON, Statistical Analysis, Data Modeling, Database Design, Data Analysis, Complex Problem Solving, Data Visualization

  • Status: Free Trial

    Skills you'll gain: AWS Kinesis, Real Time Data, Apache Spark, Apache Hive, Data Pipelines, Apache Hadoop, Data Processing, ETL (Extract, Transform, Load), Amazon Web Services, Serverless Computing, Data Lakes, Data Visualization, Amazon S3, Query Languages, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: Data Pipelines, Dataflow, Google Cloud Platform, ETL (Extract, Transform, Load), Data Processing, Apache Hive, Data Integration, Apache Spark, PySpark, Serverless Computing, Apache Hadoop, Big Data, Data Migration, Data Transformation, Performance Tuning