• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Coursera
Log In
Join for Free
Coursera
  • Browse
  • Pyspark

PySpark Courses Online

Learn PySpark for big data processing. Understand how to use PySpark for distributed data analysis and machine learning.

Skip to search results

Filter by

Subject
Required
 *

Language
Required
 *

The language used throughout the course, in both instruction and assessments.

Learning Product
Required
 *

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required
 *

Duration
Required
 *

Skills
Required
 *

Subtitles
Required
 *

Educator
Required
 *

Explore the PySpark Course Catalog

  • Status: Free Trial
    Free Trial
    I

    IBM

    AI Workflow: Enterprise Model Deployment

    Skills you'll gain: Apache Spark, Data Pipelines, MLOps (Machine Learning Operations), PySpark, Application Deployment, IBM Cloud, Machine Learning, Containerization, Data Science, Python Programming, Performance Tuning, Scalability

    4.3
    Rating, 4.3 out of 5 stars
    ·
    59 reviews

    Advanced · Course · 1 - 4 Weeks

  • C

    Coursera Project Network

    Diabetes Prediction With Pyspark MLLIB

    Skills you'll gain: Data Cleansing, Apache Spark, PySpark, Data Manipulation, Applied Machine Learning, Data Processing, Classification And Regression Tree (CART), Predictive Modeling, Data Science, Machine Learning, Google Cloud Platform, Python Programming

    4.6
    Rating, 4.6 out of 5 stars
    ·
    22 reviews

    Intermediate · Guided Project · Less Than 2 Hours

  • Status: Free Trial
    Free Trial
    C

    Coursera Instructor Network

    Engineering Data Ecosystems: Pipelines, ETL, Spark

    Skills you'll gain: Extract, Transform, Load, Apache Spark, Data Pipelines, Data Integration, Big Data, Data Infrastructure, Data Processing, Dataflow, Data Management, Data Architecture, Scalability

    Beginner · Course · 1 - 4 Weeks

  • U

    University of California San Diego

    Hadoop Platform and Application Framework

    Skills you'll gain: Apache Hadoop, Big Data, Data Analysis, Apache Spark, Data Science, Data Processing, Distributed Computing, Performance Tuning, Scalability, Data Storage, Python Programming

    4
    Rating, 4 out of 5 stars
    ·
    3.3K reviews

    Mixed · Course · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    P

    Packt

    NumPy, Matplotlib & Pandas – Data Science Prerequisites

    Skills you'll gain: NumPy, Pandas (Python Package), Data Manipulation, Scatter Plots, Jupyter, Data Visualization Software, Machine Learning, Data Science, Data Import/Export, Classification And Regression Tree (CART), Linear Algebra, Probability Distribution, Regression Analysis

    Beginner · Course · 1 - 3 Months

  • C

    Coursera Project Network

    Data Management with Databricks: Big Data with Delta Lakes

    Skills you'll gain: Databricks, Data Lakes, Data Pipelines, Data Integration, Dashboard, PySpark, SQL, Apache Spark, Data Management, Data Transformation, Version Control

    4.2
    Rating, 4.2 out of 5 stars
    ·
    29 reviews

    Intermediate · Guided Project · Less Than 2 Hours

  • Status: Free Trial
    Free Trial
    D

    Duke University

    Python Essentials for MLOps

    Skills you'll gain: Pandas (Python Package), MLOps (Machine Learning Operations), NumPy, Data Manipulation, Software Testing, Data Import/Export, Test Automation, Python Programming, Debugging, Data Structures, Machine Learning, Object Oriented Programming (OOP), Scripting, Program Development, Numerical Analysis, Application Programming Interface (API), Command-Line Interface

    4.3
    Rating, 4.3 out of 5 stars
    ·
    308 reviews

    Intermediate · Course · 1 - 3 Months

  • Status: Free
    Free
    C

    Coursera Project Network

    Machine Learning with PySpark: Recommender System

    Skills you'll gain: PySpark, Data Pipelines, Data Processing, AI Personalization, Dimensionality Reduction, OpenAI, Data Manipulation, Pandas (Python Package), Data Transformation, Unsupervised Learning, Applied Machine Learning, Machine Learning

    Intermediate · Guided Project · Less Than 2 Hours

  • Status: Free Trial
    Free Trial
    D

    DeepLearning.AI

    Data Modeling, Transformation, and Serving

    Skills you'll gain: Data Modeling, Data Transformation, Data Processing, Data Warehousing, Apache Hadoop, Extract, Transform, Load, Data Pipelines, Apache Spark, Feature Engineering, Data Manipulation, Star Schema, Applied Machine Learning, Real Time Data, Machine Learning

    4.5
    Rating, 4.5 out of 5 stars
    ·
    85 reviews

    Intermediate · Course · 1 - 4 Weeks

  • I

    IBM

    Scalable Machine Learning on Big Data using Apache Spark

    Skills you'll gain: Apache Spark, PySpark, Applied Machine Learning, Big Data, Data Storage Technologies, Statistical Machine Learning, Data Pipelines, Machine Learning Algorithms, Machine Learning, Data Processing, Data Science, Statistical Analysis

    3.8
    Rating, 3.8 out of 5 stars
    ·
    1.3K reviews

    Intermediate · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Preview
    Preview
    O

    O.P. Jindal Global University

    Big Data Analytics

    Skills you'll gain: Big Data, Apache Spark, Apache Hadoop, Apache Hive, Databases, Analytics, Data Storage Technologies, Data Mining, NoSQL, Applied Machine Learning, Real Time Data, Distributed Computing, SQL, Data Processing, Query Languages, Scripting Languages

    Build toward a degree

    Beginner · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    M

    Microsoft

    Microsoft Azure Machine Learning for Data Scientists

    Skills you'll gain: Responsible AI, Microsoft Azure, Unsupervised Learning, Databricks, MLOps (Machine Learning Operations), Applied Machine Learning, Regression Analysis, Scikit Learn (Machine Learning Library), Predictive Modeling, Cloud Management, Machine Learning, Artificial Intelligence and Machine Learning (AI/ML), Supervised Learning, Virtual Machines, Application Deployment, Data Pipelines

    4.3
    Rating, 4.3 out of 5 stars
    ·
    175 reviews

    Intermediate · Course · 1 - 4 Weeks

PySpark learners also search

Analytics
Business Intelligence
Business Analytics
Business Intelligence Projects
Digital Analytics
Web Analytics
Financial Analytics
Social Media Analytics
1234…10

In summary, here are 10 of our most popular pyspark courses

  • AI Workflow: Enterprise Model Deployment: IBM
  • Diabetes Prediction With Pyspark MLLIB: Coursera Project Network
  • Engineering Data Ecosystems: Pipelines, ETL, Spark: Coursera Instructor Network
  • Hadoop Platform and Application Framework: University of California San Diego
  • NumPy, Matplotlib & Pandas – Data Science Prerequisites: Packt
  • Data Management with Databricks: Big Data with Delta Lakes: Coursera Project Network
  • Python Essentials for MLOps: Duke University
  • Machine Learning with PySpark: Recommender System: Coursera Project Network
  • Data Modeling, Transformation, and Serving: DeepLearning.AI
  • Scalable Machine Learning on Big Data using Apache Spark: IBM

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Manage Cookie Preferences
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2025 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok