
PySpark Courses

PySpark courses can help you learn data manipulation, distributed computing, and data analysis techniques. You can build skills in working with large datasets, performing transformations, and running machine learning algorithms. Many courses introduce Apache Spark and its libraries, which support processing big data efficiently and integrating with AI applications.

Popular PySpark Courses and Certifications


  • Coursera

    Diabetes Prediction With Pyspark MLLIB

    Skills you'll gain: Data Preprocessing, Logistic Regression, Data Cleansing, Apache Spark, PySpark, Data Manipulation, Applied Machine Learning, Predictive Modeling, Data Science, Machine Learning, Python Programming

    Rating: 4.6 out of 5 stars · 22 reviews

    Intermediate · Guided Project · Less Than 2 Hours

  • Status: New

    Packt

    Data Engineering with Scala and Spark

    Skills you'll gain: Scala Programming, Data Pipelines, Test Driven Development (TDD), Apache Airflow, Data Lakes, Apache Spark, CI/CD, Apache Kafka, Data Quality, Data Architecture, Performance Tuning, Data Store, Unit Testing, Data Transformation, Data Processing, Data Validation, Maintainability, Continuous Integration, Continuous Deployment, Data Integrity

    Intermediate · Course · 3 - 6 Months

  • Status: Free Trial

    Pearson

    Hadoop and Spark Fundamentals: Unit 1

    Skills you'll gain: Apache Spark, Apache Hadoop, Data Lakes, Big Data, Linux Commands, Linux, File Systems, Data Management, Command-Line Interface, Data Processing, Software Installation, Distributed Computing, Scalability, System Configuration

    Intermediate · Course · 1 - 4 Weeks

  • Status: New
    Status: Free Trial

    Coursera

    Spark, Skew & Speed: Pipeline Performance Engineering

    Skills you'll gain: System Monitoring, Data Quality, Performance Tuning, Apache Spark, Data Validation, Data Pipelines, Query Languages, Debugging, Data Transformation, Anomaly Detection, PySpark, Performance Analysis, Extract, Transform, Load, Failure Analysis, SQL, Data Architecture, Data Processing, Benchmarking, Root Cause Analysis, Distributed Computing

    Advanced · Specialization · 3 - 6 Months

  • Status: Free Trial

    Google Cloud

    Data Engineering, Big Data, and Machine Learning on GCP

    Skills you'll gain: Dashboard Creation, Model Deployment, Feature Engineering, PySpark, Data Import/Export, Big Data, Apache Spark, Data Governance, Apache Hadoop, Dashboard, Apache Kafka, Data Store, Cloud Services, Cloud Deployment, Data Access, Cloud API, Data Architecture, Data Quality, Data Cleansing, Machine Learning Methods

    Rating: 4.6 out of 5 stars · 4.4K reviews

    Intermediate · Specialization · 3 - 6 Months

  • Status: Free Trial

    Google Cloud

    Preparing for Google Cloud Certification: Cloud Data Engr

    Skills you'll gain: Dashboard Creation, Real Time Data, Model Deployment, Google Cloud Platform, Feature Engineering, PySpark, Data Lakes, Dataflow, Data Pipelines, Cloud Storage, Data Import/Export, Big Data, Apache Spark, Data Governance, Apache Hadoop, Dashboard, Apache Kafka, Tensorflow, Cloud Engineering, Data Warehousing

    Rating: 4.6 out of 5 stars · 4.9K reviews

    Intermediate · Professional Certificate · 3 - 6 Months

  • Status: Free Trial

    Edureka

    Machine Learning with PySpark

    Skills you'll gain: PySpark, Model Optimization, Model Training, Applied Machine Learning, Machine Learning Methods, Machine Learning

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial

    Edureka

    Data Streaming and NLP with PySpark

    Skills you'll gain: PySpark, Data Pipelines, Apache Spark, Dashboard Creation, Dashboard, Interactive Data Visualization, Data Processing, Real Time Data, Natural Language Processing, Distributed Computing, Deep Learning, Performance Tuning

    Intermediate · Course · 1 - 3 Months

  • Status: New
    Status: Free Trial

    Coursera

    Building Automated Data Pipelines with Spark,dbt,and Airflow

    Skills you'll gain: Data Flow Diagrams (DFDs), Apache Airflow, Data Pipelines, Diagram Design, Dataflow, Data Mapping, Data Modeling, Data Integration, Data Architecture, Data Warehousing, Apache Spark, Extract, Transform, Load, Database Development, Data Processing, Data Transformation, Service Level, Enterprise Security

    Beginner · Course · 1 - 3 Months

  • Status: Free Trial

    Microsoft

    Perform data science with Azure Databricks

    Skills you'll gain: PySpark, Databricks, Apache Spark, MLOps (Machine Learning Operations), Microsoft Azure, Big Data, Data Lakes, Model Training, Machine Learning Methods, Data Processing, Deep Learning, Data Transformation, Model Deployment, Data Pipelines, Data Manipulation, Model Evaluation, Machine Learning, Exploratory Data Analysis

    Rating: 3.1 out of 5 stars · 79 reviews

    Intermediate · Course · 1 - 3 Months

  • Status: Free Trial

    Duke University

    Applied Python Data Engineering

    Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Snowflake Schema, Data Storytelling, Site Reliability Engineering, Docker (Software), Databricks, Containerization, GitHub Copilot, Interactive Data Visualization, Plot (Graphics), Plotly, Data Pipelines, Kubernetes, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming

    Rating: 3.8 out of 5 stars · 128 reviews

    Intermediate · Specialization · 1 - 3 Months

  • Status: New
    Status: Free Trial

    Pragmatic AI Labs

    Databricks Lakehouse Fundamentals

    Skills you'll gain: Databricks, Data Pipelines, Data Lakes, Data Wrangling, Apache Spark, Data Access, Data Processing, Data Warehousing, Data Architecture, Data Management, Data Synthesis, Data Science, Data Mining, Data Integrity, Data Modeling, Data Presentation, Data Entry, Data Storage, SQL, Python Programming

    Rating: 4.5 out of 5 stars · 6 reviews

    Beginner · Course · 1 - 4 Weeks


In summary, here are 10 of our most popular PySpark courses:

  • Diabetes Prediction With Pyspark MLLIB: Coursera
  • Data Engineering with Scala and Spark: Packt
  • Hadoop and Spark Fundamentals: Unit 1: Pearson
  • Spark, Skew & Speed: Pipeline Performance Engineering: Coursera
  • Data Engineering, Big Data, and Machine Learning on GCP: Google Cloud
  • Preparing for Google Cloud Certification: Cloud Data Engr: Google Cloud
  • Machine Learning with PySpark: Edureka
  • Data Streaming and NLP with PySpark: Edureka
  • Building Automated Data Pipelines with Spark,dbt,and Airflow: Coursera
  • Perform data science with Azure Databricks: Microsoft

© 2026 Coursera Inc. All rights reserved.