Apache Spark

Apache Spark is a unified analytics engine for large-scale data processing, used for big data and machine learning tasks. Coursera's Apache Spark skill catalogue teaches you about this powerful tool for handling big data analytics. You'll learn the fundamentals of Spark's distributed computing model, its powerful data processing capabilities, and how to implement machine learning algorithms with Spark. You'll also delve into Spark SQL for structured data processing, Spark Streaming for real-time data processing, and MLlib for machine learning tasks. Master these aspects to enhance your data science skills and solve complex computational problems in various industries.
31credentials
75courses

Related roles

Gain the knowledge and skills you need to advance.

  • This role has a £62,876 median salary ¹.

    description:

    A Data Engineer builds data pipelines for large datasets, optimizing systems and ensuring reliable data flow using tools like Hadoop and Spark.

    This role has a £62,876 median salary ¹.

    Offered by

    DeepLearning.AI_logo
    Amazon Web Services_logo
    Google Cloud_logo

Most popular

Trending now

New releases

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Explore the Apache Spark Course Catalog

  • Status: Free Trial

    Skills you'll gain: Data Visualization Software, Big Data, Apache Hadoop, Apache Spark, Apache Hive, Distributed Computing, Data Processing, Data Warehousing, Data Analysis, Data Integration, Cloud Security, Artificial Intelligence and Machine Learning (AI/ML), SQL

  • Status: Free Trial

    Coursera Instructor Network

    Skills you'll gain: Data Pipelines, Apache Hadoop, Extract, Transform, Load, Data Transformation, Apache Hive, Data-Driven Decision-Making, Big Data, Data Warehousing, Apache Spark, Data Integration, Data Processing, Data Management, Data Analysis, Scalability

  • Status: New
    Status: Free Trial

    Skills you'll gain: NoSQL, Machine Learning Algorithms, Big Data, Apache Hadoop, Deep Learning, Statistical Analysis, Data Visualization, Data Analysis, Exploratory Data Analysis, Apache Spark, Generative AI, Data Infrastructure, Data Cleansing, Real Time Data, Apache Kafka, Data Management, Image Analysis, Artificial Neural Networks, Artificial Intelligence and Machine Learning (AI/ML), Machine Learning

  • Status: Free Trial

    Skills you'll gain: Spatial Analysis, NoSQL, Spatial Data Analysis, Geospatial Mapping, Geographic Information Systems, Public Cloud, Big Data, MongoDB, Apache Hadoop, Apache Spark, Distributed Computing, Data Architecture, Cloud Services, Data Processing, Database Systems, Cloud Computing, Scalability, Databases, Environmental Monitoring, Climate Change Programs

  • Status: Free Trial

    Skills you'll gain: Data Warehousing, Google Cloud Platform, Big Data, Apache Spark, Database Management, Data Integration, Dataflow, SQL, Data Pipelines, Metadata Management, Data Management, Real Time Data, Tensorflow, Data Science, Command-Line Interface, Applied Machine Learning, Cloud-Based Integration, Apache Hadoop, Query Languages, Machine Learning

  • Status: Free Trial

    Skills you'll gain: AWS Kinesis, Amazon DynamoDB, Amazon S3, Data Pipelines, Real Time Data, Amazon CloudWatch, AWS Identity and Access Management (IAM), Cloud Storage, Apache Spark, Dashboard, Amazon Web Services, Apache Hive, Interactive Data Visualization, Apache Hadoop, Data Visualization Software, Data Processing, Extract, Transform, Load, Data Storage, Database Management Systems, Big Data

  • Status: Free Trial

    Duke University

    Skills you'll gain: Databricks, Generative AI, Data Lakes, Extract, Transform, Load, MLOps (Machine Learning Operations), Data Transformation, LLM Application, Data Pipelines, Large Language Modeling, Apache Spark, Responsible AI, Data Analysis, Data Science, CI/CD, Machine Learning

  • Status: Free Trial

    Skills you'll gain: AI Personalization, Apache Spark, Artificial Intelligence and Machine Learning (AI/ML), AWS SageMaker, Scalability, Tensorflow, Dimensionality Reduction, Applied Machine Learning, Python Programming, Fraud detection, Predictive Modeling, Machine Learning Algorithms, Unsupervised Learning, Data Processing

  • Skills you'll gain: Azure Synapse Analytics, Data Migration, Microsoft Azure, Data Warehousing, SQL Server Integration Services (SSIS), Microsoft SQL Servers, Cloud Security, Performance Tuning, System Configuration, Cloud Storage, Cloud Computing Architecture, Data Security, Apache Spark, Data Integration

  • Skills you'll gain: Metadata Management, Data Pipelines, Data Processing, Google Cloud Platform, Data Migration, Cloud Storage, Apache Airflow, Data Lakes, Data Storage, Big Data, Data Infrastructure, Extract, Transform, Load, Apache Spark, IT Automation, Data Management, Data Transformation, Serverless Computing, SQL

  • Status: New
    Status: Free Trial

    Skills you'll gain: AWS Kinesis, Amazon Web Services, Real Time Data, Apache Spark, Extract, Transform, Load, Data Processing, Dashboard, Full-Stack Web Development, Mobile Development Tools, Event-Driven Programming, Business Intelligence, Data Visualization

  • Skills you'll gain: Extract, Transform, Load, Data Sharing, Data Pipelines, Metadata Management, Google Cloud Platform, Data Migration, Data Processing, Big Data, Data Integration, Cloud Storage, Data Warehousing, Data Management, Data Lakes, Data Import/Export, Data Transformation, Apache Spark, Serverless Computing

What brings you to Coursera today?

Leading partners

  • Google Cloud
  • Packt
  • IBM
  • EDUCBA
  • Pearson
  • University of California San Diego
  • Amazon Web Services
  • Edureka