Apache Hadoop

Apache Hadoop is an open-source software framework used for distributed storage and processing of large datasets across clusters of computers. Coursera's Apache Hadoop catalogue teaches you about the core concepts and components of this powerful framework. You'll learn about Hadoop's architecture, its key components like Hadoop Distributed File System (HDFS) and MapReduce, as well as advanced topics such as data ingestion with tools like Flume and Sqoop. You will also delve into data processing using Hive and Pig, and explore scalable machine learning algorithms. By mastering Apache Hadoop, you will be equipped to handle big data challenges, contributing to business insights and decision making.
27credentials
64courses

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Explore the Mapreduce Course Catalog

  • Status: New
    Status: Free Trial

    Skills you'll gain: Extract, Transform, Load, Apache Spark, Data Pipelines, PySpark, Apache Hadoop, Data Transformation, MySQL, Data Manipulation, Java Platform Enterprise Edition (J2EE), Data Store, Data Import/Export, Development Environment, Software Installation, System Configuration

  • Status: Free Trial
    Status: AI skills

    Skills you'll gain: NoSQL, Apache Spark, Data Warehousing, Apache Hadoop, Extract, Transform, Load, Apache Airflow, Web Scraping, Linux Commands, Database Design, SQL, IBM Cognos Analytics, MySQL, Database Administration, Data Store, Generative AI, Professional Networking, Data Import/Export, Python Programming, Data Analysis, Data Science

  • Status: Free Trial

    Johns Hopkins University

    Skills you'll gain: Data Warehousing, Apache Hadoop, Distributed Computing, Scalability, Transaction Processing, Database Systems, Database Design, Database Management Systems, Relational Databases, Database Architecture and Administration, Database Management, Cloud Computing, Query Languages, Big Data, Databases, Data Processing, Machine Learning, SQL, Data Access, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: Data Visualization Software, PySpark, Data Visualization, Snowflake Schema, Data Storytelling, Site Reliability Engineering, Docker (Software), Databricks, Containerization, Interactive Data Visualization, Plotly, Data Pipelines, Matplotlib, Kubernetes, Dashboard, Apache Spark, Apache Hadoop, Big Data, Data Science, Python Programming

  • Status: Free Trial

    Skills you'll gain: Apache Hadoop, Data Processing, Distributed Computing, Performance Tuning, Big Data, Software Architecture, Scalability, Java, System Configuration

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Machine Learning, Generative AI, PySpark, Applied Machine Learning, Supervised Learning, Apache Hadoop, Data Pipelines, Unsupervised Learning, Feature Engineering, Data Processing, Extract, Transform, Load, Predictive Modeling, Data Transformation, Regression Analysis

  • Status: Free Trial

    Skills you'll gain: Feature Engineering, PySpark, Data Import/Export, Apache Spark, Dashboard, Cloud Services, Applied Machine Learning, Application Programming Interface (API), Apache Hive, Jupyter, Big Data, Artificial Intelligence and Machine Learning (AI/ML), Query Languages, Apache Hadoop, Serverless Computing, Application Deployment, Looker (Software), Cloud Computing, Scalability, SQL

  • Status: Free Trial

    Skills you'll gain: Dataflow, Data Pipelines, Real Time Data, Feature Engineering, PySpark, Cloud Storage, Data Import/Export, Apache Spark, Data Maintenance, Google Cloud Platform, Apache Hadoop, Dashboard, Data Lakes, Tensorflow, Big Data, Cloud Services, Data Storage, MLOps (Machine Learning Operations), Data Analysis, Data Warehousing

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Apache Hadoop, Real Time Data, Apache Hive, Apache Kafka, Big Data, Distributed Computing, Data Processing, Databases, MongoDB, NoSQL, System Design and Implementation, SQL, Scalability

  • Status: Preview

    Skills you'll gain: PySpark, Apache Spark, Data Management, Distributed Computing, Apache Hadoop, Data Processing, Data Analysis, Exploratory Data Analysis, Python Programming, Scalability

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Analytics, Data Processing, Data Mapping, Text Mining, Distributed Computing, Java, Debugging, Java Programming

  • Status: Free Trial

    Skills you'll gain: AWS Kinesis, Amazon DynamoDB, Amazon S3, Data Pipelines, Real Time Data, Amazon CloudWatch, AWS Identity and Access Management (IAM), Cloud Storage, Apache Spark, Dashboard, Amazon Web Services, Apache Hive, Interactive Data Visualization, Apache Hadoop, Data Visualization Software, Data Processing, Extract, Transform, Load, Data Storage, Database Management Systems, Big Data