Apache Hadoop

Apache Hadoop is an open-source software framework used for distributed storage and processing of large datasets across clusters of computers. Coursera's Apache Hadoop catalogue teaches you about the core concepts and components of this powerful framework. You'll learn about Hadoop's architecture, its key components like Hadoop Distributed File System (HDFS) and MapReduce, as well as advanced topics such as data ingestion with tools like Flume and Sqoop. You will also delve into data processing using Hive and Pig, and explore scalable machine learning algorithms. By mastering Apache Hadoop, you will be equipped to handle big data challenges, contributing to business insights and decision making.
27credentials
2online degrees
64courses

Filter by

Subject
Required

Language
Required

The language used throughout the course, in both instruction and assessments.

Learning Product
Required

Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.
Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.

Level
Required

Duration
Required

Subtitles
Required

Educator
Required

Explore the Hadoop Course Catalog

  • Status: Free Trial

    Skills you'll gain: PySpark, Apache Spark, Apache Hadoop, Data Pipelines, Big Data, Data Processing, Distributed Computing, Data Analysis Expressions (DAX), Data Integration, Data Transformation, SQL, Data Manipulation, Data Cleansing

  • Status: Free Trial

    Skills you'll gain: Data Warehousing, Google Cloud Platform, Big Data, Apache Spark, Data Integration, Dataflow, SQL, Data Pipelines, Metadata Management, Data Management, Real Time Data, Tensorflow, Data Science, Command-Line Interface, Applied Machine Learning, Cloud-Based Integration, Apache Hadoop, Data Mining, Query Languages, Machine Learning

  • Status: Free Trial

    Alibaba Cloud Academy

    Skills you'll gain: Relational Databases, Load Balancing, Data Visualization Software, Cloud Security, Network Security, Cloud Computing, Database Systems, Big Data, Database Management, General Networking, Apache Hadoop, Cloud Infrastructure, Cloud Services, Cloud Computing Architecture, Network Architecture, Apache Spark, Data Security, Servers, Apache Hive, Machine Learning

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Big Data, SPSS (Software), SPSS, Analytics, Real Time Data, Apache Hadoop, Data Processing, Business Analytics, Statistical Analysis, Data Analysis Software, Business Strategy, Market Share, Scalability, Machine Learning Algorithms

  • Status: Free Trial

    Skills you'll gain: Data Modeling, Data Transformation, Data Processing, Data Warehousing, Apache Hadoop, Extract, Transform, Load, Data Pipelines, Apache Spark, Feature Engineering, Data Mart, Star Schema, Data Integrity, Real Time Data, Machine Learning, Text Mining

  • Status: Free Trial

    Skills you'll gain: Data Storytelling, Data Visualization, Big Data, Data Visualization Software, Data Analysis, Dashboard, IBM Cognos Analytics, Statistical Analysis, Data Mining, Apache Hadoop, Data Collection, Tree Maps, Excel Formulas, Apache Hive, Data Science, Microsoft Excel, Data Warehousing, Data Quality, Data Cleansing, Scatter Plots

  • Status: Free Trial

    Skills you'll gain: Data Pipelines, Dataflow, Google Cloud Platform, Extract, Transform, Load, Data Processing, Apache Hive, Data Integration, Apache Spark, PySpark, Serverless Computing, Apache Hadoop, Big Data, Data Migration, Data Transformation, Performance Tuning

  • Skills you'll gain: Dataflow, Data Pipelines, Data Transformation, Extract, Transform, Load, Data Processing, Google Cloud Platform, Serverless Computing, Big Data, Apache Spark, Apache Hadoop, Cloud Storage, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: Apache Kafka, Apache Spark, Scala Programming, Real Time Data, Apache Hadoop, Apache Cassandra, Applied Machine Learning, Big Data, Data Processing, Application Deployment, Distributed Computing, Development Environment

  • Status: Preview

    Coursera Instructor Network

    Skills you'll gain: Big Data, Data Processing, Data Analysis, Analytics, Data Lakes, Data Warehousing, Apache Spark, Data Storage Technologies, Apache Hadoop, Real Time Data, Distributed Computing

  • Status: Free Trial

    Georgia Institute of Technology

    Skills you'll gain: Cloud Applications, Cloud Computing, Cloud Infrastructure, Distributed Computing, Virtualization, Data Store, Multi-Tenant Cloud Environments, Virtual Machines, Scalability, Apache Hadoop

  • Status: Free Trial

    Skills you'll gain: Data Pipelines, Extract, Transform, Load, Apache Airflow, Google Cloud Platform, Data Integration, Data Migration, Data Processing, Apache Hadoop, Serverless Computing, Apache Spark, Big Data, Data Transformation