Apache Hadoop

Apache Hadoop is an open-source software framework for distributed storage and processing of large datasets across clusters of computers. Coursera's Apache Hadoop catalogue covers the framework's core concepts and components. You'll learn about Hadoop's architecture and its key components, such as the Hadoop Distributed File System (HDFS) and MapReduce, as well as data ingestion with tools like Flume and Sqoop. You will also work with data processing in Hive and Pig and explore scalable machine learning algorithms. With these skills, you will be equipped to handle big data challenges and to contribute to business insights and decision-making.
27 credentials
1 online degree
63 courses

Results for "apache hadoop"

  • Status: Free Trial

    Skills you'll gain: Data Store, Extract, Transform, Load, Data Architecture, Data Pipelines, Big Data, Data Warehousing, Data Governance, Apache Hadoop, Relational Databases, Apache Spark, Data Lakes, Databases, SQL, NoSQL, Data Security, Data Science

  • Status: Free Trial

    Skills you'll gain: Big Data, Data Analysis, Statistical Analysis, Apache Hadoop, Data Wrangling, Apache Hive, Data Collection, Data Mart, Data Science, Data Warehousing, Data Visualization, Analytics, Data Cleansing, Apache Spark, Data Lakes, Data Visualization Software, Microsoft Excel

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Machine Learning, Generative AI, PySpark, Applied Machine Learning, Supervised Learning, Apache Hadoop, Data Pipelines, Unsupervised Learning, Feature Engineering, Data Processing, Extract, Transform, Load, Predictive Modeling, Data Transformation, Regression Analysis

  • Status: Free Trial

    Skills you'll gain: Database Design, SQL, Apache Hive, Relational Databases, Databases, Database Management, Big Data, Database Systems, MySQL, Data Management, Amazon S3, Apache Hadoop, Data Storage, Operational Databases, Data Warehousing, Cloud Storage, Performance Tuning, File Systems, PostgreSQL, Data Analysis

  • Status: New
    Status: Free Trial

    Skills you'll gain: Apache Spark, Apache Hadoop, Data Lakes, Big Data, Linux Commands, Linux, File Systems, Data Management, Command-Line Interface, Data Processing, Software Installation, Distributed Computing, System Configuration

  • Status: Free Trial

    Skills you'll gain: Apache Spark, Apache Hadoop, Real Time Data, Apache Hive, Apache Kafka, Big Data, Distributed Computing, Data Processing, Databases, MongoDB, NoSQL, System Design and Implementation, SQL, Scalability

  • Status: Free Trial

    Skills you'll gain: Feature Engineering, PySpark, Data Import/Export, Apache Spark, Dashboard, Cloud Services, Applied Machine Learning, Apache Hive, Application Programming Interface (API), Jupyter, Big Data, Artificial Intelligence and Machine Learning (AI/ML), Query Languages, Apache Hadoop, Serverless Computing, Application Deployment, Looker (Software), Cloud Computing, Scalability, SQL

  • Status: Free Trial

    École Polytechnique Fédérale de Lausanne

    Skills you'll gain: Apache Spark, Apache Hadoop, Scala Programming, Distributed Computing, Big Data, Data Manipulation, Data Processing, Performance Tuning, Data Transformation, SQL, Data Analysis

  • Status: Free Trial

    Skills you'll gain: PySpark, Snowflake Schema, Databricks, Data Pipelines, Apache Spark, MLOps (Machine Learning Operations), Apache Hadoop, Big Data, Data Warehousing, Data Quality, Data Integration, Data Processing, DevOps, Data Transformation, SQL, Python Programming

  • Status: Free Trial

    University of California San Diego

    Skills you'll gain: Big Data, Apache Hadoop, Scalability, Data Processing, Data Science, Distributed Computing, Unstructured Data, Data Infrastructure, Data Analysis

  • Status: New
    Status: Free Trial

    Skills you'll gain: PySpark, Apache Hadoop, Apache Spark, Big Data, Apache Hive, Analytics, Data Processing, Data Mapping, Text Mining, Distributed Computing, Java, Debugging, Java Programming

  • Status: New
    Status: Preview

    Skills you'll gain: Apache Cassandra, Big Data, NoSQL, Apache Hadoop, Virtual Machines, Apache, Analytics, Data Storage, Databases, Database Architecture and Administration, Data Management, Data Architecture, Scalability