Data Lakes

Data lakes are centralized repositories that let you store structured and unstructured data at any scale. Coursera's Data Lakes catalog teaches the principles of creating, managing, and using data lakes for big data analytics. You'll learn how to design and implement data lake infrastructure, manage data ingestion, storage, and consumption, and optimize data lake architecture for different analytics needs. You'll also cover data privacy, security, and governance in data lakes. These skills are valuable whether you are a data scientist, data engineer, IT manager, or someone interested in big data analytics.
23 credentials
45 courses

Explore the Data Lakes Course Catalog

  • Status: Free Trial

    Skills you'll gain: Google Cloud Platform, Real Time Data, Data Pipelines, Dataflow, Tensorflow, Cloud Engineering, Data Lakes, Big Data, Dashboard, Cloud Infrastructure, Apache Spark, Data Infrastructure, Unstructured Data, Applied Machine Learning, Data Warehousing, Extract, Transform, Load, MLOps (Machine Learning Operations), Data Processing, PySpark, Cloud Storage

  • Status: Preview

    Skills you'll gain: Power BI, Microsoft Azure, Data Lakes, Data Analysis Expressions (DAX), Azure Synapse Analytics, Data Modeling, Data Governance, Data Warehousing, Microsoft SQL Servers, Data Analysis, Extract, Transform, Load, Data Integration, Data Import/Export, Real Time Data, Data Transformation, Data Pipelines, Information Management, Role-Based Access Control (RBAC), Performance Tuning

  • Status: New, Free Trial

    Skills you'll gain: Data Integration, Data Pipelines, Data Lakes, Microsoft Azure, Azure Synapse Analytics, Performance Tuning, Data Transformation, Data Validation, Data Storage Technologies, SQL, Data Storage, Cloud Storage

  • Status: Free Trial

    Skills you'll gain: Dataflow, Google Cloud Platform, Data Pipelines, Data Lakes, MLOps (Machine Learning Operations), Data Warehousing, Real Time Data, Data Processing, Data Management, Data Infrastructure, Cloud Engineering, Unstructured Data, Cloud Storage, Systems Design, Tensorflow, Big Data, Cloud Infrastructure, Data Visualization, Applied Machine Learning, Extract, Transform, Load

  • Status: Free Trial

    Skills you'll gain: Dataflow, Real Time Data, Google Cloud Platform, Data Pipelines, Data Import/Export, Looker (Software), PySpark, Data Lakes, Data Warehousing, Tensorflow, Apache Spark, Dashboard, Data Processing, Big Data, Cloud Infrastructure, Data Infrastructure, Unstructured Data, Feature Engineering, Applied Machine Learning, Data Architecture

  • Status: Free Trial

    Skills you'll gain: AWS SageMaker, AWS Kinesis, Data Integration, Data Lakes, Business Intelligence, Apache Hive, Apache Spark, Amazon Web Services, Extract, Transform, Load, Big Data, Apache Hadoop, Real Time Data, Applied Machine Learning, Data Pipelines, Data Processing, Serverless Computing

  • Status: Free Trial

    Skills you'll gain: Data Pipelines, Real Time Data, Dataflow, Google Cloud Platform, Data Lakes, Data Import/Export, Data Warehousing, Tensorflow, Feature Engineering, Dashboard, Extract, Transform, Load, Cloud Infrastructure, Apache Spark, Big Data, Data Integration, Applied Machine Learning, Data Infrastructure, PySpark, Data Processing, Unstructured Data

  • Status: Preview

    Coursera Instructor Network

    Skills you'll gain: Big Data, Data Processing, Data Analysis, Analytics, Data Lakes, Data Warehousing, Apache Spark, Data Storage Technologies, Apache Hadoop, Real Time Data, Distributed Computing

  • Status: Free Trial

    DeepLearning.AI

    Skills you'll gain: Data Storage, Query Languages, Data Lakes, File Systems, Database Systems, SQL, Databases, Data Architecture, Cloud Storage, Data Warehousing, Amazon Web Services, Amazon S3, Graph Theory, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: Business Process, Business Process Modeling, Google Cloud Platform, Real Time Data, Data Processing, Data Pipelines, Systems Design, Dataflow, Cloud Engineering, Data Infrastructure, Data Quality, Data Lakes, Data Modeling, Big Data, Dashboard, Cloud Infrastructure, Data Management, Data Warehousing, MLOps (Machine Learning Operations), Applied Machine Learning

  • Skills you'll gain: Data Lakes, File Management, Microsoft Azure, Information Management, Data Management, Data Storage Technologies

  • Status: Free Trial

    Skills you'll gain: Databricks, Data Governance, Microsoft Azure, Data Lakes, Real Time Data, Data Management, Data Integration, Data Pipelines, Data Quality, User Provisioning, Performance Tuning