Data Lakes

Data lakes are centralized repositories that allow you to store all your structured and unstructured data at any scale. Coursera's Data Lakes catalog teaches you the principles of creating, managing, and using data lakes for big data analytics. You'll learn how to design and implement data lake infrastructure, manage data ingestion, storage, and consumption, and optimize data lake architecture for various analytics needs. You'll also learn about data privacy, security, and governance in data lakes. This skill is valuable whether you are a data scientist, data engineer, IT manager, or someone interested in big data analytics.
23 credentials
45 courses

Explore the Data Lakes Course Catalog

  • Status: Free Trial

    DeepLearning.AI

    Skills you'll gain: Data Storage, Query Languages, Data Lakes, File Systems, Database Systems, SQL, Databases, Data Architecture, Cloud Storage, Data Warehousing, Amazon Web Services, Amazon S3, Graph Theory, Performance Tuning

  • Status: Free Trial

    Skills you'll gain: Dataflow, Google Cloud Platform, Data Pipelines, Data Import/Export, Feature Engineering, Real Time Data, TensorFlow, Data Lakes, Apache Spark, Dashboard, Big Data, Data Warehousing, Applied Machine Learning, Data Management, Data Infrastructure, Cloud Engineering, Unstructured Data, Cloud Storage, MLOps (Machine Learning Operations), PySpark

  • Status: Free Trial

    Skills you'll gain: Data Pipelines, Real Time Data, Dataflow, Google Cloud Platform, Data Lakes, Data Import/Export, Data Warehousing, TensorFlow, Feature Engineering, Dashboard, Extract, Transform, Load, Cloud Infrastructure, Apache Spark, Big Data, Data Integration, Applied Machine Learning, Data Infrastructure, PySpark, Data Processing, Unstructured Data

  • Status: Preview

    Skills you'll gain: Power BI, Microsoft Azure, Data Lakes, Data Analysis Expressions (DAX), Azure Synapse Analytics, Data Modeling, Data Governance, Data Warehousing, Microsoft SQL Servers, Data Analysis, Extract, Transform, Load, Data Integration, Data Import/Export, Real Time Data, Data Transformation, Data Pipelines, Information Management, Role-Based Access Control (RBAC), Performance Tuning

  • Status: New, Free Trial

    University of California, Davis

    Skills you'll gain: Data Governance, Presentations, SQL, Apache Spark, Distributed Computing, Descriptive Statistics, Data Lakes, Data Storytelling, Peer Review, Exploratory Data Analysis, Data Quality, Data Pipelines, Databricks, JSON, Statistical Analysis, Data Modeling, Database Design, Data Analysis, Complex Problem Solving, Data Visualization

  • Status: New, Free Trial

    Skills you'll gain: Data Lakes, Microsoft Azure, Stored Procedure, Data Architecture, Performance Tuning, Data Management, Query Languages, Data Manipulation, Scripting, SQL, Data Processing, Windows PowerShell, Microsoft Visual Studio, Command-Line Interface, Heat Maps

  • Status: Free Trial

    Skills you'll gain: Digital Transformation, Google Cloud Platform, Big Data, Looker (Software), Cloud Computing, Machine Learning, Analytics, Data Management, Data Lakes, Data Storage, Unstructured Data, Business Intelligence, Data-Driven Decision-Making, Customer Experience Improvement, Artificial Intelligence

  • Skills you'll gain: Metadata Management, Data Pipelines, Data Processing, Google Cloud Platform, Data Migration, Cloud Storage, Apache Airflow, Data Lakes, Data Storage, Big Data, Data Infrastructure, Extract, Transform, Load, Apache Spark, IT Automation, Data Management, Data Transformation, Serverless Computing, SQL

  • Skills you'll gain: Databricks, Data Lakes, Data Pipelines, Data Integration, Dashboard, PySpark, SQL, Apache Spark, Data Management, Data Transformation, Version Control

  • Status: Free Trial

    Skills you'll gain: Dataflow, Real Time Data, Google Cloud Platform, Data Pipelines, Data Import/Export, Looker (Software), PySpark, Data Lakes, Data Warehousing, TensorFlow, Apache Spark, Dashboard, Data Processing, Big Data, Cloud Infrastructure, Data Infrastructure, Unstructured Data, Feature Engineering, Applied Machine Learning, Data Architecture

  • Status: New, Free Trial

    Skills you'll gain: Responsible AI, Azure Active Directory, Microsoft Azure, Data Lakes, Platform As A Service (PaaS), Cloud Computing, Data Integration, Relational Databases, AI Personalization, Cloud Applications, Cloud Development, Power BI, Performance Tuning, Microsoft Visual Studio, Anomaly Detection, Database Administration, Scalability, Cloud Platforms, Cloud Services, Cloud Management

  • Status: Free Trial

    Duke University

    Skills you'll gain: Databricks, Generative AI, Data Lakes, Extract, Transform, Load, MLOps (Machine Learning Operations), Data Transformation, LLM Application, Data Pipelines, Large Language Modeling, Apache Spark, Responsible AI, Data Analysis, Data Science, CI/CD, Machine Learning