From the course: Complete Guide to Data Lakes and Lakehouses

Storage solutions: S3, GCS, Azure Blob Storage, and HDFS

- [Instructor] When it comes to storage solutions for data lakes, Amazon S3, Google Cloud Storage, and Azure Blob Storage lead the market for cloud-based options. For on-premises setups, Hadoop HDFS is a popular choice. Let's explore these technologies in detail, starting with the most popular solution for data lakes, Amazon S3. S3 is an object storage service that offers industry-leading durability, availability, and scalability. It is designed to store and retrieve any amount of data from anywhere on the web. S3 provides comprehensive security and compliance capabilities that meet even the strictest regulatory requirements. It supports data lifecycle management and automated archiving, which makes it highly cost-effective for long-term data storage. It is a good fit for companies of all sizes that require high availability and robust data protection. It is commonly used for web hosting, data archives, disaster recovery, and, of course, the storage layer of data lakes.…
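
To make the lifecycle management and automated archiving point concrete, here is a minimal sketch of what such a rule could look like in Python. It is not taken from the course. The prefix `raw/`, the day thresholds, and the helper names `build_lifecycle_rules` and `apply_lifecycle` are all hypothetical, chosen only for illustration. The dictionary follows the shape that the AWS SDK for Python (boto3) expects for `put_bucket_lifecycle_configuration`, and applying it assumes boto3 is installed and AWS credentials are configured.

```python
import json


def build_lifecycle_rules(prefix, ia_after_days=30, archive_after_days=90,
                          expire_after_days=None):
    """Build an S3 lifecycle configuration for objects under `prefix`.

    Objects move to an infrequent-access class, then to an archive class,
    and are optionally deleted. All thresholds are illustrative assumptions.
    """
    if not 0 < ia_after_days < archive_after_days:
        raise ValueError("expected 0 < ia_after_days < archive_after_days")

    rule = {
        "ID": f"tier-{prefix.strip('/') or 'all'}",
        "Filter": {"Prefix": prefix},
        "Status": "Enabled",
        "Transitions": [
            {"Days": ia_after_days, "StorageClass": "STANDARD_IA"},
            {"Days": archive_after_days, "StorageClass": "GLACIER"},
        ],
    }

    if expire_after_days is not None:
        if expire_after_days <= archive_after_days:
            raise ValueError("expiration must come after the archive transition")
        rule["Expiration"] = {"Days": expire_after_days}

    return {"Rules": [rule]}


def apply_lifecycle(bucket, config):
    """Send the configuration to S3 (needs boto3 and AWS credentials)."""
    import boto3  # third-party; imported here so the builder works without it

    s3 = boto3.client("s3")
    s3.put_bucket_lifecycle_configuration(
        Bucket=bucket, LifecycleConfiguration=config
    )


if __name__ == "__main__":
    # Print the configuration instead of applying it to a real bucket.
    print(json.dumps(build_lifecycle_rules("raw/", expire_after_days=365), indent=2))
```

In this sketch the configuration is built separately from the call that applies it, so the rule can be reviewed or kept in version control before it touches a bucket. Calling `apply_lifecycle("my-data-lake-bucket", config)` would apply it, where the bucket name is a placeholder.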
