From the course: Complete Guide to Data Lakes and Lakehouses


Data lake vs. data warehouse

- By now, I have mentioned data warehouses a couple of times, so you may be wondering, what exactly are the differences between a data lake and a data warehouse? Let's clarify that.

First, let's formally define what a data warehouse is. A data warehouse is a system used for reporting and data analysis, and it's a core component of business intelligence. It stores current and historical data in a single place. This data is typically loaded from transactional systems and other relational databases, and it is structured in a way that makes it easy to query and to generate reports.

Now, let's review some of the key architectural differences between data lakes and data warehouses. In terms of data structure, data warehouses are highly structured and use a defined schema to organize data into tables, rows, and columns, which enables fast access through SQL queries. This structure is ideal for complex queries and generating reports, as I mentioned before. Data lakes, as mentioned previously, store…
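To make the warehouse side of that comparison concrete, here is a minimal sketch of a defined schema and a reporting query. It uses Python's built-in `sqlite3` module as a small stand-in for a real warehouse engine, and the `sales` table, its columns, and the sample rows are hypothetical, chosen only for illustration.

```python
import sqlite3

# In-memory SQLite database as a stand-in for a warehouse (an assumption;
# the course does not name a specific engine here).
conn = sqlite3.connect(":memory:")

# The schema is defined up front: a table with typed, named columns.
# Table and column names are hypothetical.
conn.execute(
    """
    CREATE TABLE sales (
        order_id   INTEGER PRIMARY KEY,
        region     TEXT NOT NULL,
        order_date TEXT NOT NULL,
        amount     REAL NOT NULL
    )
    """
)

# Current and historical rows live in the same table.
rows = [
    (1, "EMEA", "2023-01-15", 120.0),
    (2, "EMEA", "2024-02-03", 80.0),
    (3, "APAC", "2024-02-10", 200.0),
]
conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", rows)

# Because the structure is known, a report is a plain SQL query.
report = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(report)

# Data that does not fit the schema is rejected at write time.
try:
    conn.execute("INSERT INTO sales VALUES (4, NULL, '2024-03-01', 50.0)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True
print("row without region rejected:", rejected)
```

The point of the sketch is only that the schema exists before any data is loaded, which is what lets the query and the rejection of the malformed row work the way they do.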