From the course: Complete Guide to Data Lakes and Lakehouses
Unlock the full course today
Join today to access over 25,200 courses taught by industry experts.
Self-service data platforms: Dremio and Starburst
From the course: Complete Guide to Data Lakes and Lakehouses
Self-service data platforms: Dremio and Starburst
- [Instructor] Dremio and Starburst represent two sophisticated data lakehouse platforms that make self-service possible, and enhance how you can access and analyze data across diverse sources. Let's see what makes this platform so interesting. Dremio is a unified lakehouse platform for self-service analytics and AI that optimizes data querying across different data sources. It does this by minimizing data movement and maximizing query performance through advanced caching and optimization techniques. By the way, did I mention that we will use Dremio in our capstone project? Don't worry, we are almost there. These are some of the nicest features of Dremio. It uses Apache Arrow for in-memory data processing to speed up data queries, making it ideal for high-performance analytics. Dremio uses data reflections to create optimized representations of data queries, which accelerates subsequent analysis without the need for additional processing power. It also offers a logical abstraction…
Contents
-
-
-
-
-
-
-
-
-
Unified analytics platforms: Databricks and Snowflake3m 14s
-
(Locked)
Cloud data warehouses: BigQuery, Azure Synapse, and Redshift3m 11s
-
(Locked)
Self-service data platforms: Dremio and Starburst3m 29s
-
(Locked)
Interactive notebooks: Jupyter, Zeppelin, Databricks3m 55s
-
(Locked)
BI tools: Tableau, Power BI, Superset, Metabase3m 58s
-
(Locked)
APIs and services for data consumption3m 25s
-
-
-
-
-