From the course: Data Warehousing on Google Cloud Platform

Unlock this course with a free trial

Join today to access over 25,300 courses taught by industry experts.

Data load options

Data load options

- [Trainer] Let's discuss a few data processing services that help support the data warehouse solution. Cloud data flow helps define batch and streaming data pipelines. It's a fully managed service that helps process and transform the raw data and writes into BigQuery, as well as other data syncs. Data can flow from several sources in real time. You can even create the pipelines using SQL within the BigQuery Web UI. Dataproc is another managed service that supports ETL processing for big data. You can provision clusters and jobs within open source services like Hadoop and Spark. You can also use Dataproc to move on-premise, open source solutions to the cloud. Dataproc also integrates with BigQuery to support the flow from open source data solutions to data warehouse and business intelligence. When I think of the entire data solution for organizations, solutions like Dataplex encompasses that, Data lakes, data warehouses, and data marts that support different users from analytics and…

Contents