From the course: Fundamentals of Data Transformation for Data Engineering
Wrangling unstructured data - SQL Tutorial
Wrangling unstructured data
- [Narrator] In this section, we're going to talk about how to wrangle unstructured data. That means taking data that might not be in a perfect tabular format, transforming it into a more approachable format, and then saving it somewhere so we can access it again. We'll be using CTEs (common table expressions), which we'll talk about in a bit, along with aliases and CASE statements. We'll also preview some concepts like Boolean logic, COALESCE, NOT IN, ALL, and a few other forms of filtering. To get started, we'll load up our database and jump right in. We'll be taking a look at the parks dataset for this exercise. If we select the top three rows, we can see a preview of the dataset. We need to scroll horizontally to see all the columns. There are quite a few, but off the bat, we can notice there are some interesting things going on. Certain columns appear to have lists of JSON, which is a type of data, JSON…
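The ideas the narrator lists (previewing a few rows, a CTE, aliases, a CASE statement, COALESCE, and a NOT IN filter) can be sketched in a short example. This is only an illustration under stated assumptions, not the course's own code: it uses Python's built-in sqlite3 module instead of DuckDB so it runs without extra installs, and the parks table, its columns, and its rows are all hypothetical, since the transcript does not show the real schema.

```python
import sqlite3

# Hypothetical stand-in for the course's parks dataset; the real columns
# are not shown in the transcript. SQLite is used here in place of DuckDB.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE parks (name TEXT, state TEXT, acres REAL, activities TEXT)")
conn.executemany(
    "INSERT INTO parks VALUES (?, ?, ?, ?)",
    [
        ("Acadia", "ME", 49075.0, '["hiking", "biking"]'),
        ("Zion", "UT", 147242.0, '["hiking"]'),
        ("Unnamed Site", None, None, "[]"),
        ("Hot Springs", "AR", 5554.0, '["bathing"]'),
    ],
)

# Preview three rows, similar to the narrator's first step.
preview = conn.execute("SELECT * FROM parks LIMIT 3").fetchall()

query = """
WITH cleaned AS (                               -- CTE: a named intermediate result
    SELECT
        name AS park_name,                      -- alias for a friendlier column name
        COALESCE(state, 'unknown') AS state,    -- replace NULL with a default value
        CASE                                    -- CASE: bucket parks by acreage
            WHEN acres IS NULL THEN 'unknown'
            WHEN acres >= 100000 THEN 'large'
            ELSE 'small'
        END AS size_class
    FROM parks
)
SELECT park_name, state, size_class
FROM cleaned
WHERE state NOT IN ('AR')                       -- NOT IN: exclude listed values
ORDER BY park_name
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

The CTE keeps the cleanup logic in one named step, so the final SELECT only has to filter and sort. The same query shape should carry over to DuckDB with little or no change, though that is an assumption rather than something the transcript confirms.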
Contents
- DuckDB basics and query structure (4m 2s)
- Wrangling unstructured data (10m 30s)
- Joins and comparisons (10m 19s)
- Aggregations (10m 16s)
- Windows functions: A quick refresher (5m 41s)
- Window functions (10m 15s)
- Advanced filters (3m 9s)
- Advanced joins (8m 12s)
- Lambdas and UDFs (10m 3s)
- Data generation (4m 52s)
- SQL challenge (2m 37s)
- SQL solution (13m 8s)