From the course: Fundamentals of Data Transformation for Data Engineering

Wrangling unstructured data

- [Narrator] In this section, we're going to talk about how to wrangle unstructured data. That means taking data that might not be in a perfect tabular format, transforming it into a more approachable format, and then saving it somewhere so we can access it again. We'll be using CTEs (common table expressions, which we'll talk about in a bit), aliases, and CASE statements. We'll also preview some concepts like Boolean logic, COALESCE, NOT IN, ALL, and a few other forms of filtering.

But to get started, we'll load up our database and jump right into it. We'll be taking a look at the parks dataset for this exercise. If we select the top three rows, we can see a preview of the dataset. We need to scroll horizontally to see all the columns. There are quite a few, but right off the bat, we can notice there are some interesting things going on. We'll note that certain columns appear to have lists of JSON, which is a type of data, JSON…
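To make the ideas named above a bit more concrete, here is a minimal sketch of a CTE, aliases, a CASE statement, COALESCE, and NOT IN working together. It is not the course's actual exercise: the `parks` table, its columns (`name`, `acres`, `features`), the sample rows, and the size thresholds are all made up for illustration, and it runs against an in-memory SQLite database rather than the course database.

```python
import json
import sqlite3

# Hypothetical stand-in for the parks dataset; table and column names are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE parks (name TEXT, acres REAL, features TEXT)")
conn.executemany(
    "INSERT INTO parks VALUES (?, ?, ?)",
    [
        ("Cedar Grove", 120.0, '["trail", "lake"]'),  # features stored as a JSON list
        ("Mill Pond", None, '["playground"]'),        # missing acreage
        ("Old Quarry", 8.5, None),                    # missing features
        ("Test Park", 1.0, "[]"),                     # row we filter out below
    ],
)

query = """
WITH cleaned AS (                              -- CTE: a named intermediate result
    SELECT
        name AS park_name,                     -- alias
        COALESCE(acres, 0) AS acres,           -- replace NULL with a default
        COALESCE(features, '[]') AS features   -- empty JSON list when missing
    FROM parks
)
SELECT
    park_name,
    CASE
        WHEN acres >= 100 THEN 'large'
        WHEN acres > 0 THEN 'small'
        ELSE 'unknown'
    END AS size_class,
    features
FROM cleaned
WHERE park_name NOT IN ('Test Park')           -- filter by exclusion
ORDER BY park_name
LIMIT 3                                        -- preview the top three rows
"""

# Parse the JSON text in each row into a Python list.
rows = [
    (park_name, size_class, json.loads(features))
    for park_name, size_class, features in conn.execute(query)
]

for row in rows:
    print(row)
```

The CTE keeps the cleanup step (defaults for missing values, friendlier column names) separate from the classification step, which is one common reason to reach for it when the source data is messy.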
