From the course: Complete Guide to Data Lakes and Lakehouses
Unlock the full course today
Join today to access over 25,200 courses taught by industry experts.
Metadata management
From the course: Complete Guide to Data Lakes and Lakehouses
Metadata management
- [Instructor] Metadata is often described as data about data. Now, you may be wondering, isn't managing the data lake enough? Do we really need to have a strategy for keeping data about our data? Absolutely yes! Metadata enhances the way we discover and use data, and also plays a role in data governance and compliance. Let's discover why. Metadata is used to help us understand the origin, context, and meaning of data. Properly managed metadata makes data access easier, improves quality, ensures compliance, and supports data governance initiatives. So what are the different types of metadata we can use in our data lake? At the lowest level, we have technical metadata, which includes information about data formats, structures, and schemas. It helps technical users, like analytics engineers or data scientists understand how to access and use the data correctly. We can also have business metadata, which provides a higher-level context about the data, such as ownership, business terms…