From the course: Complete Guide to Data Lakes and Lakehouses

Unlock the full course today

Join today to access over 25,200 courses taught by industry experts.

Metadata management

Metadata management

- [Instructor] Metadata is often described as data about data. Now, you may be wondering, isn't managing the data lake enough? Do we really need to have a strategy for keeping data about our data? Absolutely yes! Metadata enhances the way we discover and use data, and also plays a role in data governance and compliance. Let's discover why. Metadata is used to help us understand the origin, context, and meaning of data. Properly managed metadata makes data access easier, improves quality, ensures compliance, and supports data governance initiatives. So what are the different types of metadata we can use in our data lake? At the lowest level, we have technical metadata, which includes information about data formats, structures, and schemas. It helps technical users, like analytics engineers or data scientists understand how to access and use the data correctly. We can also have business metadata, which provides a higher-level context about the data, such as ownership, business terms…

Contents