From the course: Cloud Data, Platform, and Applications Security by Pearson

Data discovery

A big part of data discovery is knowing what types of data you might have. What are you going to discover? On the exam, you want to know the different types of data you're going to discover.

So we have structured data. Structured data uses a predefined data model, or schema, so it can usually be analyzed directly in a tabular format like you'd see in a spreadsheet. There's a relationship between rows and columns, and each row is what we call a tuple. So think of data stored in Excel spreadsheets, CSV files, relational databases, and data warehouses. That's structured data.

Unstructured data is information that doesn't have a predefined data model. It's not organized in a predefined way. We often think of this as content or blob data. For example, in Azure, the service for it is actually called Azure Blob Storage. Think of audio files, video files, graphic files, and archived PDF files, for example.

A new term on the updated exam is semi-structured data. Semi-structured data doesn't conform to the formal structure of data models, like SQL for example, or…
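To make the three categories above a little more concrete, here is a minimal Python sketch. It is only an illustration under assumptions: the sample records, the use of JSON as the semi-structured example, and the helper name `same_schema` are all hypothetical and do not come from the course. The idea is that structured rows all share one set of columns, semi-structured records describe themselves but can differ from one record to the next, and unstructured content is just bytes with no data model at all.

```python
import csv
import io
import json

# Structured: every row follows the same predefined schema (the header row).
csv_text = "id,name,email\n1,Ada,ada@example.com\n2,Lin,lin@example.com\n"
rows = list(csv.DictReader(io.StringIO(csv_text)))
structured_keys = [set(row.keys()) for row in rows]

# Semi-structured (JSON is an assumed example): each record labels its own
# fields, but the fields can vary and nest from one record to the next.
json_text = (
    '[{"id": 1, "name": "Ada"},'
    ' {"id": 2, "tags": ["vip"], "address": {"city": "Oslo"}}]'
)
records = json.loads(json_text)
semi_structured_keys = [set(record.keys()) for record in records]

# Unstructured: opaque content (blob data) with no data model to query.
blob = b"%PDF-1.7 binary content would follow here"


def same_schema(key_sets):
    """Return True if every record exposes exactly the same set of fields."""
    return all(keys == key_sets[0] for keys in key_sets)


print("structured rows share one schema:", same_schema(structured_keys))
print("semi-structured records share one schema:", same_schema(semi_structured_keys))
print("unstructured blob is just bytes:", isinstance(blob, bytes))
```

In a real discovery exercise the same distinction tends to drive tool choice: schema-aware scans for structured stores, key or tag inspection for semi-structured records, and content inspection for blobs.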
