From the course: Learning Data Science: Understanding the Basics
Unlock the full course today
Join today to access over 25,100 courses taught by industry experts.
Collect unstructured data
From the course: Learning Data Science: Understanding the Basics
Collect unstructured data
- We've gone through a lot, so let's recap a little. In general your data science teams will work with three different data types. There's your structured data. That's the data that's most like the data in your spreadsheet. It has a set order and a consistent format. It's usually stored in a relational database. Then there's your semistructured data. That's the data with some structure, but there's added flexibility to change some of the field names. Finally there's the most popular type of data, there's everything else, it's the unstructured data. Some analysts estimate that 80% of your data is unstructured. When you think about it this makes a lot of sense. Think about the data you create every day. Every time you leave a voice mail. Every picture you upload to Facebook. The Microsoft Word memo you created at work or the PowerPoint presentation. Even when you search their web, it's mostly unstructured. That search for cats will bring up videos, songs, books, and even music. So what…