From the course: Data Science Foundations: Fundamentals
Unlock this course with a free trial
Join today to access over 25,300 courses taught by industry experts.
Synthetic data and simulation environments
From the course: Data Science Foundations: Fundamentals
Synthetic data and simulation environments
- [Instructor] There's a quote that I've heard attributed to various people, but I like to think it came from Mae West. She's seen here with W. C. Fields in the 1940 movie, "My Little Chickadee." Mae was well known for being completely over the top. And the line that she may or may not have said, goes like this, "If some is good and more is better, then too much is just right." And again, maybe Mae West. Well, it turns out that in data science and AI, especially for AI, too much data is pretty much just right. That is the mechanisms that make AI possible, both predictive and especially generative AI require truly unprecedented monumental amounts of data, and there's never been a situation of having too much data. So they can start with the available datasets. There are wide collection of open datasets, including ones that were developed specifically for training generative AI. And that's particularly important when you��
Practice while you learn with exercise files
Download the files the instructor uses to teach the course. Follow along and learn by watching, listening and practicing.
Contents
-
-
-
-
-
-
Data preparation5m 26s
-
(Locked)
Labeling data for supervised learning8m 48s
-
(Locked)
In-house data5m 38s
-
(Locked)
Open data4m 15s
-
(Locked)
APIs2m 39s
-
(Locked)
Scraping data4m 45s
-
(Locked)
Synthetic data and simulation environments7m 12s
-
(Locked)
Passive collection of training data3m 57s
-
(Locked)
Data vendors5m 30s
-
(Locked)
New data from surveys and experiments5m 36s
-
(Locked)
Data ethics5m 14s
-
-
-
-
-
-
-