From the course: Learning Amazon SageMaker AI

Solution: Prepare a dataset

(upbeat music) - [Instructor] How did you do? Let me show you how I solved this challenge. First, I needed to import the dataset. To do that, go to Data Wrangler: click Data, then Data Wrangler, and then Open in Canvas. Select Import and Prepare, choose Tabular data, and then pick the name of my S3 bucket and the name of the data file. Click Select dataset, then Next, and then Import. Once the dataset is loaded, take a moment to explore it by generating basic statistics like mean, median, and count for each column. We can do that by clicking Analysis, keeping the default analysis type, selecting incident as our target column, and setting the problem type to classification. We can leave it at the sample dataset and click Create. You'll see that it's generating dataset statistics: the number of features, the number of rows, 0% missing, 100% valid, 0% duplicate rows, and the count of numeric, categorical, date, time columns, et cetera. You can scroll down and see more about the feature summary. There's an estimated…
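For readers who want to see what those dataset statistics amount to outside the Canvas UI, here is a minimal Python sketch using only the standard library. It is not how Data Wrangler computes its report; the sample data, the `temperature` and `pressure` column names, and the `summarize` helper are all hypothetical, and the sketch assumes every column is numeric. Only the `incident` target column comes from the walkthrough.

```python
import csv
import io
from statistics import mean, median

# Hypothetical sample data. The real course file is not shown in the
# transcript; only the "incident" target column is named there.
SAMPLE_CSV = """temperature,pressure,incident
70.0,30.1,0
85.5,29.8,1
70.0,30.1,0
,31.0,1
"""


def summarize(csv_text):
    """Return row/column counts, missing and duplicate percentages,
    and count/mean/median per column (assumes numeric columns)."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    columns = list(rows[0].keys())

    # A cell counts as missing when it is an empty string.
    total_cells = len(rows) * len(columns)
    missing = sum(1 for r in rows for c in columns if r[c] == "")

    # Rows are duplicates when every cell matches an earlier row.
    unique_rows = {tuple(r[c] for c in columns) for r in rows}

    summary = {
        "rows": len(rows),
        "columns": len(columns),
        "missing_pct": 100.0 * missing / total_cells,
        "duplicate_pct": 100.0 * (len(rows) - len(unique_rows)) / len(rows),
        "per_column": {},
    }
    for c in columns:
        # Skip missing cells so they do not distort the statistics.
        values = [float(r[c]) for r in rows if r[c] != ""]
        summary["per_column"][c] = {
            "count": len(values),
            "mean": mean(values),
            "median": median(values),
        }
    return summary


if __name__ == "__main__":
    report = summarize(SAMPLE_CSV)
    print(report["rows"], "rows,", report["columns"], "columns")
    for name, stats in report["per_column"].items():
        print(name, stats)
```

Running the script prints the row and column counts followed by one line of statistics per column, which is a rough analogue of the summary the instructor scrolls through.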
