From the course: Complete Guide to Generative AI for Data Analysis and Data Science

Unlock the full course today

Join today to access over 24,500 courses taught by industry experts.

Distributions of data

Distributions of data

- [Instructor] The distribution of data is an important concept in statistics. And distribution, when we say a distribution, what we're talking about is a property of a variable and a dataset. And in particular we're talking about the frequency with which different values occur in the dataset for that particular variable. And it's important to understand that different variables and different data sets can have different distributions. But we think of when we say distribution, we're talking about things, well, what's the central tendency? What's the dispersion or spread in the data? And when we talk about different central tendencies, like the different mean or median, and what maybe the standard deviation is, or the range, when we talk about those things, we're talking about properties of all the different values that can be taken on. And it's important to understand these because we're trying to understand patterns and anomalies in the data. And having a sense of what type of a…

Contents