From the course: Python Data Analysis

Visualizing distributions

- [Instructor] We continue to analyze the income distributions in the US and in China by plotting them. Hans Rosling argues convincingly that the logarithm of daily income is the number that's really descriptive of the lifestyle available to a person anywhere in the world. So we compute that and plot it alongside yearly income. The summary statistics that we described in the last video are brought together visually in a box plot, a pandas plot of kind box. The box itself extends from the 25th to the 75th percentile, with a line at the median. The so-called whiskers have a more complicated definition. They reach the minimum and maximum values in the dataset, but only if those do not stray too far from the 25th and 75th percentiles: precisely, not more than one and a half times the interquartile range (the distance between those two percentiles) beyond the edges of the box. Values that do stray farther are considered outliers and are plotted individually, and the whisker then stops at the most extreme value still within range. That's what we see in the US income data for some wealthy individuals. Remember, these are just…
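As a rough illustration of the box plot rules described above, here is a minimal sketch, not the course's own notebook. The column names, the helper functions, and the sample values are all made up for this example. It assumes yearly income is converted to daily income by dividing by 365 and that the logarithm is base 10. The whisker rule mirrors the usual 1.5 × interquartile range convention.

```python
import numpy as np
import pandas as pd


def log_daily_income(yearly_income: pd.Series) -> pd.Series:
    """Base-10 log of daily income (assumes 365 days per year)."""
    return np.log10(yearly_income / 365)


def whisker_bounds(values: pd.Series, whis: float = 1.5):
    """Return (lower whisker, upper whisker, outliers) for a box plot."""
    q1 = values.quantile(0.25)           # bottom edge of the box
    q3 = values.quantile(0.75)           # top edge of the box
    iqr = q3 - q1                        # interquartile range
    low_fence = q1 - whis * iqr
    high_fence = q3 + whis * iqr
    # Whiskers stop at the most extreme values still inside the fences.
    inside = values[(values >= low_fence) & (values <= high_fence)]
    # Anything beyond the fences is drawn as an individual point.
    outliers = values[(values < low_fence) | (values > high_fence)]
    return inside.min(), inside.max(), outliers


def plot_income_boxes(df: pd.DataFrame):
    """Draw one box plot per column (requires matplotlib)."""
    return df.plot(kind="box", subplots=True)


# Made-up values, only to exercise the whisker rule.
sample = pd.Series([1, 2, 3, 4, 5, 6, 7, 8, 100], name="income")
low, high, outliers = whisker_bounds(sample)
```

With these sample values the box runs from 3 to 7, the whiskers reach 1 and 8, and 100 is reported as an outlier. For a DataFrame holding both a yearly income column and its log daily income column, `plot_income_boxes(df)` would draw the two box plots side by side.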
