From the course: Python Data Analysis
Visualizing distributions
- [Instructor] We continue to analyze the income distributions in the US and in China by plotting them. Hans Rosling argues convincingly that the logarithm of daily income is the number that's really descriptive of the lifestyle available to a person anywhere in the world. So we compute that and plot it alongside yearly income. The summary statistics that we described in the last video are brought together visually in a box plot, a pandas plot of kind box. The box itself extends from the 25th to the 75th percentile, with a line at the median. The so-called whiskers have a more complicated definition. They extend to the minimum and maximum values in the dataset, but only if those do not stray too far from the 25th and 75th percentiles. Precisely, not more than one and a half times the interquartile range beyond the edges of the box. Values that do stray farther are considered outliers and they're plotted individually, and the whisker then stops at the most extreme value still within range. That's what we see in the US income data for some wealthy individuals. Remember, these are just…
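The whisker rule described above can be sketched in plain Python. This is a minimal illustration, not the course's own code: the `box_plot_stats` helper and the income figures are hypothetical, and the factor of 1.5 is assumed to match the default that matplotlib (which pandas uses for `plot(kind='box')`) applies through its `whis` parameter. The exact quartile values can differ slightly between libraries depending on the interpolation method.

```python
import math
import statistics


def box_plot_stats(values, whis=1.5):
    """Return the numbers a conventional box plot draws (hypothetical helper)."""
    data = sorted(values)
    # method="inclusive" interpolates linearly between the sorted data points.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1  # interquartile range: the height of the box
    low_fence = q1 - whis * iqr
    high_fence = q3 + whis * iqr
    # Whiskers end at the most extreme values that are still inside the fences.
    inside = [v for v in data if low_fence <= v <= high_fence]
    # Anything beyond the fences is drawn as an individual outlier point.
    outliers = [v for v in data if v < low_fence or v > high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": inside[0],
        "whisker_high": inside[-1],
        "outliers": outliers,
    }


# Hypothetical yearly incomes in dollars, invented for illustration only.
yearly_income = [2_000, 4_000, 6_000, 8_000, 10_000,
                 12_000, 14_000, 16_000, 200_000]

# Base-10 logarithm of daily income, the quantity the instructor plots
# alongside yearly income (assuming 365 days per year).
log10_daily_income = [math.log10(v / 365) for v in yearly_income]

stats = box_plot_stats(yearly_income)
print(stats)
```

With these made-up numbers, the single very high income falls beyond the upper fence, so it is reported as an outlier and the upper whisker stops at the next-largest value.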