Python for Data Science - Assignment 2

Python for Data Science - Assignment 2

Today I am going to share my second assignment of Python on E-Commerce Dataset.

While solving this assignment I faced lot of issues like some of the syntaxes were throwing error, some of the syntaxes I used for the first time so I took help from my Bard Infinity Coach, Google, Kaggle, Data World, W3School website.

Lets talk about my dataset which consist of E-Commerce data so i used different libraries of Python like 'Numpy', 'Pandas', 'Matplotlib', 'Seaborn' etc. This problem statement mostly covers different kind of charts.

First I loaded the dataset into Python for overviewing

No alt text provided for this image

I calculated Sales by multiplying Quantity and Unit Price

No alt text provided for this image

To perform EDA (Exploratory Data Analysis) I calculate outliers by using Boxplot

No alt text provided for this image


I used Histogram for all numerical variable

No alt text provided for this image
Line Chart
Bar Plot
Cat Plot

Then going forward i calculated Minimum Quantity, Maximum Quantity and Total Quantity by using aggregation and groupby function same for Sales also.

No alt text provided for this image
No alt text provided for this image

I extracted all the unique values across the dataset

No alt text provided for this image

I extracted all the duplicate values across the dataset

No alt text provided for this image

Correlation - Heatmap = All numeric variables

No alt text provided for this image

Regression plot = All numerical variables

No alt text provided for this image
Barplot - categorical variable vs numerical varibles
Pairplot = All Numerical variables
Lineplot

Then I Added the columns - Month, Day and Hour for the invoice 

No alt text provided for this image

TOP 5 customers with higher number of orders

No alt text provided for this image

How much money spent by the customers

No alt text provided for this image

TOP 5 customers with highest money spent

No alt text provided for this image

How many orders per month?

No alt text provided for this image

How many orders per day? 

No alt text provided for this image

How many orders per hour?

No alt text provided for this image

How many orders for each country?

No alt text provided for this image

Thank You !!!

Good job, Abhishek. I would like to suggest couple of things for you to try, as it can help you to bring more insights during data exploration. During EDA do check the distribution of numerical features to check if it's normally distributed or skewed. And if it skewed or not following normal distribution what sort of transformation you can try. :)

To view or add a comment, sign in

More articles by Abhishek Kad

  • Capstone Project on Customer Subscription Prediction

    Lest discuss my first capstone project The dataset I have taken from Kaggle website, I have built 5 different models…

    2 Comments
  • Dashboard on Customer Subscription Prediction

    Today I am going to share my Tableau project on Customer Subscription Prediction The dataset for above project i have…

    3 Comments
  • Machine Learning - Assignment 2 - Unsupervised data

    Today I want to discuss my second assignment of ML where I chosen dataset related Mall Customers. this is Unsupervised…

    1 Comment
  • Machine Learning - Assignment 1

    Machine learning (ML) is the study of computer algorithms that can improve automatically through experience and by the…

  • Python for Data Science - Assignment 1

    Python is an programming language used for Machine Learning, Data Analysis, Web development, Game Development etc. Let…

  • Mathematics and Statistics for Data Science

    Mathematics is very important in the field of data science as concepts within mathematics aid in identifying patterns…

  • SQL (Structured Query Language)

    SQL is Structured Query Language, which is a computer language for storing, manipulating and retrieving data stored in…

  • Computer Science for Business Professionals

    Learning Objectives 1. Programming Languages - - It is the language through which we communicate with computer.

  • Introduction to Computer Science

    **Computer Science** Definition - Basics of computer and study of computer. In this field computer and its appliances…

    2 Comments

Others also viewed

Explore content categories