E Commerce

The document analyzes shipping data from an e-commerce company to understand customer preferences and trends. The analysis finds that most shipments are sent by ship rather than road or flight, and that females do more online shopping than males. The highest average customer rating for products is 3 out of 5. Linear regression and data visualization techniques are used to analyze relationships between variables like shipment method and customer ratings, gender and ratings, and product cost versus discounts. The data is split into test and training sets for model validation.

Uploaded by

Huda waseem

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views

E Commerce

Uploaded by

Huda waseem

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

E-Commerce

Shipping Data
Introduction
We have selected a e commerce shipping data to know what kind of
shipping mode is preferred. who buys more males or females or the
people like online shopping or not. So, after the analysis we know
that the people prefer mode of shipment is ship than the road and the
flight. The females do more shopping than males. The highest ranking
is the 3 rank out of 5. For the detailed analysis and visual
representation the working is given below.
Libraries
– Pandas
– Numpy
– Matplotlib.ployty
– Matplotlib
– Sklearn matrix
Data Pre Processing
we can change the raw data into the understandable format by using the technique of the
data mining. The real world data is incomplete or have many errors but we can not
understand it by looking at the excel sheet. So we perform the data pre processing and
data mining. So to clean or to understand the large data we use the method of data
preprocessing in the data mining.
Shape function

Through the shape function in the data mining we know the total number of columns and rows of
our data set. So in our e commerce data set there are total 10999 rows and 12 columns. It means that
there are 10999 people data in the file.
Describe function
The Describe function tells us the whole dataset mean, minimum and maximum
values, standard deviation etc. This function tells us the whole statistics analysis of
the data frame. this excludes the character columns and give the values of the
numeric columns. The e commerce dataset statistical analysis is given below.
Columns
The column function shows the all column labels in the data frame.
The e commerce dataset has the columns: ID', 'Warehouse_block', 'Mode_of_Shipment',
'Customer_care_calls', 'Customer_rating', 'Cost_of_the_Product', 'Prior_purchases',
'Product_importance', 'Gender', 'Discount_offered', 'Weight_in_gms', 'Reached.on.Time_Y.N‘
Data cleanup
By using the data clean up we can remove the columns or data which in unnecessary for our data set
or the analysis. This help us to remove the tables, unfinished data, un reliable and the inaccurate
data. We can also re model our data set by using the data clean up function. So in the e commerce
data we don't need the column customer care calls so we remove that column.
Missing Values
The real world data is not accurate there may be the missing values or the data unavalibility. So for
looking the missing values in the data set we use the is null() , isna() functions. If the function print true
then it means that there are missing values if the function print false then it shoes there is no missing
values in our data set. So, there are no missing values in the data set. The results are given below:
Aggregation
Through applying aggregate we know that the shipment and purchases minimum and maximum
values.
Mean and Maximum function
Group by

– By applying groupby we know that the customers rate the product importance as the high
medium and low.
– By group by we know there are 5 warehouse in the dataset through which the shipment is
occurred. that are A B C D F.
– By applying groupby function we know the shipment occur through ship most than by the flight
or the road.
Data visualization
The data visualization help us to read the data easily. we can make the graphs of our data set by
using the data visualization. It is easy to understand the data in a visual form. It helps us to identify
the outliers from the data, the patterns and the trends.
shipment vs rating

– We made the bar graph on the shipment and the customer rating. Through the graphical
representation we know that through ship the customer gives more rating and the parcels reach
more safer than the road and flight through ship.
Gender vs. customer rating
Through the histogram we know that the females give more good rating to the products than the
male.
Product cost vs. discount
Importance vs. cost of product
Data splitting
If we want to split our data aur divide it for the testing and training we use the data splitting method.
We can make the portions of our data set in the lables and features to test the validity of our data set.
The train is used to develop a predictive model and test is used for the model performance.So we
divide our data set to half to test and train. so our data set is divided to 5499 rows for the testing
Regression
Linear Regression
Linear regression is used for finding linear relationship between target and one or more predictors. There
are many types of the regression but here we apply the linear regression. The Linear regression tells us
the linear relation between the two variables. There are two types of the linear regression one is simple
the other one is the multiple regression.
Residual error
Root Mean Square Error
The root mean square (RMSE) is essentially the square root of the MSE. Because
of this, the RMSE error is in the same units as the training data outcome. Low
RMSE values are desired.
RMSE=1n∑ni=1(y^i−yi)2√

Ritesh Tandon Machine Learning Project
100% (5)
Ritesh Tandon Machine Learning Project
23 pages
Data Mining Business Report Hansraj Yadav
83% (12)
Data Mining Business Report Hansraj Yadav
34 pages
CSA Dump - S-2
No ratings yet
CSA Dump - S-2
19 pages
Clustering Analysis: Prepared by Muralidharan N
100% (1)
Clustering Analysis: Prepared by Muralidharan N
16 pages
Project 5 Surabhi Sood - Report
No ratings yet
Project 5 Surabhi Sood - Report
34 pages
Data Mining Project
100% (1)
Data Mining Project
14 pages
Machine Learning (Project5) PDF
100% (2)
Machine Learning (Project5) PDF
13 pages
AML Assignment 1 1
No ratings yet
AML Assignment 1 1
4 pages
Machine Learning KNN - Supervised
No ratings yet
Machine Learning KNN - Supervised
9 pages
Data Mining Problem 2 Report
No ratings yet
Data Mining Problem 2 Report
13 pages
Autos Automobile.. EDA Project by Anjali Sinha
No ratings yet
Autos Automobile.. EDA Project by Anjali Sinha
26 pages
2 SVM Kernel
No ratings yet
2 SVM Kernel
8 pages
Data Preprocessing - 241024 - 215531
No ratings yet
Data Preprocessing - 241024 - 215531
40 pages
Ash Hair Salon DM-word
No ratings yet
Ash Hair Salon DM-word
6 pages
Engo 645
No ratings yet
Engo 645
9 pages
Insights
No ratings yet
Insights
2 pages
12 Useful Pandas Techniques in Python For Data Manipulation
100% (2)
12 Useful Pandas Techniques in Python For Data Manipulation
19 pages
Data Mining Graded Assignment: Problem 1: Clustering Analysis
100% (3)
Data Mining Graded Assignment: Problem 1: Clustering Analysis
39 pages
Data Wrangling
No ratings yet
Data Wrangling
30 pages
Quadexp IDS Project
No ratings yet
Quadexp IDS Project
22 pages
Article Review 11 Eng
No ratings yet
Article Review 11 Eng
18 pages
Deep Learning Ram
No ratings yet
Deep Learning Ram
21 pages
DSV Module-4
No ratings yet
DSV Module-4
36 pages
Predictive Model For E-Commerce
100% (1)
Predictive Model For E-Commerce
3 pages
DMDW 03
No ratings yet
DMDW 03
25 pages
Clustering Analysis: Reading The Data
100% (1)
Clustering Analysis: Reading The Data
15 pages
data-cleaning-using-pandas
No ratings yet
data-cleaning-using-pandas
9 pages
12 Dimensionality Reduction Techniqwues (with Python Codes)
No ratings yet
12 Dimensionality Reduction Techniqwues (with Python Codes)
20 pages
Pandas-1
No ratings yet
Pandas-1
13 pages
UNIT 1
No ratings yet
UNIT 1
27 pages
Group A Assignment No2 Writeup
No ratings yet
Group A Assignment No2 Writeup
9 pages
Machine Learning Da Ii Name: Mehakmeet Singh Regno: 16bce0376 Q6.)
No ratings yet
Machine Learning Da Ii Name: Mehakmeet Singh Regno: 16bce0376 Q6.)
48 pages
Machine Learning SVM - Supervised
No ratings yet
Machine Learning SVM - Supervised
32 pages
ML Unit 2
No ratings yet
ML Unit 2
33 pages
M4 Data Mining W4 Business Report
No ratings yet
M4 Data Mining W4 Business Report
22 pages
Data
No ratings yet
Data
36 pages
Exploratory Data Analysis-1
No ratings yet
Exploratory Data Analysis-1
10 pages
Data Cleansing Using R
0% (1)
Data Cleansing Using R
10 pages
Jupyter Lab
No ratings yet
Jupyter Lab
42 pages
November 2010)
No ratings yet
November 2010)
6 pages
Data Wrangling
No ratings yet
Data Wrangling
15 pages
Engo 645
No ratings yet
Engo 645
10 pages
GMC Final Project - Maha
No ratings yet
GMC Final Project - Maha
20 pages
Explorotary Data Analysis
100% (1)
Explorotary Data Analysis
30 pages
Social Media Geeta
No ratings yet
Social Media Geeta
33 pages
3 Awesome Visualization Techniques For Every Dataset: Mlwhiz
No ratings yet
3 Awesome Visualization Techniques For Every Dataset: Mlwhiz
13 pages
Sem Rpa
No ratings yet
Sem Rpa
61 pages
SEM MLOps
No ratings yet
SEM MLOps
58 pages
Deep Learning Workflow
No ratings yet
Deep Learning Workflow
11 pages
Detail Project Report SMDM
100% (1)
Detail Project Report SMDM
25 pages
Stock Price Prediction Using ARIMA Model by Dereje Workneh Medium
No ratings yet
Stock Price Prediction Using ARIMA Model by Dereje Workneh Medium
1 page
Data Science Assignment 2
No ratings yet
Data Science Assignment 2
14 pages
PPA-Building Prediction Model ML
No ratings yet
PPA-Building Prediction Model ML
26 pages
Predicting Credit Card Approvals
100% (1)
Predicting Credit Card Approvals
14 pages
Business Report Pradeep Chauhan 11june'23
100% (1)
Business Report Pradeep Chauhan 11june'23
25 pages
An Extensive Step by Step Guide To Exploratory Data Analysis
No ratings yet
An Extensive Step by Step Guide To Exploratory Data Analysis
26 pages
P-149 Final PPT
No ratings yet
P-149 Final PPT
57 pages
CQF EXAM 3-Answer
No ratings yet
CQF EXAM 3-Answer
14 pages
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
From Everand
(Excerpts From) Investigating Performance: Design and Outcomes With Xapi
Janet Laane Effron
No ratings yet
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Task 1: Introduction of Nestle
No ratings yet
Task 1: Introduction of Nestle
12 pages
Activity 1.1 What Is Entrepreneurship?
No ratings yet
Activity 1.1 What Is Entrepreneurship?
3 pages
Industry and Market Analysis
No ratings yet
Industry and Market Analysis
4 pages
Fashion Brand
No ratings yet
Fashion Brand
10 pages
New Idea Business
No ratings yet
New Idea Business
16 pages
"Technology Helps Starbucks Find New Ways To Compete": Threat of New Entrants
100% (1)
"Technology Helps Starbucks Find New Ways To Compete": Threat of New Entrants
2 pages
Development of Solar Powered Irrigation System
No ratings yet
Development of Solar Powered Irrigation System
14 pages
Object Oriented Programming Lab
No ratings yet
Object Oriented Programming Lab
2 pages
Attendance Monitoring System For Laboratory Rooms 503 and 601
No ratings yet
Attendance Monitoring System For Laboratory Rooms 503 and 601
15 pages
1 - Kinesthetic Perception - A Machine Learning Approach-Springer Singapore (2018)
No ratings yet
1 - Kinesthetic Perception - A Machine Learning Approach-Springer Singapore (2018)
146 pages
Privacy Policy
No ratings yet
Privacy Policy
8 pages
Introduction To Printed Circuit Board Designon
No ratings yet
Introduction To Printed Circuit Board Designon
33 pages
Chapter 6: Auditing in A Computer Information Systems (Cis) or Information Technology (It) Environment
No ratings yet
Chapter 6: Auditing in A Computer Information Systems (Cis) or Information Technology (It) Environment
29 pages
Module 4 - Decision Theory
No ratings yet
Module 4 - Decision Theory
32 pages
Sapt 5 String Processing Quizz Sect 3 L3
No ratings yet
Sapt 5 String Processing Quizz Sect 3 L3
3 pages
Radio-Frequency Block Arrangements For Fixed Wireless Access Systems in The Range 10.15-10.3/10.5-10.65 GHZ
No ratings yet
Radio-Frequency Block Arrangements For Fixed Wireless Access Systems in The Range 10.15-10.3/10.5-10.65 GHZ
5 pages
Exercise 7,8,9 Basic Commands
No ratings yet
Exercise 7,8,9 Basic Commands
7 pages
TN Appsvr222 Disabling The ItemErrorCntAlarm
No ratings yet
TN Appsvr222 Disabling The ItemErrorCntAlarm
2 pages
Image and Video Compression Techniques in Image Processesing An Overview
No ratings yet
Image and Video Compression Techniques in Image Processesing An Overview
8 pages
Autonomous Vehicle Navigation: Homework
No ratings yet
Autonomous Vehicle Navigation: Homework
6 pages
SIMOVERT Master Drives T100 Technology Board: Operating Instructions Hardware
No ratings yet
SIMOVERT Master Drives T100 Technology Board: Operating Instructions Hardware
24 pages
Pegasystems Prep4sure PEGAPCSSA80V1 - 2019 v2021-04-27 by Maya 15q
No ratings yet
Pegasystems Prep4sure PEGAPCSSA80V1 - 2019 v2021-04-27 by Maya 15q
8 pages
Install Androids DK
No ratings yet
Install Androids DK
14 pages
MicrosoftDefender MindMap-1
No ratings yet
MicrosoftDefender MindMap-1
1 page
Transistor Count - Wikipedia
No ratings yet
Transistor Count - Wikipedia
59 pages
General Concept of Information and Communication Technology
100% (1)
General Concept of Information and Communication Technology
6 pages
Compiler Design
No ratings yet
Compiler Design
85 pages
Chapter 18
No ratings yet
Chapter 18
9 pages
GE2B-06 - Koushik Dutta
No ratings yet
GE2B-06 - Koushik Dutta
23 pages
Exercises
No ratings yet
Exercises
32 pages
Skeyetech: Video Surveillance by Fully Automated Drones
No ratings yet
Skeyetech: Video Surveillance by Fully Automated Drones
4 pages
Ex 5
No ratings yet
Ex 5
9 pages
Fire Alarm Smoke Detector & Sprinkler
No ratings yet
Fire Alarm Smoke Detector & Sprinkler
19 pages
86515-01,03,05,07,09,11 Manual
No ratings yet
86515-01,03,05,07,09,11 Manual
19 pages
Understanding Generation Alpha: July 2020
0% (1)
Understanding Generation Alpha: July 2020
22 pages