Data Science With Python-Sasmita PDF
Data Science Training
Build your own predictive models in 45 days with zero prior knowledge
Project-1: Sale Prediction
Approach
Learn how to employ statistical and machine learning algorithms to solve real-life problems by working on real-time projects.
‣ Interactive and live coding session
Welcome To The Course
‣ Introduction to Data Science
‣ Real-Time Use Cases of Data Science
‣ Who Is a Data Scientist?
‣ GitHub Tutorial
‣ Skill Sets Needed for a Data Scientist
‣ 6 Steps to Take in 3 Months for a Complete Transformation to Data Science from Any Other Domain
‣ Machine Learning: Giving Computers the Ability to Learn from Data
‣ Supervised vs Unsupervised
‣ Deep Learning vs Machine Learning
‣ Link to Get Free Data to Practice
‣ Some Great Self-Learning Data Science Resources (Books, Tutorials, Videos, Papers)
‣ Software Installation
‣ Introduction to Python
Python Fundamentals
Python Fundamentals begins with acquiring an in-depth knowledge of the Python programming language. By the end of the week, students will be expected to program intermediate-level scripts in Python.
‣ “Hello Python Program” in IDLE
‣ Jupyter Notebook Tutorial
‣ Spyder Tutorial
‣ Introduction to Python
‣ Variables, Operators, Data Types
‣ If Else, For and While Loops
‣ Functions
‣ Lambda Expression
‣ Filter, Map, Reduce
‣ Taking input from keyboard
‣ HANDS ON
‣ INTERVIEW QUESTION DISCUSSION
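As a taste of the Lambda Expression and Filter, Map, Reduce topics listed above, here is a minimal sketch. The `sales` list and the `apply_discount` name are made-up illustrations, not course material.

```python
from functools import reduce

# Hypothetical sample data: daily sales figures.
sales = [120, 45, 300, 80, 210]

# lambda: a small anonymous function (named here only for demonstration).
apply_discount = lambda amount: amount - 10

# filter: keep only the items for which the predicate returns True.
large_sales = list(filter(lambda amount: amount >= 100, sales))

# map: apply a function to every item.
discounted = list(map(apply_discount, large_sales))

# reduce: fold the list into a single value (here, a sum).
total = reduce(lambda acc, amount: acc + amount, large_sales, 0)

print(large_sales)  # [120, 300, 210]
print(discounted)   # [110, 290, 200]
print(total)        # 630
```

In everyday code a list comprehension or the built-in `sum` often reads more clearly, but the three functions above are the ones named in the topic list.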
Module-2 (Python Advance)
‣ Create Arrays
‣ Array Mathematics
‣ Array Operation
‣ HANDS ON
‣ Introduction to Pandas
‣ Series
‣ Data Frames
‣ Data Merging, Concatenation, Join
‣ Order By
‣ HANDS ON
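The array and Pandas topics above can be sketched in a few lines. This is only an illustration: the `stores` and `sales` tables and their column names are hypothetical, and the example assumes NumPy and pandas are installed.

```python
import numpy as np
import pandas as pd

# Create arrays and do element-wise mathematics.
a = np.array([1, 2, 3])
b = np.arange(4, 7)      # array([4, 5, 6])
total = a + b            # element-wise addition
dot = int(a @ b)         # dot product: 1*4 + 2*5 + 3*6

# Hypothetical tables, used only to illustrate merging and ordering.
stores = pd.DataFrame({"store_id": [1, 2, 3], "city": ["Pune", "Delhi", "Goa"]})
sales = pd.DataFrame({"store_id": [1, 2, 2], "amount": [250, 100, 400]})

# Merge: attach the city to each sale (a left join on store_id).
merged = sales.merge(stores, on="store_id", how="left")

# Order by: sort rows by amount, largest first.
ordered = merged.sort_values("amount", ascending=False).reset_index(drop=True)

# Concatenation: stack two frames that share the same columns.
stacked = pd.concat([sales, sales], ignore_index=True)

print(total, dot)
print(ordered)
print(len(stacked))
```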
Visualisation - Matplotlib, Seaborn
We’ll begin a curriculum focused on various data visualization techniques and how they can help us engage and learn from our data using Matplotlib, Seaborn, ggplot.
‣ Line Plots
‣ Scatter Plots
‣ Pair Plots
‣ Histograms
‣ Heat Maps
‣ Bar Plots
‣ Count Plots
‣ Factor Plots
‣ Box Plots
‣ Violin Plots
‣ Swarm Plots
‣ Strip Plots
‣ Pandas Builtin Visualisation Library
‣ HANDS ON
‣ INTERVIEW QUESTION DISCUSSION
Project-1
Practice, practice, and practice! Implement what you have learnt so far by working on a real-time project.
Pandas
NumPy
Seaborn
Matplotlib
Module-4 (Statistics)
This session is dedicated to creating a deep understanding of mathematical concepts we’ll later see in topics like machine learning and statistical analysis. Contrary to the traditional mathematics course, students will learn statistics and linear algebra in a programmatic way to fit a problem’s needs.
‣ Mean, Median, Mode, Variance, Std. dev
‣ Co-Variance
‣ R-Square
‣ Adjusted R-Square
‣ Sample vs Population
‣ Standardizing Data (Z-score)
‣ Hypothesis Testing
‣ Normal Distribution
‣ Bias Variance Tradeoff
‣ Skewness
‣ P Value
‣ Z-test vs T-test
‣ The F distribution
‣ ANOVA
‣ HANDS ON
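Several of the statistics topics above (mean, median, mode, sample vs population, Z-score) fit in one short sketch using only Python's standard `statistics` module. The `values` list is invented for illustration. A Z-score is computed as z = (x - mean) / standard deviation.

```python
import statistics

# Hypothetical measurements, chosen so the arithmetic is easy to follow.
values = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(values)      # 40 / 8 = 5
median = statistics.median(values)  # average of the two middle values: 4.5
mode = statistics.mode(values)      # most frequent value: 4

# Population std. dev divides by N; sample std. dev divides by N - 1.
pop_std = statistics.pstdev(values)    # sqrt(32 / 8) = 2.0
sample_std = statistics.stdev(values)  # sqrt(32 / 7), slightly larger

# Standardizing: how many population standard deviations each value is from the mean.
z_scores = [(v - mean) / pop_std for v in values]

print(mean, median, mode)
print(pop_std, round(sample_std, 3))
print(z_scores)
```

After standardizing with the population standard deviation, the Z-scores have mean 0 and standard deviation 1, which is what makes features on different scales comparable.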
Supervised
‣ Training a model
‣ Validating results
‣ Overfitting vs Underfitting
structured data
‣ Intro to scikit-learn
‣ HANDS ON
Module-6 (Supervised)
‣ Linear regression
‣ Multivariate regression
‣ Polynomial regression
‣ Multicollinearity
‣ Autocorrelation
‣ Heteroscedasticity
‣ Hands On
‣ KNN
‣ SVM
Model Validation
‣ Classification Report
‣ Confusion Matrix
‣ ROC
‣ RMSE
‣ MSE
‣ Cross validation
‣ Hands On
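To make the validation metrics above concrete, here is a minimal plain-Python sketch of MSE, RMSE, and the four cells of a binary confusion matrix. All data is made up for illustration. In practice, scikit-learn's `sklearn.metrics` module provides `mean_squared_error`, `confusion_matrix`, and `classification_report` for the same purpose.

```python
import math

# --- Regression metrics (hypothetical true and predicted values) ---
y_true = [3.0, 5.0, 2.0, 7.0]
y_pred = [2.0, 5.0, 4.0, 8.0]

# MSE: mean of squared errors. RMSE: its square root, in the target's units.
mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
rmse = math.sqrt(mse)

# --- Binary classification metrics (hypothetical labels, 1 = positive) ---
labels_true = [1, 0, 1, 1, 0, 0, 1, 0]
labels_pred = [1, 0, 0, 1, 0, 1, 1, 0]

pairs = list(zip(labels_true, labels_pred))
tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

accuracy = (tp + tn) / len(pairs)
precision = tp / (tp + fp)
recall = tp / (tp + fn)

print(mse, round(rmse, 4))
print(tp, tn, fp, fn)
print(accuracy, precision, recall)
```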
Module-7 (Unsupervised)
‣ K-Means
‣ PCA
‣ Hands on
Module-8 (Ensemble)
Ensemble Methods
‣ What is Ensembling
‣ Types of Ensembling
‣ Bagging
‣ Boosting
‣ Stacking
‣ Random Forest
‣ XGBoost
‣ HANDS ON
Module-9 (NLP)
NLP
‣ Tokenizer
‣ Stop Word Removal
‣ Tf-idf
‣ Document similarity
‣ Word2vec Model
‣ t-SNE visualisation
‣ Sentiment Analysis
‣ HANDS ON
Deep Learning
‣ Types of NN
‣ Cost Function
‣ TensorFlow Basics
‣ HANDS ON