Data Science Syllabus

INTRODUCTION TO DATA SCIENCE

Data Science Overview, Data Science – Why All the Excitement?, Demand for Data
Science Professionals, Brief Introduction to Big Data and Data Analytics, Life Cycle of
Data Science, What Does a Data Scientist Do?, Tools and Technologies Used in Data Science.

STATISTICS
FUNDAMENTALS OF MATHEMATICS AND PROBABILITY – Basic
understanding of linear algebra, linear regression, Matrices and Vectors, Addition and
Multiplication of matrices, Fundamentals of Probability, Probability distribution function
and cumulative distribution function, Problem solving using R for vector manipulation,
Problem solving for probability assignments.
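
As an illustration of the matrix addition and multiplication topics above, here is a minimal sketch. The syllabus names R for its problem-solving sessions; Python is used here only to match the later Python unit. The nested-list representation and the helper names `mat_add` and `mat_mul` are assumptions made for this example.

```python
# Matrices are represented as lists of rows (an assumption for this sketch).

def mat_add(a, b):
    """Element-wise sum of two matrices with the same shape."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("matrices must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def mat_mul(a, b):
    """Matrix product: entry (i, j) is the dot product of row i of a
    with column j of b."""
    if any(len(row) != len(b) for row in a):
        raise ValueError("columns of a must equal rows of b")
    cols_b = list(zip(*b))  # transpose b to iterate over its columns
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b]
            for row in a]


A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
print(mat_add(A, B))  # [[6, 8], [10, 12]]
print(mat_mul(A, B))  # [[19, 22], [43, 50]]
```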

DESCRIPTIVE STATISTICS - Describe or summarize a set of data, Measures of central
tendency and measures of dispersion, The mean, median, mode and skewness, Computing
Standard deviation and Variance, Types of distribution, Sample covariance, Sample
Covariance Matrix, Order statistics, Exploratory analytics methods in R.
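
The measures of central tendency and dispersion listed above can be sketched with Python's standard `statistics` module. The data values are made up for illustration.

```python
import statistics

# Hypothetical sample, chosen so the results are easy to check by hand.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mean = statistics.mean(data)              # arithmetic mean
median = statistics.median(data)          # middle value of the sorted data
mode = statistics.mode(data)              # most frequent value
pop_var = statistics.pvariance(data)      # population variance (divide by n)
pop_sd = statistics.pstdev(data)          # population standard deviation
sample_var = statistics.variance(data)    # sample variance (divide by n - 1)

print(mean, median, mode, pop_var, pop_sd, sample_var)
```

The population and sample variances differ only in the divisor (n versus n - 1), which matters most for small samples like this one.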

INFERENTIAL STATISTICS - What is inferential statistics? Different types of
Sampling techniques, Central Limit Theorem, Point estimate and Interval estimate,
Creating confidence intervals for population parameters, Characteristics of Z-distribution
and T-distribution, Basics of Hypothesis Testing, Types of tests and rejection regions,
Types of errors in Hypothesis Testing, Type I and Type II errors, P-value and Z-score
methods, T-test, Analysis of Variance (ANOVA) and Analysis of Covariance (ANCOVA),
Problem solving for the Central Limit Theorem, Problem solving for Hypothesis Testing,
Problem solving for the T-test and Z-score test, Case study and model run for ANOVA
and ANCOVA, Types of distribution, Exploratory analytics methods in R.
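
The interval-estimate and Z-score topics above can be sketched as follows. This assumes the population standard deviation `sigma` is known, which is what justifies the Z-distribution; with an unknown sigma and a small sample, the T-distribution would be used instead. The function names and sample values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean


def z_confidence_interval(sample, sigma, confidence=0.95):
    """Confidence interval for the population mean, sigma assumed known."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # critical value
    margin = z * sigma / sqrt(len(sample))
    m = mean(sample)
    return m - margin, m + margin


def z_test(sample, mu0, sigma):
    """Two-sided one-sample Z-test: returns the z-score and the p-value."""
    z = (mean(sample) - mu0) / (sigma / sqrt(len(sample)))
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


sample = [52, 48, 50, 51, 49, 53, 47, 50, 50]  # made-up measurements
low, high = z_confidence_interval(sample, sigma=3)
z, p = z_test(sample, mu0=48, sigma=3)
print(low, high, z, p)
```

With a significance level of 0.05, the null hypothesis (mean equals 48) would be rejected here because the p-value falls just below 0.05.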

MACHINE LEARNING
INTRODUCTION TO MACHINE LEARNING - What is Machine Learning? What is
the Challenge? Introduction to Supervised Learning, Unsupervised Learning, What is
Reinforcement Learning?

LINEAR REGRESSION - Introduction to Linear Regression, Linear Regression with
Multiple Variables, Disadvantages of Linear Models, Interpretation of Model Outputs,
Understanding Covariance and Collinearity, Understanding Heteroscedasticity, Case
study on the application of Linear Regression to housing price prediction.
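
A minimal sketch of single-variable linear regression by ordinary least squares, in the spirit of the housing-price case study. The sizes and prices are invented for illustration and `fit_line` is a hypothetical helper, not material from the course.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = sum of cross-deviations / sum of squared x-deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up data: house size (square metres) and price (thousands).
sizes = [50, 60, 80, 100]
prices = [170, 200, 260, 320]

slope, intercept = fit_line(sizes, prices)
predicted = slope * 70 + intercept  # predicted price for a 70 m^2 house
print(slope, intercept, predicted)
```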

LOGISTIC REGRESSION - Introduction to Logistic Regression, Why Logistic
Regression?, Introduction to classification, Cost function for logistic regression,
Application of logistic regression to multi-class classification, Confusion Matrix, Odds
Ratio and ROC Curve, Advantages and Disadvantages of Logistic Regression, Case study
on classifying an email as spam or not spam using Logistic Regression.
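
The cost function and confusion matrix topics above can be sketched like this. The sigmoid maps a linear score to a probability, the log loss (cross-entropy) is the usual cost for logistic regression, and the confusion matrix counts the four outcome types. The scores and labels are invented, and the 0.5 threshold is an assumption.

```python
import math


def sigmoid(z):
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def log_loss(y_true, y_prob):
    """Average cross-entropy cost over binary labels (1 = spam, 0 = not spam)."""
    eps = 1e-12  # keeps log() away from 0
    total = 0.0
    for y, p in zip(y_true, y_prob):
        p = min(max(p, eps), 1 - eps)
        total -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return total / len(y_true)


def confusion_matrix(y_true, y_pred):
    """Count true/false positives and negatives."""
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for t, p in zip(y_true, y_pred):
        if t == 1 and p == 1:
            counts["tp"] += 1
        elif t == 0 and p == 1:
            counts["fp"] += 1
        elif t == 1 and p == 0:
            counts["fn"] += 1
        else:
            counts["tn"] += 1
    return counts


y_true = [1, 0, 1, 1, 0]                 # made-up labels
scores = [2.0, -1.0, 0.5, -0.3, 0.2]     # made-up linear scores
probs = [sigmoid(s) for s in scores]
y_pred = [1 if p >= 0.5 else 0 for p in probs]
cm = confusion_matrix(y_true, y_pred)
print(cm, log_loss(y_true, probs))
```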

DECISION TREES AND SUPERVISED LEARNING – Decision Tree, Data set, How
to build a decision tree?, Understanding the CART Model, Classification Rules, Overfitting
Problem, Stopping Criteria and Pruning, How to find the final size of trees?, Model a
decision tree, Naive Bayes, Random Forests and Support Vector Machines,
Interpretation of Model Outputs, Business Case Study for the CART Model, Business Case
Study for Random Forest, and Business Case Study for SVM.
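
One way to make "how to build a decision tree" concrete is the Gini impurity that CART-style classification trees commonly use to score candidate splits. The sketch below is illustrative only; the labels and the helper names `gini` and `split_gini` are assumptions.

```python
from collections import Counter


def gini(labels):
    """Gini impurity: 0 for a pure node, larger for more mixed nodes."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def split_gini(left, right):
    """Impurity of a split: size-weighted average of the two child nodes."""
    n = len(left) + len(right)
    return len(left) / n * gini(left) + len(right) / n * gini(right)


parent = ["spam", "spam", "ham", "ham"]
print(gini(parent))                                     # 0.5
print(split_gini(["spam", "spam"], ["ham", "ham"]))     # 0.0
print(split_gini(["spam", "ham"], ["spam", "ham"]))     # 0.5
```

A tree builder would prefer the first split, since it lowers impurity from 0.5 to 0.0, while the second leaves it unchanged.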

UNSUPERVISED LEARNING - Hierarchical Clustering, k-Means algorithm for
clustering, groupings of unlabeled data points, Principal Component Analysis (PCA),
Independent Component Analysis (ICA), Anomaly Detection, Recommender System -
collaborative filtering algorithm, Case study on a Recommendation Engine for an
e-commerce/retail chain.
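
A minimal sketch of the k-Means algorithm named above: assign each point to its nearest centroid, move each centroid to the mean of its points, and repeat until nothing changes. The initial centroids are supplied by the caller here to keep the run deterministic; real implementations usually pick them randomly. The points are made up, and `math.dist` requires Python 3.8 or later.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Plain k-means on tuples of coordinates, starting from given centroids."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: each centroid moves to the mean of its cluster.
        new_centroids = []
        for cluster, old in zip(clusters, centroids):
            if cluster:
                new_centroids.append(
                    tuple(sum(axis) / len(cluster) for axis in zip(*cluster)))
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return centroids, clusters


points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = kmeans(points, centroids=[(1, 1), (8, 8)])
print(centroids)
print(clusters)
```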

DEEP LEARNING - Neural Network, Understanding the Neural Network Model,
Understanding Tuning of Neural Networks, Case study using a Neural Network.

NATURAL LANGUAGE PROCESSING - Introduction to Natural Language Processing
(NLP), Word Frequency Algorithms for NLP, Sentiment Analysis, Case Study on Twitter
data analysis using NLP.
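
A small sketch of word-frequency counting and a very naive lexicon-based sentiment score. The tiny `POSITIVE` and `NEGATIVE` word sets are placeholders invented for this example; a real sentiment analysis would use a proper lexicon or a trained model.

```python
import re
from collections import Counter

# Hypothetical mini-lexicons, for illustration only.
POSITIVE = {"good", "great", "love"}
NEGATIVE = {"bad", "terrible", "hate"}


def word_frequencies(text):
    """Lowercase the text, split it into words, and count each word."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


def sentiment_score(text):
    """Positive word count minus negative word count."""
    freq = word_frequencies(text)
    return sum(freq[w] for w in POSITIVE) - sum(freq[w] for w in NEGATIVE)


tweet = "Great phone, great battery. Bad camera."
freq = word_frequencies(tweet)
print(freq.most_common(2))
print(sentiment_score(tweet))
```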

PYTHON FOR DATA SCIENCE AND MACHINE LEARNING
PYTHON PROGRAMMING BASICS – Installing Jupyter Notebooks, Python
Overview, Python 2.7 vs Python 3, Python Identifiers, Various Operators and Operator
Precedence, Getting Input from the User, Comments and Multi-line Comments.

MAKING DECISIONS AND LOOP CONTROL - Simple if Statement, if-else
Statement, if-elif-else Statement, Introduction to while Loops, Introduction to for Loops,
Using continue and break.
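
The decision and loop-control statements listed above, in one short sketch. The function names are hypothetical.

```python
def classify(n):
    """if / elif / else: exactly one branch runs."""
    if n < 0:
        return "negative"
    elif n == 0:
        return "zero"
    else:
        return "positive"


def first_even_squares(numbers, limit):
    """Collect squares of even numbers, stopping once `limit` are found."""
    result = []
    for n in numbers:
        if n % 2 != 0:
            continue          # skip odd numbers, go to the next iteration
        if len(result) == limit:
            break             # leave the loop entirely
        result.append(n * n)
    return result


def count_halvings(n):
    """while loop: count how many times n can be halved before reaching 1."""
    steps = 0
    while n > 1:
        n //= 2
        steps += 1
    return steps


print(classify(-3), classify(0), classify(7))
print(first_even_squares(range(1, 20), 3))
print(count_halvings(16))
```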

PYTHON DATA TYPES: LISTS, TUPLES AND DICTIONARIES - Python Lists,
Tuples, Dictionaries, Accessing Values, Basic Operations, Indexing, Slicing, and
Matrices, Built-in Functions and Methods, Exercises on Lists, Tuples and Dictionaries.

FUNCTIONS AND MODULES - Introduction to Functions, Why Define Functions?,
Calling Functions, Functions with Multiple Arguments, Anonymous Functions – Lambda,
Using Built-In Modules, User-Defined Modules, Module Namespaces, Iterators and
Generators.
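
A compact sketch of several topics from this unit: a function with multiple arguments, a lambda, a built-in module, an iterator, and a generator. All names are made up for the example.

```python
import math  # a built-in module, accessed through its own namespace


def power(base, exponent=2):
    """Function with two arguments; the second has a default value."""
    return base ** exponent


# Anonymous function (lambda) passed to map()
squares = list(map(lambda x: x * x, [1, 2, 3]))


def countdown(n):
    """Generator: yields values one at a time instead of building a list."""
    while n > 0:
        yield n
        n -= 1


# Iterator protocol: iter() creates an iterator, next() advances it
it = iter([10, 20])
first = next(it)
second = next(it)

print(power(3), power(2, 5))
print(squares)
print(list(countdown(3)))
print(first, second, math.sqrt(16))
```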

FILE I/O AND EXCEPTION HANDLING – Opening and Closing Files, Open
Function, File Object Attributes, Close Method, Read, Write, Seek, Exception Handling, the
try-finally Clause, Raising an Exception, User-Defined Exceptions, Regular Expressions –
Search and Replace, Regular Expression Modifiers, Regular Expression Patterns and the re
module.
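
A sketch tying together file reading and writing, `seek`, the try-finally clause, a user-defined exception, and a regular-expression search and replace. The file names, the `EmptyFileError` class, and the `read_nonempty` helper are assumptions for illustration; the files live in a temporary directory that is removed afterwards.

```python
import os
import re
import tempfile


class EmptyFileError(Exception):
    """User-defined exception raised when a file has no content."""


def read_nonempty(path):
    f = open(path, "r", encoding="utf-8")
    try:
        text = f.read()
    finally:
        f.close()  # runs whether or not read() raised
    if not text:
        raise EmptyFileError(path)
    return text


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "notes.txt")
    with open(path, "w", encoding="utf-8") as f:
        f.write("Python 2.7 and python 3")

    # Binary mode so that seek() takes a plain byte offset.
    with open(path, "rb") as f:
        f.seek(7)
        tail = f.read()

    content = read_nonempty(path)

    empty_path = os.path.join(tmp, "empty.txt")
    open(empty_path, "w").close()
    try:
        read_nonempty(empty_path)
        raised = False
    except EmptyFileError:
        raised = True

# Search and replace with the IGNORECASE modifier
cleaned = re.sub(r"python", "Python", content, flags=re.IGNORECASE)
print(tail, raised, cleaned)
```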

NUMPY – Introduction to NumPy, Array Creation, Printing Arrays, Basic Operations,
Indexing, Slicing and Iterating, Shape Manipulation – Changing the shape, stacking and
splitting of arrays, Vector stacking.

PANDAS - Introduction to Pandas, Importing Data into Python, Pandas DataFrames,
Indexing DataFrames, Basic Operations with DataFrames, Renaming Columns, Subsetting
and filtering a DataFrame.

MATPLOTLIB - Introduction, plot, Controlling Line Properties, Working with Multiple
Figures and Histograms.

ONLINE OPEN SOURCE PROJECTS
