Zen Data Science Syllabus


SriGeek - Initiative

Certification in Data Science

3-Month Online Weekend Classes with 100% Job Placement Assistance

About IIT MADRAS

The Indian Institute of Technology Madras, or IIT Madras, is a public technical and research university established by the Government of India. IIT-M is recognized globally and holds the distinction of being the No.1 Engineering University in India. With a faculty of international fame, bolstered by a highly motivated student community, IIT Madras stands true as an Institute of Eminence.

GUVI In A Glance

Grab Ur Vernacular Imprint, or GUVI (an IIT-Madras incubated company), is the world’s first vernacular ed-tech learning platform. Founded by ex-PayPal employees, GUVI empowers students to master programming skills in the comfort of their native language. Its mission is to impart technical skills to all through focussed pedagogical tools.
About the Co-Founders

Sridevi

Co-Founder at GUVI

20+ years of Technical Expertise & more

Tech Women Entrepreneur who was selected for Google Developers’ Launchpad Program

Arun Prakash

CEO & Founder at GUVI

20+ years of Technical Expertise & more

Built 7 Products from Scratch, Mentored 1000+ students, Hosted 200+ sessions & 25+ webinars

Bala Murugan

Co-Founder at GUVI

17+ years of experience with IT industry

Technologist with 9+ years of Entrepreneurial experience & Member of the Syllabus Sub-Committee at Anna University
Why GUVI’s Zen Class?

175% Highest Hike

₹21 Lakhs Highest Salary

40% Hike in Demand

200+ Hiring Partners

360+ Best Industry Experts

100% Placement Support

2 months Internship
Program Curriculum
Module-1
We will go through the basics of Python, covering essential beginner-friendly concepts such as data types, loops, data structures and functions, followed by assessments and assignments.

Python - Basics

Why Python

Python IDE

Hello World Program

Variables & Names

String Basics

List

Tuple

Dictionaries

Conditional Statements

Functions

Numbers and Math Functions

Common Errors in Python

Module-2
Now that we have the essential basics of Python, we will cover some advanced concepts such as comprehensions, file handling, regular expressions, object-oriented programming, pickling and more.

Python - Advanced

Functions as Arguments

List Comprehension

File Handling

Debugging in Python

Class and Objects

Regular Expressions

Lambda, Filters and Map

Read Excel Data in Python

Iterators

Pickling

Python JSON
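
As a rough illustration of a few of the Module-2 topics listed above (list comprehension, lambda with filter and map, pickling and JSON), here is a minimal sketch using only the Python standard library. The `scores` data is made up for illustration and is not taken from the course material.

```python
import json
import pickle

# Hypothetical sample data, not from the course material.
scores = {"asha": 82, "ravi": 47, "meena": 91}

# List comprehension: names of learners who scored 50 or more.
passed = [name for name, score in scores.items() if score >= 50]

# Lambda with filter and map: the same selection, then upper-cased.
passed_upper = list(
    map(lambda n: n.upper(), filter(lambda n: scores[n] >= 50, scores))
)

# Pickling: serialise the dict to bytes and restore an equal object.
restored = pickle.loads(pickle.dumps(scores))

# JSON: serialise the dict to text and parse it back.
as_json = json.dumps(scores, sort_keys=True)
parsed = json.loads(as_json)

print(passed, passed_upper, as_json)
```

The comprehension and the filter/map pipeline select the same names; which form reads better is largely a matter of taste.
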
Module-3

We will explore the need for algorithmic thinking and the necessity of efficient coding. We will work through data structures and algorithms along with memory management techniques.

Algorithmic thinking with Python

Introduction to algorithmic Thinking

Algorithm Efficiency and time complexity

Example algorithms - binary search, Euclid’s algorithm

Data structures - stack, heap and binary trees

Memory Management/Technologies

Best Practices - Keeping it simple, DRY code, naming conventions, comments and docs.

Project-1
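
The two example algorithms named in this module, binary search and Euclid's algorithm, can be sketched in a few lines of plain Python. This is one common way to write them, not necessarily the version taught in the course; the function names are chosen here for illustration.

```python
def binary_search(items, target):
    """Return the index of target in the sorted list items, or -1 if absent."""
    lo, hi = 0, len(items) - 1
    while lo <= hi:
        mid = (lo + hi) // 2          # middle of the remaining range
        if items[mid] == target:
            return mid
        if items[mid] < target:
            lo = mid + 1              # target can only be in the right half
        else:
            hi = mid - 1              # target can only be in the left half
    return -1


def gcd(a, b):
    """Greatest common divisor by Euclid's algorithm (repeated remainders)."""
    while b:
        a, b = b, a % b
    return abs(a)


print(binary_search([1, 3, 5, 7, 9, 11], 7))
print(gcd(48, 18))
```

Binary search halves the search range on every step, so it needs O(log n) comparisons on a sorted list, compared with O(n) for a linear scan.
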

Module-4

Since we need to handle huge amounts of data, we will implement data handling techniques with the Pandas library and explore its miscellaneous functions in detail.

Data handling in Python - Pandas 

Introduction to Pandas

Series Data Structure - Querying and Indexing

DataFrame Data Structure - Querying, Indexing and loading

Merging data frames

Group by operation

Pivot table

Date/Time functionality

Example: Manipulating DataFrame

Project-2
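
To give a feel for the merging, group-by, pivot-table and date/time topics listed in this module, here is a small sketch with Pandas. The `orders` and `customers` tables are hypothetical sample data invented for this example.

```python
import pandas as pd

# Hypothetical sample data for illustration only.
orders = pd.DataFrame({
    "order_id": [1, 2, 3, 4],
    "customer_id": [10, 11, 10, 12],
    "amount": [250.0, 100.0, 50.0, 300.0],
    "order_date": pd.to_datetime(
        ["2023-01-05", "2023-01-20", "2023-02-03", "2023-02-10"]
    ),
})
customers = pd.DataFrame({
    "customer_id": [10, 11, 12],
    "city": ["Chennai", "Madurai", "Chennai"],
})

# Merging data frames: attach each customer's city to their orders.
merged = orders.merge(customers, on="customer_id", how="left")

# Group by operation: total order amount per city.
city_totals = merged.groupby("city")["amount"].sum()

# Date/time functionality: extract the month from the order date.
merged["month"] = merged["order_date"].dt.month

# Pivot table: cities as rows, months as columns, summed amounts as values.
monthly = merged.pivot_table(
    index="city", columns="month", values="amount",
    aggfunc="sum", fill_value=0,
)

print(city_totals)
print(monthly)
```
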
Module-5

After discussing the No-SQL database, we will dive into SQL-based databases. We will learn the basics of SQL queries, schemas, and normalization.

SQL

Data Modeling

Normalization, and Star Schema

ACID transactions

Select, insert, update & delete (DML and DQL)

Join operations

Window functions (rank, dense rank, row number etc)

Data Types, Variables and Constants

Conditional Structures (IF, CASE, GOTO and NULL)

Integrating Python with SQL

Project-3
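
One possible way to integrate Python with SQL is the standard-library `sqlite3` module, used below to run a window function (`RANK`) on an in-memory database. The course may use a different database engine; the `sales` table and its rows are made up for this sketch, and window functions assume SQLite 3.25 or newer, which recent Python builds bundle.

```python
import sqlite3

# In-memory database with a hypothetical sales table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (rep TEXT, region TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("Anu", "South", 500),
        ("Bala", "South", 700),
        ("Chitra", "North", 400),
        ("Dev", "North", 400),
    ],
)

# Window function: rank reps within each region by amount, highest first.
rows = conn.execute(
    """
    SELECT rep, region, amount,
           RANK() OVER (PARTITION BY region ORDER BY amount DESC) AS rnk
    FROM sales
    ORDER BY region, rnk, rep
    """
).fetchall()
conn.close()

for row in rows:
    print(row)
```

Because `RANK` gives tied rows the same rank, the two North reps with equal amounts both receive rank 1.
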

Module-6
Probability and statistics are key to understanding, processing and interpreting vast amounts of data. We will deal with the basics, such as probability theory, Bayes theorem and distributions, and their importance. Besides that, we will do hands-on work with NumPy on those concepts.

Probability and Statistics with Numpy

Why counting and probability theory?

Basics of sample and event space

Axioms of probability

Total Probability theorem and Bayes Theorem

Random variables, PMF and CDF

Discrete Distributions - Bernoulli, Binomial and geometric

Expectation and its properties

Variance and its properties

Continuous Distributions - uniform, exponential and normal

Sampling from continuous distributions

Simulation techniques - simulating in NumPy


Assessment
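
As a sketch of how the total probability theorem, Bayes theorem and NumPy simulation fit together, the example below computes a posterior probability analytically and then checks it by simulation. The diagnostic-test numbers (1% prevalence, 95% sensitivity, 5% false-positive rate) are assumptions invented for this illustration.

```python
import numpy as np

# Assumed illustrative numbers, not from the course material.
p_disease = 0.01            # P(D)
p_pos_given_disease = 0.95  # P(+ | D)
p_pos_given_healthy = 0.05  # P(+ | not D)

# Total probability theorem: P(+) = P(+|D)P(D) + P(+|not D)P(not D).
p_pos = (p_disease * p_pos_given_disease
         + (1 - p_disease) * p_pos_given_healthy)

# Bayes theorem: P(D | +) = P(+|D)P(D) / P(+).
posterior = p_disease * p_pos_given_disease / p_pos

# Simulation: draw a population, then test each person.
rng = np.random.default_rng(seed=0)
n = 200_000
has_disease = rng.random(n) < p_disease
p_pos_each = np.where(has_disease, p_pos_given_disease, p_pos_given_healthy)
tests_positive = rng.random(n) < p_pos_each

# Fraction of positive testers who actually have the disease.
simulated = has_disease[tests_positive].mean()

print(f"analytic  P(D|+) = {posterior:.4f}")
print(f"simulated P(D|+) = {simulated:.4f}")
```

The analytic value is about 0.16: even with a fairly accurate test, most positives come from the much larger healthy group.
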
Module-7
We will continue with statistics and probability, dealing with descriptive and inferential statistics along with hypothesis testing and many other relevant statistical methods.

Probability and Statistics with Numpy - Continued

Inferential statistics - sample vs population

CLT and its proof

Chi-squared distribution and its properties

Point and Interval Estimators

Estimation technique - MLE

Interval Estimator of μ with unknown σ

Examples of estimators

Hypothesis Testing - I

Hypothesis Testing - II

Hypothesis Testing - III

Assessment

Module-8
Data visualization is used to understand data in a visual context so that the patterns, trends and correlations in the data can be understood. We will do a lot of visualization with libraries like Seaborn and Matplotlib, which in turn leads to effective storytelling.

Data Visualisation in Python (Matplotlib, Seaborn)

Read Complex JSON files

Styling Tabulation

Distribution of Data - Histogram

Box Plot

Data Visualization - Recap

Pie Chart

Donut Chart

Stacked Bar Plot

Relative Stacked Bar Plot

Stacked Area Plot

Scatter Plots

Bar Plot

Continuous vs Continuous Plot

Line Plot

Line Plot Covid Data

Project-4
Module-9
Real-world data is not always industry-ready, so it always needs to be analysed and preprocessed. This week we will deal with a lot of data cleaning and exploratory data analysis techniques, which form a crucial stage of any data science project.

Data Engineering with Python

Handling missing data

Techniques to impute missing values

Encoding the data

Outlier detection and correction

Meaningful data transformation

Project-5

Module-10
In this module we will begin machine learning with Sklearn, covering supervised learning for regression and classification, evaluation metrics, model interpretation, feature selection and regularisation.

Machine Learning with Sklearn

Introduction to machine learning

Expert systems and 6 Jars

Supervised Learning - Regression and

Classification

Evaluation metrics and measuring accuracy

Introduction to regression

Interpreting models

Feature selection

Regularisation - Ridge and Lasso

Project-6
Module-11

Continuing with the ML algorithms, we will look in detail at different classification algorithms, along with their mathematical intuition and evaluation metrics.

Machine Learning with Sklearn - Continued

Introduction to classification

Evaluation metrics - TP, FP and AUC

Classification using logistic regression

Classification using KNN

Project-7
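
The evaluation metrics named in this module (TP, FP and AUC) can be computed by hand on a tiny example. The sketch below uses plain Python rather than Sklearn so that each count is visible; the labels and scores are invented, the 0.5 threshold is an assumption, and the pairwise AUC shown is one standard way of defining it.

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, FP, TN, FN) for binary labels coded as 0 and 1."""
    tp = fp = tn = fn = 0
    for t, p in zip(y_true, y_pred):
        if p == 1 and t == 1:
            tp += 1
        elif p == 1 and t == 0:
            fp += 1
        elif p == 0 and t == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def auc_by_pairs(y_true, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    # A tie between a positive and a negative score counts as half a win.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical labels and classifier scores for illustration.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.4, 0.8, 0.6, 0.1, 0.7, 0.3]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
precision = tp / (tp + fp)
recall = tp / (tp + fn)              # also called the true positive rate
false_positive_rate = fp / (fp + tn)
auc = auc_by_pairs(y_true, scores)

print(tp, fp, tn, fn, precision, recall, false_positive_rate, auc)
```
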

Module-12
We will explore tree-based classification algorithms in detail, including how to interpret trees, pruning, and ensemble methods such as bagging and boosting.

Machine Learning with Sklearn - Continued

Introduction to decision trees

Building, pruning and interpreting trees

Ensemble techniques - Bagging and boosting


Random forests

Boosted trees - Gradient boosting

Project-8

Module-13
After dealing with many supervised machine learning algorithms, we will compare them and learn when to use which. Besides that, we will cover the do’s and don'ts of training an ML model.

Machine Learning with Sklearn - Continued

Comparison of supervised techniques - when to use what?

Do’s and Don'ts while training ML models

Handling imbalanced data

Undersampling

Oversampling

Other methods - ROSE, SMOTE, etc.

Project-9
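
Of the imbalanced-data techniques listed in this module, random oversampling is the simplest to sketch: minority-class rows are duplicated at random until every class is as large as the biggest one. The version below is plain Python with a hypothetical helper name; it is not SMOTE or ROSE, which create new synthetic points instead of copying existing rows.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate rows of smaller classes until all classes are equal in size."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())        # size of the largest class
    out_rows, out_labels = list(rows), list(labels)
    for label, count in counts.items():
        # Original rows belonging to this class.
        pool = [r for r, l in zip(rows, labels) if l == label]
        for _ in range(target - count):
            out_rows.append(rng.choice(pool))  # copy a random existing row
            out_labels.append(label)
    return out_rows, out_labels


# Hypothetical data: six rows of class 0 and two rows of class 1.
rows = [[0.1], [0.2], [0.3], [0.4], [0.5], [0.6], [5.0], [6.0]]
labels = [0, 0, 0, 0, 0, 0, 1, 1]

new_rows, new_labels = random_oversample(rows, labels)
print(Counter(new_labels))
```

Oversampling should normally be applied to the training split only, so that duplicated rows do not leak into the evaluation data.
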

Final

Capstone Project

2 Months internship with certification


Hear it from our learners

“They are very approachable and friendly when we ask any doubt or any clarification. Before joining guvi I have already done a course of data science in another institution. When comparing these two institutions, there is a lot of difference in teaching. I love that the mentor who is teaching the course is not only a mentor but a professional too. This is a very unique thing about guvi. I will rate 5/5 to Guvi.”

Vishally

“GUVI is one of the best platforms to start a new course and a new career. Advanced Programming and Master Data Science is one of the best programs which are been trained with industry experts. It has its own software to practise and a huge number of exercises to master any topic.”

Tejas Samanthapudi

“Guvi helps me to improve my self-confidence in coding skills. The zoom classes are totally comfortable, friendly and easy to learn. It helps me to understand the basic and the core concepts and it helped me to build logical skills. I got great mentors which helped me to bridge between the academics. I'm very proud. Thanks to Guvi.”

Gokila

“I have attended several classes of Masters in Data science course conducted by Guvi. It is really helpful to gain knowledge as it is different from other online courses. Here, we have mentors in live sessions, so we will be more concentrated than other online courses where we watch pre recorded videos. Also we are getting weekly tasks that would make us learn even if there is no class. I am thankful for all the people in Guvi for building up such a valuable program for our career.”

Gokul

“Guvi offers a cordial, supportive and friendly environment to learners. With excellent support and 24*7 assistance from the mentors guvi does not leave any stone unturned to improvise your learning. Thanks for being such an inspiration to us.”

Gokak Mohd Ishtiyaque

“I always liked coding but I didn't really get a good platform to learn things as per industrial requirements. When I was in search I got to know about Guvi, I really felt trustworthy by their response. When I joined the Data Science course the weekend live classes and recorded course videos has made learning easy to me. Eventually I started spending more time practicing in Codekata. I loved the way Guvi took care of clarifying doubts asap. Thank you!”

Sonia kola

“Hello folks, if you are thinking of a career transition in the ‘Data Science’ field then, “GUVI” is the best platform to get nourished, indulged and protruded in this upcoming field and also, it doesn’t matter from which engineering background you are or whether you are a working fellow. The best thing I found here is you will always get motivated unknowingly and become curious to learn more & more from the tutorial videos conducted by the IITM professors. GUVI helps me to think about the problem in multidimensional ways. Thanks to the GUVI team”

Shubham Nehete

“The datascience course is very good, the concepts are being explained in a crisp manner. The instructors have good depth in the subject and solve every doubt one might have. Thanks to GUVI for setting a great structured program.”

Diliban Sibi

“The course videos help you to learn the tools by yourself and you can track the progress. The mentors are very patient and ensure that students understand the concept, sometimes going the extra mile and explaining. Sometimes the mentors try to teach in your native language, if needed. The practice platforms are easy to learn and practice. By completing this data science course, sure you can become a Data Scientist.”

Sridharan K

“This course is designed being dynamic, interactive and range of materials to refer. This is very well structured in such a way that it makes the participants to perform, discuss, and to participate in assessments that will help the participants to maximize the utilization. This program is suitable for all students, freshers and working professionals. This course is excellent for those who would like to learn the basics of program like Python and would like to broaden their knowledge in Data Science. I enjoyed seeing videos in GUVI website from experts that also explains the concepts in a detailed manner.”

Anbazhagan
Program Details

3-Month Weekend

Please contact our program coordinator

+91-9736097320

Total Course Fee: ₹49,999 (listed price ₹1,00,000)

Pre-Bootcamp Booking Fee: ₹5,000

Remaining Fee: ₹44,999

Now become proficient in AI at affordable installments! Master Artificial Intelligence at just ₹3,750*/month for up to 12 months*.

Note: Valid documents are required for the EMI process. An additional processing fee will be applied. The EMI amount might vary with vendors.

Any woman aspiring to return to the workforce after a career break can join this Career Relaunch Program.

“Gain Proficiency in Data Science from GUVI & Become Recruiters’ TOP PICK!”
Begin your Skill Development Journey Today!

For further information:

+91 9736097320

IITM Research Park - Phase 2, Module #9, 3rd floor, D block, Kanagam Rd, Tharamani, Chennai, Tamil Nadu, India. 600113
