Applied AI Course
Schedule
DATE | MODULE | CHAPTER | TOPIC
2019-11-01 | Module 1: Fundamentals of Programming | Python for DataScience | Keywords and identifiers, comments, indentation and statements, Variables and data types in Python, Standard Input and Output, Operators, Control flow: if else, Control flow: while loop
2019-11-02 | Module 1: Fundamentals of Programming | Python for DataScience | Control flow: for loop, Control flow: break and continue, Revision Python for DataScience
2019-11-03 | Module 1: Fundamentals of Programming | Python for DataScience: Data Structures | Lists, Tuples part 1, Tuples part-2, Sets, Dictionary, Strings
2019-11-04 | Module 1: Fundamentals of Programming | Python for DataScience: Data Structures | Revision Python for DataScience: Data Structures
2019-11-05 | Module 1: Fundamentals of Programming | Python for DataScience: Functions | Introduction, Types of functions, Function arguments, Recursive functions, Lambda functions, Modules, Packages, File Handling
2019-11-06 | Module 1: Fundamentals of Programming | Python for DataScience: Functions | Exception Handling, Debugging Python, Assignment-1, Revision Python for DataScience: Functions
2019-11-07 | Module 1: Fundamentals of Programming | Python for DataScience: Numpy | Numpy Introduction, Numerical operations on Numpy, Revision Python for DataScience: Numpy
2019-11-08 | Module 1: Fundamentals of Programming | Python for DataScience: Matplotlib | Getting started with Matplotlib, Revision Python for DataScience: Matplotlib
2019-11-09 | Module 1: Fundamentals of Programming | Python for DataScience: Pandas | Getting started with pandas, Data Frame Basics, Key Operations on Data Frames, Revision Python for DataScience: Pandas
2019-11-10 | Module 1: Fundamentals of Programming | Python for DataScience: Computational Complexity | Space and Time Complexity: Find largest number in a list, Binary search, Find elements common in two lists, Find elements common in two lists using a Hashtable/Dict, Revision Python for DataScience: Computational Complexity
2019-11-11 | Module 1: Fundamentals of Programming | SQL | Introduction to Databases, Why SQL?, Execution of an SQL statement, IMDB dataset, Installing MySQL, Load IMDB data, USE, DESCRIBE, SHOW TABLES
2019-11-12 | Module 1: Fundamentals of Programming | SQL | SELECT, LIMIT, OFFSET, ORDER BY, DISTINCT, WHERE, Comparison operators, NULL, Logical Operators, Aggregate Functions: COUNT, MIN, MAX, AVG, SUM
2019-11-13 | Module 1: Fundamentals of Programming | SQL | GROUP BY, HAVING, Order of keywords, Join and Natural Join, Inner, Left, Right and Outer joins, Sub Queries/Nested Queries/Inner Queries, DML: INSERT
2019-11-14 | Module 1: Fundamentals of Programming | SQL | DML: UPDATE, DELETE, DDL: CREATE TABLE, DDL: ALTER: ADD, MODIFY, DROP, DDL: DROP TABLE, TRUNCATE, DELETE, Data Control Language: GRANT, REVOKE, Learning resources, Assignment-22: SQL Assignment on IMDB data, Revision SQL
2019-11-15 | Module 1: Fundamentals of Programming | SQL | Revision SQL
2019-11-16 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Plotting for exploratory data analysis (EDA) | Introduction to IRIS dataset and 2D scatter plot, 3D scatter plot, Pair plots, Limitations of Pair Plots, Histogram and Introduction to PDF (Probability Density Function), Univariate Analysis using PDF, CDF (Cumulative Distribution Function), Mean, Variance and Standard Deviation
2019-11-17 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Plotting for exploratory data analysis (EDA) | Median, Percentiles and Quantiles, IQR (Inter Quartile Range) and MAD (Median Absolute Deviation), Box-plot with Whiskers, Violin Plots, Summarizing Plots, Univariate, Bivariate and Multivariate analysis, Multivariate Probability Density, Contour Plot, Exercise: Perform EDA on Haberman dataset
2019-11-18 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Plotting for exploratory data analysis (EDA) | Revision Plotting for exploratory data analysis (EDA)
2019-11-19 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Linear Algebra | Why learn it?, Introduction to Vectors (2-D, 3-D, n-D), Row Vector and Column Vector, Dot Product and Angle between 2 Vectors, Projection and Unit Vector, Equation of a line (2-D), Plane (3-D) and Hyperplane (n-D), Plane Passing through origin, Normal to a Plane, Distance of a point from a Plane/Hyperplane, Half-Spaces, Equation of a Circle (2-D), Sphere (3-D) and Hypersphere (n-D), Equation of an Ellipse (2-D), Ellipsoid (3-D) and Hyperellipsoid (n-D), Square, Rectangle, Hyper Cube, Hyper Cuboid
2019-11-20 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Linear Algebra | Revision Questions, Revision Linear Algebra
2019-11-21 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Probability And Statistics | Introduction to Probability and Statistics, Population and Sample, Gaussian/Normal Distribution and its PDF (Probability Density Function), CDF (Cumulative Distribution function) of Gaussian/Normal distribution, Symmetric distribution, Skewness and Kurtosis, Standard normal variate (Z) and standardization, Kernel density estimation, Sampling distribution & Central Limit theorem
2019-11-22 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Probability And Statistics | Q-Q plot: How to test if a random variable is normally distributed or not?, How distributions are used?, Chebyshev’s inequality, Discrete and Continuous Uniform distributions, How to randomly sample data points (Uniform Distribution), Bernoulli and Binomial Distribution
2019-11-23 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Probability And Statistics | Log Normal Distribution, Power law distribution, Box cox transform, Applications of non-gaussian distributions?, Co-variance, Pearson Correlation Coefficient, Spearman Rank Correlation Coefficient
2019-11-24 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Probability And Statistics | Correlation vs Causation, How to use correlations?, Confidence interval (C.I) Introduction, Computing confidence interval given the underlying distribution, C.I for mean of a normal random variable, Confidence interval using bootstrapping, Hypothesis testing methodology, Null-hypothesis, p-value, Hypothesis Testing Intuition with coin toss example
2019-11-25 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Probability And Statistics | Resampling and permutation test, K-S Test for similarity of two distributions, Code Snippet K-S Test, Hypothesis testing: another example, Resampling and Permutation test: another example, How to use hypothesis testing?
2019-11-26 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Probability And Statistics | Proportional sampling, Revision Questions, Revision Probability And Statistics
2019-11-27 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Dimensionality Reduction And Visualization | What is Dimensionality reduction?, Row Vector and Column Vector, How to represent a data set?, How to represent a dataset as a Matrix, Data Preprocessing: Feature Normalisation, Mean of a data matrix, Data Preprocessing: Column Standardization, Co-variance of a Data Matrix, MNIST dataset (784 dimensional)
2019-11-28 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Dimensionality Reduction And Visualization | Code to Load MNIST Data Set, Revision Dimensionality Reduction And Visualization
2019-11-29 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Principal Component Analysis | Why learn PCA?, Geometric intuition of PCA, Mathematical objective function of PCA, Alternative formulation of PCA: Distance minimization, Eigen values and Eigen vectors (PCA): Dimensionality reduction, PCA for Dimensionality Reduction and Visualization, Visualize MNIST dataset, Limitations of PCA, PCA Code example
2019-11-30 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | Principal Component Analysis | PCA for dimensionality reduction (not-visualization), Revision Principal Component Analysis
2019-12-01 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | T-Sne | What is t-SNE?, Neighborhood of a point, Embedding, Geometric intuition of t-SNE, Crowding Problem, How to apply t-SNE and interpret its output, t-SNE on MNIST, Code example of t-SNE, Revision Questions
2019-12-02 | Module 2: Datascience: Exploratory Data Analysis and Data Visualization | T-Sne | Revision T-Sne
2019-12-03 | Module 3: Foundations of Natural Language Processing and Machine Learning | Predict rating given product reviews on amazon | Dataset overview: Amazon Fine Food reviews (EDA), Data Cleaning: Deduplication, Why convert text to a vector?, Bag of Words (BoW), Text Preprocessing: Stemming, Stop-word removal, Tokenization, Lemmatization, uni-gram, bi-gram, n-grams
2019-12-04 | Module 3: Foundations of Natural Language Processing and Machine Learning | Predict rating given product reviews on amazon | tf-idf (term frequency-inverse document frequency), Why use log in IDF?, Word2Vec, Avg-Word2Vec, tf-idf weighted Word2Vec, Bag of Words (Code Sample), Text Preprocessing (Code Sample)
2019-12-05 | Module 3: Foundations of Natural Language Processing and Machine Learning | Predict rating given product reviews on amazon | Bi-Grams and n-grams (Code Sample), TF-IDF (Code Sample), Word2Vec (Code Sample), Avg-Word2Vec and TFIDF-Word2Vec (Code Sample), Assignment-2: Apply t-SNE, Revision Predict rating given product reviews on amazon
2019-12-06 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification And Regression Models: K-Nearest Neighbors | How “Classification” works?, Data matrix notation, Classification vs Regression (examples), K-Nearest Neighbours Geometric intuition with a toy example, Failure cases of KNN, Distance measures: Euclidean (L2), Manhattan (L1), Minkowski, Hamming, Cosine Distance & Cosine Similarity, How to measure the effectiveness of k-NN?
2019-12-07 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification And Regression Models: K-Nearest Neighbors | Test/Evaluation time and space complexity, KNN Limitations, Decision surface for K-NN as K changes, Overfitting and Underfitting, Need for Cross validation, K-fold cross validation
2019-12-08 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification And Regression Models: K-Nearest Neighbors | Visualizing train, validation and test datasets, How to determine overfitting and underfitting?, Time based splitting, k-NN for regression, Weighted k-NN, Voronoi diagram, Binary search tree, How to build a kd-tree
2019-12-09 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification And Regression Models: K-Nearest Neighbors | Find nearest neighbours using kd-tree, Limitations of Kd tree, Extensions, Hashing vs LSH, LSH for cosine similarity, LSH for euclidean distance, Probabilistic class label
2019-12-10 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification And Regression Models: K-Nearest Neighbors | Code Sample: Decision boundary, Code Sample: Cross Validation, Question and Answers, Revision Classification And Regression Models: K-Nearest Neighbors
2019-12-11 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification Algorithms in Various Situations | Introduction, Imbalanced vs balanced dataset, Multi-class classification, k-NN, given a distance or similarity matrix, Train and test set differences, Impact of outliers, Local outlier Factor (Simple solution: Mean distance to Knn)
2019-12-12 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification Algorithms in Various Situations | K-Distance(A), N(A), Reachability-Distance(A,B), Local reachability-density(A), Local outlier Factor(A), Impact of Scale & Column standardization, Interpretability, Feature Importance and Forward Feature selection, Handling categorical and numerical features
2019-12-13 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification Algorithms in Various Situations | Handling missing values by imputation, Curse of dimensionality, Bias-Variance tradeoff, Intuitive understanding of bias-variance, Best and worst cases for an algorithm, Question and Answers
2019-12-14 | Module 3: Foundations of Natural Language Processing and Machine Learning | Classification Algorithms in Various Situations | Revision Classification Algorithms in Various Situations
2019-12-15 | Module 3: Foundations of Natural Language Processing and Machine Learning | Performance Measurement of Models | Accuracy, Confusion matrix, TPR, FPR, FNR, TNR, Precision and recall, F1-score, Receiver Operating Characteristic (ROC) curve and AUC, Log-loss, R-Squared/Coefficient of determination
2019-12-16 | Module 3: Foundations of Natural Language Processing and Machine Learning | Performance Measurement of Models | Median absolute deviation (MAD), Distribution of errors, Assignment-3: Apply k-nearest neighbour
2019-12-17 | Module 3: Foundations of Natural Language Processing and Machine Learning | Performance Measurement of Models | Revision Performance Measurement of Models
2019-12-18 | Module 3: Foundations of Natural Language Processing and Machine Learning | Naive Bayes | Conditional probability, Independent vs Mutually exclusive events, Bayes Theorem with examples, Exercise problems on Bayes Theorem, Naive Bayes algorithm
2019-12-19 | Module 3: Foundations of Natural Language Processing and Machine Learning | Naive Bayes | Toy example: Train and test stages, Naive Bayes on Text data, Laplace/Additive Smoothing, Log-probabilities for numerical stability, Bias and Variance tradeoff
2019-12-20 | Module 3: Foundations of Natural Language Processing and Machine Learning | Naive Bayes | Feature importance and interpretability, Imbalanced data, Outliers, Missing values, Handling Numerical features (Gaussian NB), Multiclass classification, Similarity or Distance matrix, Large dimensionality, Best and worst cases, Code example, Assignment-4: Apply Naive Bayes
2019-12-21 | Module 3: Foundations of Natural Language Processing and Machine Learning | Naive Bayes | Revision Naive Bayes
2019-12-22 | Module 3: Foundations of Natural Language Processing and Machine Learning | Logistic Regression | Geometric intuition of Logistic Regression, Sigmoid function: Squashing, Mathematical formulation of Objective function
2019-12-23 | Module 3: Foundations of Natural Language Processing and Machine Learning | Logistic Regression | Weight vector, L2 Regularization: Overfitting and Underfitting, L1 regularization and sparsity, Probabilistic Interpretation: Gaussian Naive Bayes, Loss minimization interpretation
2019-12-24 | Module 3: Foundations of Natural Language Processing and Machine Learning | Logistic Regression | Hyperparameter search: Grid Search and Random Search, Column Standardization, Feature importance and Model interpretability, Collinearity of features, Test/Run time space and time complexity, Real world cases, Non-linearly separable data & feature engineering
2019-12-25 | Module 3: Foundations of Natural Language Processing and Machine Learning | Logistic Regression | Code sample: Logistic regression, GridSearchCV, RandomSearchCV, Assignment-5: Apply Logistic Regression, Extensions to Logistic Regression: Generalized linear models
2019-12-26 | Module 3: Foundations of Natural Language Processing and Machine Learning | Logistic Regression | Revision Logistic Regression
2019-12-27 | Module 3: Foundations of Natural Language Processing and Machine Learning | Linear Regression | Geometric intuition of Linear Regression, Mathematical formulation, Real world Cases, Code sample for Linear Regression, Question and Answers, Revision Linear Regression
2019-12-28 | Module 3: Foundations of Natural Language Processing and Machine Learning | Solving Optimization Problems | Differentiation, Online differentiation tools, Maxima and Minima, Vector calculus: Grad, Gradient descent: geometric intuition, Learning rate, Gradient descent for linear regression
2019-12-29 | Module 3: Foundations of Natural Language Processing and Machine Learning | Solving Optimization Problems | SGD algorithm, Constrained Optimization & PCA, Logistic regression formulation revisited, Why L1 regularization creates sparsity?, Assignment 6: Implement SGD for linear regression
2019-12-30 | Module 3: Foundations of Natural Language Processing and Machine Learning | Solving Optimization Problems | Revision Solving Optimization Problems
2019-12-31 | Module 4: Machine Learning-II (Supervised Learning Models) | Support Vector Machines | Geometric Intuition, Why we take values +1 and -1 for Support vector planes, Mathematical derivation, Loss function (Hinge Loss) based interpretation, Dual form of SVM formulation
2020-01-01 | Module 4: Machine Learning-II (Supervised Learning Models) | Support Vector Machines | Kernel trick, Polynomial kernel, RBF-Kernel, Domain specific Kernels, Train and run time complexities, nu-SVM: control errors and support vectors, SVM Regression, Cases, Code Sample
2020-01-02 | Module 4: Machine Learning-II (Supervised Learning Models) | Support Vector Machines | Assignment 7: Apply SVM, Revision Support Vector Machines
2020-01-03 | Module 4: Machine Learning-II (Supervised Learning Models) | Decision Trees | Geometric Intuition of decision tree: Axis parallel hyperplanes, Sample Decision tree, Building a decision Tree: Entropy, Building a decision Tree: Information Gain, Building a decision Tree: Gini Impurity, Building a decision Tree: Constructing a DT, Building a decision Tree: Splitting numerical features, Feature standardization, Building a decision Tree: Categorical features with many possible values, Overfitting and Underfitting, Train and Run time complexity, Regression using Decision Trees, Cases, Code Samples, Assignment 8: Apply Decision Trees
2020-01-04 | Module 4: Machine Learning-II (Supervised Learning Models) | Decision Trees | Revision Decision Trees
2020-01-05 | Module 4: Machine Learning-II (Supervised Learning Models) | Ensemble Models | What are ensembles?, Bootstrapped Aggregation (Bagging) Intuition, Random Forest and their construction, Bias-Variance tradeoff, Bagging: Train and Run-time Complexity, Bagging: Code Sample, Extremely randomized trees, Random Tree: Cases, Boosting Intuition, Residuals, Loss functions and gradients
2020-01-06 | Module 4: Machine Learning-II (Supervised Learning Models) | Ensemble Models | Gradient Boosting, Regularization by Shrinkage, Train and Run time complexity, XGBoost: Boosting + Randomization, AdaBoost: geometric intuition, Stacking models, Cascading classifiers, Kaggle competitions vs Real world
2020-01-07 | Module 4: Machine Learning-II (Supervised Learning Models) | Ensemble Models | Assignment-9: Apply Random Forests & GBDT, Revision Ensemble Models
2020-01-08 | Module 5: Feature Engineering, Productionization and Deployment of ML Models | Featurization And Feature Importance | Introduction, Moving window for Time Series Data, Fourier decomposition, Deep learning features: LSTM, Image histogram
2020-01-09 | Module 5: Feature Engineering, Productionization and Deployment of ML Models | Featurization And Feature Importance | Keypoints: SIFT, Deep learning features: CNN, Relational data, Graph data, Indicator variables, Feature binning, Interaction variables, Mathematical transforms, Model specific featurizations, Feature orthogonality, Domain specific featurizations
2020-01-10 | Module 5: Feature Engineering, Productionization and Deployment of ML Models | Featurization And Feature Importance | Feature slicing, Kaggle Winners solutions, Revision Featurization And Feature Importance
2020-01-11 | Module 5: Feature Engineering, Productionization and Deployment of ML Models | Miscellaneous Topics | Calibration of Models: Need for calibration, Calibration Plots, Platt’s Calibration/Scaling, Isotonic Regression, Code Samples, Modeling in the presence of outliers: RANSAC, Productionizing models, Retraining models periodically, A/B testing
2020-01-12 | Module 5: Feature Engineering, Productionization and Deployment of ML Models | Miscellaneous Topics | Data Science Life cycle, VC dimension, Revision Miscellaneous Topics
2020-01-13 | Module 6: Machine Learning Real World Case studies | Quora Question Pair Similarity | Business/Real world problem: Problem definition, Business objectives and constraints, Mapping to an ML problem: Data overview, Mapping to an ML problem: ML problem and performance metric, Mapping to an ML problem: Train-test split, EDA: Basic Statistics, EDA: Basic Feature Extraction, EDA: Text Preprocessing, EDA: Advanced Feature Extraction, EDA: Feature analysis, EDA: Data Visualization: T-SNE
2020-01-14 | Module 6: Machine Learning Real World Case studies | Quora Question Pair Similarity | EDA: TF-IDF weighted Word2Vec featurization, ML Models: Loading Data, ML Models: Random Model, ML Models: Logistic Regression and Linear SVM, ML Models: XGBoost, Assignments
2020-01-15 | Module 6: Machine Learning Real World Case studies | Quora Question Pair Similarity | Revision Quora Question Pair Similarity
2020-01-16 | Module 6: Machine Learning Real World Case studies | Personalized Cancer Diagnosis | Business/Real world problem: Overview, Business objectives and constraints, ML problem formulation: Data, ML problem formulation: Mapping real world to ML problem, ML problem formulation: Train, CV and Test data construction, Exploratory Data Analysis: Reading data & preprocessing, Exploratory Data Analysis: Distribution of Class-labels, Exploratory Data Analysis: “Random” Model, Univariate Analysis: Gene feature
2020-01-17 | Module 6: Machine Learning Real World Case studies | Personalized Cancer Diagnosis | Univariate Analysis: Variation Feature, Univariate Analysis: Text feature, Machine Learning Models: Data preparation, Baseline Model: Naive Bayes, K-Nearest Neighbors Classification, Logistic Regression with class balancing, Logistic Regression without class balancing, Linear-SVM
2020-01-18 | Module 6: Machine Learning Real World Case studies | Personalized Cancer Diagnosis | Random-Forest with one-hot encoded features, Random-Forest with response-coded features, Stacking Classifier, Majority Voting classifier, Assignment, Revision Personalized Cancer Diagnosis
2020-01-19 | Module 6: Machine Learning Real World Case studies | Facebook Friend Recommendation Using Graph Mining | Problem definition, Overview of Graphs: node/vertex, edge/link, directed-edge, path, Data format & Limitations, Mapping to a supervised classification problem, Business constraints & Metrics, EDA: Basic Stats
2020-01-20 | Module 6: Machine Learning Real World Case studies | Facebook Friend Recommendation Using Graph Mining | EDA: Follower and following stats, EDA: Binary Classification Task, EDA: Train and test split, Feature engineering on Graphs: Jaccard & Cosine Similarities, PageRank
2020-01-21 | Module 6: Machine Learning Real World Case studies | Facebook Friend Recommendation Using Graph Mining | Shortest Path, Connected-components, Adar Index, Katz Centrality, HITS Score, SVD, Weight features, Modeling, Assignment
2020-01-22 | Module 6: Machine Learning Real World Case studies | Facebook Friend Recommendation Using Graph Mining | Revision Facebook Friend Recommendation Using Graph Mining
2020-01-23 | Module 6: Machine Learning Real World Case studies | Taxi Demand Prediction in New York City | Business/Real world problem Overview, Objectives and Constraints, Mapping to ML problem: Data, Mapping to ML problem: dask dataframes, Mapping to ML problem: Fields/Features, Mapping to ML problem: Time series forecasting/Regression, Mapping to ML problem: Performance metrics, Data Cleaning: Latitude and Longitude data, Data Cleaning: Trip Duration, Data Cleaning: Speed, Data Cleaning: Distance, Data Cleaning: Fare, Data Cleaning: Remove all outliers/erroneous points, Data Preparation: Clustering/Segmentation
2020-01-24 | Module 6: Machine Learning Real World Case studies | Taxi Demand Prediction in New York City | Data Preparation: Time binning, Data Preparation: Smoothing time-series data, Data Preparation: Smoothing time-series data cont., Data Preparation: Time series and Fourier transforms, Ratios and previous-time-bin values, Simple moving average, Weighted Moving average, Exponential weighted moving average, Results, Regression models: Train-Test split & Features, Linear regression, Random Forest regression, Xgboost Regression, Model comparison, Assignment
2020-01-25 | Module 6: Machine Learning Real World Case studies | Taxi Demand Prediction in New York City | Revision Taxi Demand Prediction in New York City
2020-01-26 | Module 6: Machine Learning Real World Case studies | Stack Overflow Tag Predictor | Business/Real world problem, Business objectives and constraints, Mapping to an ML problem: Data overview, Mapping to an ML problem: ML problem formulation, Mapping to an ML problem: Performance metrics, Hamming loss, EDA: Data Loading, EDA: Analysis of tags, EDA: Data Preprocessing, Data Modeling: Multi label Classification
2020-01-27 | Module 6: Machine Learning Real World Case studies | Stack Overflow Tag Predictor | Data preparation, Train-Test Split, Featurization, Logistic regression: One VS Rest, Sampling data and tags + Weighted models, Logistic regression revisited, Why not use advanced techniques, Assignments
2020-01-28 | Module 6: Machine Learning Real World Case studies | Stack Overflow Tag Predictor | Revision Stack Overflow Tag Predictor
2020-01-29 | Module 6: Machine Learning Real World Case studies | Microsoft Malware Detection | Problem Definition, Objectives and Constraints, Data Overview, ML Problem, Train and Test Splitting, Exploratory Data Analysis: Class Distribution, Exploratory Data Analysis: Feature Extraction from Byte Files, Exploratory Data Analysis: Multivariate analysis of features from byte files, Train-Test class Distribution, ML models using byte files only: Random Model, K-NN, Logistic regression, Random Forest and XGBoost, Feature Extraction and Multi Threading
2020-01-30 | Module 6: Machine Learning Real World Case studies | Microsoft Malware Detection | File Size Feature, Univariate Analysis, T-SNE Analysis, ML Models on ASM File features, Models on all features: t-SNE, Models on all features: RandomForest and XGBoost, Assignment, Revision Microsoft Malware Detection
2020-01-31 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Clustering | What is Clustering?, Unsupervised learning, Applications, Metrics for Clustering, K-Means: Geometric intuition, Centroids, K-Means: Mathematical formulation: Objective function, K-Means Algorithm, How to initialize: K-Means++
2020-02-01 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Clustering | Failure cases/Limitations, K-Medoids, Determining the right K, Code Samples, Time and space complexity, Revision Clustering
2020-02-02 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Hierarchical Clustering | Agglomerative & Divisive, Dendrograms, Agglomerative Clustering, Proximity methods: Advantages and Limitations, Time and Space Complexity, Limitations of Hierarchical Clustering, Code sample, Assignment 10: Apply k-means, agglomerative, DBSCAN Clustering algorithms
2020-02-03 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Hierarchical Clustering | Revision Hierarchical Clustering
2020-02-04 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | DBSCAN Technique | Density based clustering, MinPts and Eps: Density, Core, Border and Noise points, Density edge and Density connected points, DBSCAN Algorithm, Hyper Parameters: MinPts and Eps, Advantages and Limitations of DBSCAN, Time and Space Complexity, Code samples, Question and Answers, Revision DBSCAN Technique
2020-02-05 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Recommender Systems and Matrix Factorization | Problem formulation: IMDB Movie reviews, Content based vs Collaborative Filtering, Similarity based Algorithms, Matrix Factorization: PCA, SVD, Matrix Factorization: NMF, Matrix Factorization for Collaborative filtering
2020-02-06 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Recommender Systems and Matrix Factorization | Matrix Factorization for feature engineering, Clustering as MF, Hyperparameter tuning, Matrix Factorization for recommender systems: Netflix Prize Solution, Cold Start problem, Word vectors as MF
2020-02-07 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Recommender Systems and Matrix Factorization | Eigen-Faces, Code example, Assignment-11: Apply Truncated SVD, Revision Recommender Systems and Matrix Factorization
2020-02-08 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Amazon Fashion Discovery Engine | Problem Statement: Recommend similar apparel products in e-commerce using product descriptions and Images, Plan of action, Amazon product advertising API, Data folders and paths, Overview of the data and Terminology, Data cleaning and understanding: Missing data in various features, Understand duplicate rows, Remove duplicates: Part 1, Remove duplicates: Part 2, Text Pre-Processing: Tokenization and Stop-word removal, Stemming, Text based product similarity: Converting text to an n-D vector: bag of words, Code for bag of words based product similarity, TF-IDF: featurizing text based on word-importance, Code for TF-IDF based product similarity
2020-02-09 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Amazon Fashion Discovery Engine | Code for IDF based product similarity, Text Semantics based product similarity: Word2Vec (featurizing text based on semantic similarity), Code for Average Word2Vec product similarity, TF-IDF weighted Word2Vec, Code for IDF weighted Word2Vec product similarity, Weighted similarity using brand and color, Code for weighted similarity, Building a real world solution, Deep learning based visual product similarity: ConvNets: How to featurize an image: edges, shapes, parts, Using Keras + Tensorflow to extract features, Visual similarity based product similarity, Measuring goodness of our solution: A/B testing, Exercise: Build a weighted Nearest neighbor model using Visual, Text, Brand and Color, Revision Amazon Fashion Discovery Engine
2020-02-10 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Netflix Movie Recommendation system | Business/Real World Problem: Problem Definition, Objectives and Constraints, Mapping to ML problem: Data Overview, Mapping to ML problem: ML problem formulation, Exploratory Data Analysis: Data preprocessing, Exploratory Data Analysis: Temporal Train-Test split, Exploratory Data Analysis: Preliminary Data Analysis, Exploratory Data Analysis: Sparse matrix representation, Exploratory Data Analysis: Average ratings for various slices, Exploratory Data Analysis: Cold start problem, Computing Similarity matrices: User-User similarity matrix
2020-02-11 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Netflix Movie Recommendation system | Computing Similarity matrices: Movie-Movie similarity, Computing Similarity matrices: Does movie-movie similarity work?, ML Models: Surprise library, Overview of the modelling strategy, Data Sampling, Google drive with intermediate files, Featurizations for regression, Data transformation for Surprise, Xgboost with 13 features, Surprise Baseline model, Xgboost + 13 features + Surprise baseline model, Surprise KNN predictors, Matrix Factorization models using Surprise, SVD++ with implicit feedback
2020-02-12 | Module 7: Data Mining (Unsupervised Learning) and Recommender systems + Real World Case studies | Netflix Movie Recommendation system | Final models with all features and predictors, Comparison between various models, Assignments, Revision Netflix Movie Recommendation system
2020-02-13 | Module 8: Neural Networks, Computer Vision and Deep Learning | Neural Networks | History of Neural networks and Deep Learning, How Biological Neurons work?, Growth of biological neural networks, Diagrammatic representation: Logistic Regression and Perceptron, Multi-Layered Perceptron (MLP)
2020-02-14 | Module 8: Neural Networks, Computer Vision and Deep Learning | Neural Networks | Notation, Training a single-neuron model, Training an MLP: Chain Rule, Training an MLP: Memoization
2020-02-15 | Module 8: Neural Networks, Computer Vision and Deep Learning | Neural Networks | Backpropagation, Activation functions, Vanishing Gradient problem, Bias-Variance tradeoff, Decision surfaces: Playground
2020-02-16 | Module 8: Neural Networks, Computer Vision and Deep Learning | Neural Networks | Revision Neural Networks
2020-02-17 | Module 8: Neural Networks, Computer Vision and Deep Learning | Deep Multi Layer Perceptrons | Deep Multi-layer perceptrons: 1980s to 2010s, Dropout layers & Regularization, Rectified Linear Units (ReLU), Weight initialization, Batch Normalization
2020-02-18 | Module 8: Neural Networks, Computer Vision and Deep Learning | Deep Multi Layer Perceptrons | Optimizers: Hill-descent analogy in 2D, Optimizers: Hill descent in 3D and contours, SGD Recap, Batch SGD with momentum, Nesterov Accelerated Gradient (NAG), Optimizers: AdaGrad
2020-02-19 | Module 8: Neural Networks, Computer Vision and Deep Learning | Deep Multi Layer Perceptrons | Optimizers: Adadelta and RMSProp, Adam, Which algorithm to choose when?, Gradient Checking and clipping, Softmax and Cross-entropy for multi-class classification, How to train a Deep MLP?, Auto Encoders
2020-02-20 | Module 8: Neural Networks, Computer Vision and Deep Learning | Deep Multi Layer Perceptrons | Word2Vec: CBOW, Word2Vec: Skip-gram, Word2Vec: Algorithmic Optimizations, Revision Deep Multi Layer Perceptrons
2020-02-21 | Module 8: Neural Networks, Computer Vision and Deep Learning | Tensorflow And Keras | Tensorflow and Keras overview, GPU vs CPU for Deep Learning, Google Colaboratory, Install TensorFlow, Online documentation and tutorials, Softmax Classifier on MNIST dataset
2020-02-22 | Module 8: Neural Networks, Computer Vision and Deep Learning | Tensorflow And Keras | MLP: Initialization, Model 1: Sigmoid activation, Model 2: ReLU activation, Model 3: Batch Normalization, Model 4: Dropout, MNIST classification in Keras, Hyperparameter tuning in Keras, Exercise: Try different MLP architectures on MNIST dataset, Revision Tensorflow And Keras
2020-02-23 | Module 8: Neural Networks, Computer Vision and Deep Learning | Convolutional Neural Nets | Biological inspiration: Visual Cortex, Convolution: Edge Detection on images, Convolution: Padding and strides, Convolution over RGB images, Convolutional layer
2020-02-24 | Module 8: Neural Networks, Computer Vision and Deep Learning | Convolutional Neural Nets | Max-pooling, CNN Training: Optimization, Example CNN: LeNet [1998], ImageNet dataset, Data Augmentation, Convolution Layers in Keras, AlexNet, VGGNet, Residual Network
2020-02-25 | Module 8: Neural Networks, Computer Vision and Deep Learning | Convolutional Neural Nets | Inception Network, What is Transfer learning, Code example: Cats vs Dogs, Code Example: MNIST dataset, Assignment: Try various CNN networks on MNIST dataset
2020-02-26 | Module 8: Neural Networks, Computer Vision and Deep Learning | Convolutional Neural Nets | Revision Convolutional Neural Nets
2020-02-27 | Module 8: Neural Networks, Computer Vision and Deep Learning | Long Short-Term Memory (LSTMs) | Why RNNs?, Recurrent Neural Network, Training RNNs: Backprop, Types of RNNs, Need for LSTM/GRU
2020-02-28 | Module 8: Neural Networks, Computer Vision and Deep Learning | Long Short-Term Memory (LSTMs) | LSTM, GRUs, Deep RNN, Bidirectional RNN, Code example: IMDB Sentiment classification
2020-02-29 | Module 8: Neural Networks, Computer Vision and Deep Learning | Long Short-Term Memory (LSTMs) | Exercise: Amazon Fine Food reviews LSTM model, Revision Long Short-Term Memory (LSTMs)
2020-03-01 | Module 9: Deep Learning Real World Case Studies | Human Activity Recognition | Human Activity Recognition Problem definition, Dataset understanding, Data cleaning & preprocessing, EDA: Univariate analysis, EDA: Data visualization using t-SNE, Classical ML models, Deep-learning Model, Exercise: Build deeper LSTM models and hyper-param tune them
2020-03-02 | Module 9: Deep Learning Real World Case Studies | Human Activity Recognition | Revision Human Activity Recognition
2020-03-03 | Module 9: Deep Learning Real World Case Studies | Self Driving Car | Problem Definition, Datasets, Data understanding & Analysis: Files and folders, Dash-cam images and steering angles, Split the dataset: Train vs Test, EDA: Steering angles, Mean Baseline model: simple, Deep-learning model: Deep Learning for regression: CNN, CNN+RNN, Batch load the dataset, NVIDIA’s end to end CNN model, Train the model
2020-03-04 | Module 9: Deep Learning Real World Case Studies | Self Driving Car | Test and visualize the output, Extensions, Assignment, Revision Self Driving Car
2020-03-05 | Module 9: Deep Learning Real World Case Studies | Music Generation Using Deep Learning | Real-world problem, Music representation, Char-RNN with abc-notation: Char-RNN model, Char-RNN with abc-notation: Data preparation
2020-03-06 | Module 9: Deep Learning Real World Case Studies | Music Generation Using Deep Learning | Char-RNN with abc-notation: Many to Many RNN, TimeDistributed-Dense layer, Char-RNN with abc-notation: Stateful RNN, Char-RNN with abc-notation: Model architecture, Model training, Char-RNN with abc-notation: Music generation, Char-RNN with abc-notation: Generate tabla music, MIDI music generation, Survey blog, Assignment
2020-03-07 | Module 9: Deep Learning Real World Case Studies | Music Generation Using Deep Learning | Revision Music Generation Using Deep Learning