Python Data Science Cookbook - (Index)
Python Data Science Cookbook - (Index)
A columns
preprocessing 43, 44
accuracy 234 comprehension 19
AdaBoost 326 confusion matrix 234
anonymous functions cost function 269
creating, with lambda 34 counter
arange function about 5
about 60 reference link 5
URL 60 CountVectorizer class
arrays about 146
processing, from tabular data 39-42 reference link 146
axioms cross-validation iterators
about 186 reference link 313
URL 186 used, with L1 and L2 shrinkage 301-313
csv library
B URL 42
curse of dimensionality 152
Bagging
about 316, 317
leveraging 317-325
D
BaseEstimator data
URL 82 clustering, k-means method used 196-202
Boosting dimension, reducing with random
about 316, 325 projection 171-174
two-class classification problem 325-340 grouping 95-99
Bootstrapping 317 imputing 117-119
Copyright © 2015. Packt Publishing, Limited. All rights reserved.
411
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
decision trees filters
advantages 256 using 36
building, for multiclass problems 255-265 function
disadvantages 256 behavior altering, with decorators 31-33
reference link 255 embedding, in another function 28
decorators passing, as parameter 29
used, for altering function behavior 31-33 passing, as variable 27
deque returning 30, 31
about 19 functools
URL 19 about 31
derivational patterns URL 31
reference link 138
dictionaries G
about 5
dictionary objects, using 2-5 generator
dictionary of dictionaries, using 6, 7 generating 24, 25
URL 4 Gradient Boosting
dimensionality reduction 152 about 341
distance measures demonstrating 343-354
calculating 187-189 reference link 354
URL 192 simple regression problem 341, 342
working with 186-191 Graphviz package
documents URL 265
classifying, Naïve Bayes used 242-254
dot plots H
using 95-99
heat maps
URL 104
E using 104-108
ensemble methods
Bagging 317-325 I
Boosting 325-340
information gain
Gradient Boosting 341
about 259
error rate 234
reference link 259
Exploratory Data Analysis (EDA) 86
instance-based learning 235
ExtraTreesClassifier class
inverse document frequencies
reference link 374
calculating 147-150
Copyright © 2015. Packt Publishing, Limited. All rights reserved.
412
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
Itertools.dropwhile list
reference link 22 list comprehension, creating 19-21
izip sorting 45, 46
using 37-39 writing 15-18
loadtxt method
K about 42
reference link 42
kernel-based perceptron Local Outlier Factor (LOF)
URL 395 used, for discovering outliers 216-225
kernel methods
learning 192-196 M
linear kernel 196
polynomial kernel 196 machine learning
using 192-196 with scikit-learn 75-84
kernel PCA map function
using 160-166 using 35
key matplotlib
used, for sorting 46-51 about 55
k-fold cross-validation 301 plotting with 65-74
k-means method URL 74
cluster evaluation, measures 197 matrix decomposition 152
used, for data clustering 196-202 multiclass problems
K-Nearest Neighbor (KNN) 235 solving, with decision trees 255-265
multivariate data
L scatter plots, using 100-103
L1 shrinkage (LASSO) N
used, with regression 293-300
L2 shrinkage (ridge) Naïve Bayes
used, with regression 283-292 used, for classifying documents 242-254
lambda namedtuple
used, for creating anonymous functions 34 URL 12
Last In, First Out (LIFO) 18 nearest neighbors
Latent Semantic Analysis (LSA) obtaining 234-241
about 170 Non-negative Matrix Factorization (NMF)
reference link 170 reference link 175
lazy learner 235 used, for decomposing feature
Least absolute shrinkage and selection matrices 175-183
Copyright © 2015. Packt Publishing, Limited. All rights reserved.
413
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
OrderedDict container Random Forest
about 4 about 358
URL 4 implementing 359-368
outliers RandomForestClassifier class
discovering, local outlier factor method about 364
used 216-225 reference link 364
finding, in univariate data 208-216 randomization 317
out-of-bag estimation (OOB) random projection
about 368 data dimension, reducing with 171-174
reference link 368 reference link 174
random sampling
P performing 120, 121
real-valued numbers
pairwise_distance method predicting, regression used 268-282
about 192 recursive feature selection 279
URL 192 regression
partial_fit method about 268
about 410 stochastic gradient descent, using 396-405
URL 410 used, for predicting real-valued
percentiles, NumPy numbers 268-282
reference link 92 with L1 shrinkage (LASSO) 293-300
perceptron with L2 shrinkage (ridge) 283-292
reference link 395 ridge regression 284
used, as online learning algorithm 388-395 Rotational Forest
polynomial kernel about 376
about 196 building 376-384
URL 196 rote classifier algorithm 235
polysemy 170
Principal Component Analysis (PCA) 153 S
principal components
extracting 153-159 scatter plots
priority queues used, for multivariate data 100-103
URL 220 scikit-learn
progressive sampling 122 machine learning with 75-84
pyplot URL 84
about 87 SciPy
reference link 87 URL, for documentation 45
Copyright © 2015. Packt Publishing, Limited. All rights reserved.
414
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
Singular Value Decomposition (SVD) TFIDF transformer
used, for extracting features 166-170 calculating 150
snowball stemmers tokenization
reference link 137 performing 127-130
sparse matrix representation tuples
reference link 143 about 7
standardization 124 creating 8-12
star convex-shaped data points manipulating 8-12
URL 202
stemming, words U
performing 135-137
stochastic gradient descent univariate data
used, for classification 405-409 analyzing, graphically 87-93
used, for regression 396-405 outliers, finding 208-216
stop words
removing 131-134 V
stratified sampling 121
vector quantization 202-208
summary statistics
performing 109-113
plotting 109-113
W
word lemmatization
T performing 138, 139
words
tabular data
stemming 135-137
arrays, processing 39-42
word_tokenize function
term frequencies
URL 129
calculating 147-150
using 129
Term Frequency Inverse Document Frequency
(TFIDF) 169
text
Z
representing, as bag of words 140-146 zip
text mining using 37-39
reference link 133
Copyright © 2015. Packt Publishing, Limited. All rights reserved.
415
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
Copyright © 2015. Packt Publishing, Limited. All rights reserved.
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
Thank you for buying
Python Data Science Cookbook
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
Python Data Science
Essentials
ISBN: 978-1-78528-042-9 Paperback: 258 pages
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.
Practical Data Science
Cookbook
ISBN: 978-1-78398-024-6 Paperback: 396 pages
in Python.
Gopi, Subramanian. Python Data Science Cookbook, Packt Publishing, Limited, 2015. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/manchester/detail.action?docID=4191189.
Created from manchester on 2020-07-07 02:46:03.