Python for Data Science
Python for Data Science
SEMESTER I
PYTHON FOR DATA SCIENCE
THEORY
Programme: M.Sc. Data Science Max. Hours: 60
Course Code: P22/MDS/DSC/104 Hours per week: 4
Course Type: DSC Max. Marks: 100
No. of credits: 4
Course Objective:
To explore the programming skills relevant to data science and to gain knowledge of various
libraries and packages like NumPy, Pandas and Matplotlib required for data analysis, data
visualization, natural language processing and machine learning.
Course Outcomes:
CO1: To understand data types in python and to apply array concepts using NumPy.
CO2: Structuring data using NumPy and manipulating the data using Pandas.
CO3: Using Pandas to analyze and work with data sets.
CO4: Graphical visualization of data using Matplotlib.
___________________________________________________________________________
Department of Computer Science, St. Francis College for Women, Begumpet, Hyderabad- 16
CBCS -2022 Sem I – Python for Data Science
IPython Beyond Normal Python: Help and documentation in IPython, Keyboard Shortcuts in
the IPython Shell, IPython Magic Commands, Input and Output History, IPython and Shell
Commands, Errors and Debugging, Profiling and Timing Code. Introduction to NumPy:
Understanding Data Types in Python, The Basics of NumPy Arrays, Computation on NumPy
Arrays: Universal Functions, Aggregations.
MODULE II: Working with NumPy, Data Manipulation with Pandas (15 Hrs)
Hierarchical Indexing, Combining Datasets: Concat and Append, Combining Datasets: Merge
and Join, Aggregation and Grouping, Pivot Tables, Vectorized String Operations, Working
with Time Series, High-Performance Pandas -eval() and query().
General MatplotLib, Simple Line Plots, Simple Scatter Plots, Density and Contour Plots,
Histograms, Binnings, and Density, Customizing Plot Legends, Customizing Colorbars, Text
and Annotation, Three-Dimensional Plotting in Matplotlib, Geographic Data with Basemap,
Visualization with Seaborn.
Textbook:
Python Data Science Handbook: Essential Tools for Working with Data, by Jake VanderPlas,
O’reilly Media, 2017.
Reference Books:
1. Data Science From Scratch: First Principles with Python by Joel Grus, Second Edition,
2019, O’reilly Media.
2. Python for Data Science by Mohd. Abdul Hameed, May 2021, Wiley.
3. Python for Data Science: A Crash Course for Data Science and Analysis, Python
Machine Learning and Big Data by Computer Science Academy.
4. Python for Data Science: The Ultimate Step-by-Step Guide to Python Programming by
Daniel, March 2021, O’reilly.
Web references:
1. https://nptel.ac.in/courses/106/106/106106212/
2. https://www.coursera.org/professional-certificates/ibm-data-science
___________________________________________________________________________
Department of Computer Science, St. Francis College for Women, Begumpet, Hyderabad- 16
CBCS -2022 Sem I – Python for Data Science
3. https://www.geeksforgeeks.org/python-for-data-science/
___________________________________________________________________________
Department of Computer Science, St. Francis College for Women, Begumpet, Hyderabad- 16