0% found this document useful (0 votes)
51 views3 pages

Python for Data Science

Uploaded by

rakshithasai22
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
0% found this document useful (0 votes)
51 views3 pages

Python for Data Science

Uploaded by

rakshithasai22
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 3

CBCS -2022 Sem I – Python for Data Science

SEMESTER I
PYTHON FOR DATA SCIENCE
THEORY
Programme: M.Sc. Data Science Max. Hours: 60
Course Code: P22/MDS/DSC/104 Hours per week: 4
Course Type: DSC Max. Marks: 100
No. of credits: 4

Course Objective:
To explore the programming skills relevant to data science and to gain knowledge of various
libraries and packages like NumPy, Pandas and Matplotlib required for data analysis, data
visualization, natural language processing and machine learning.

Course Outcomes:

CO1: To understand data types in python and to apply array concepts using NumPy.
CO2: Structuring data using NumPy and manipulating the data using Pandas.
CO3: Using Pandas to analyze and work with data sets.
CO4: Graphical visualization of data using Matplotlib.

___________________________________________________________________________
Department of Computer Science, St. Francis College for Women, Begumpet, Hyderabad- 16
CBCS -2022 Sem I – Python for Data Science

MODULE I: Introduction to IPython and NumPy (15 Hrs)

IPython Beyond Normal Python: Help and documentation in IPython, Keyboard Shortcuts in
the IPython Shell, IPython Magic Commands, Input and Output History, IPython and Shell
Commands, Errors and Debugging, Profiling and Timing Code. Introduction to NumPy:
Understanding Data Types in Python, The Basics of NumPy Arrays, Computation on NumPy
Arrays: Universal Functions, Aggregations.

MODULE II: Working with NumPy, Data Manipulation with Pandas (15 Hrs)

Computation on Arrays: Broadcasting, Comparisons, Masks, and Boolean Logic, Fancy


Indexing, Sorting Arrays, Structured Data. Data Manipulation with Pandas: Introducing Pandas
Objects, Data Indexing and Selection, Operating on Data in Pandas, Handling Missing Data.

MODULE III: Working with Datasets using Pandas (15 Hrs)

Hierarchical Indexing, Combining Datasets: Concat and Append, Combining Datasets: Merge
and Join, Aggregation and Grouping, Pivot Tables, Vectorized String Operations, Working
with Time Series, High-Performance Pandas -eval() and query().

MODULE IV: Visualization with Matplotlib (15 Hrs)

General MatplotLib, Simple Line Plots, Simple Scatter Plots, Density and Contour Plots,
Histograms, Binnings, and Density, Customizing Plot Legends, Customizing Colorbars, Text
and Annotation, Three-Dimensional Plotting in Matplotlib, Geographic Data with Basemap,
Visualization with Seaborn.

Textbook:
Python Data Science Handbook: Essential Tools for Working with Data, by Jake VanderPlas,
O’reilly Media, 2017.

Reference Books:
1. Data Science From Scratch: First Principles with Python by Joel Grus, Second Edition,
2019, O’reilly Media.
2. Python for Data Science by Mohd. Abdul Hameed, May 2021, Wiley.
3. Python for Data Science: A Crash Course for Data Science and Analysis, Python
Machine Learning and Big Data by Computer Science Academy.
4. Python for Data Science: The Ultimate Step-by-Step Guide to Python Programming by
Daniel, March 2021, O’reilly.

Web references:
1. https://nptel.ac.in/courses/106/106/106106212/
2. https://www.coursera.org/professional-certificates/ibm-data-science

___________________________________________________________________________
Department of Computer Science, St. Francis College for Women, Begumpet, Hyderabad- 16
CBCS -2022 Sem I – Python for Data Science

3. https://www.geeksforgeeks.org/python-for-data-science/

___________________________________________________________________________
Department of Computer Science, St. Francis College for Women, Begumpet, Hyderabad- 16

You might also like