18ETCS002122 Assignment (Data Science)
Name: SUDARSHAN C Registration number: 18ETCS002122
Declaration Sheet
Student Name SUDARSHAN C
Reg. No 18ETCS002122
Declaration
The assignment submitted herewith is a result of my own investigations and that I have
conformed to the guidelines against plagiarism as laid out in the Student Handbook. All sections
of the text and results, which have been obtained from other sources, are fully referenced. I
understand that cheating and plagiarism constitute a breach of University regulations and will
be dealt with accordingly.
Signature of the
Date
Student
Signature of the Course Leader and date Signature of the Reviewer and date
Part A
A1.1
Python is an open-source, interpreted, high-level language and provides a great approach to object-oriented programming. It is one of the best languages used by data scientists for various data science projects and applications. Python provides great functionality for dealing with mathematics, statistics and scientific computation, and it offers excellent libraries for data science applications.
One of the main reasons why Python is widely used in the scientific and research communities is its ease of use and simple syntax, which make it easy to adopt for people who do not have an engineering background. It is also well suited for quick prototyping.
• Scalability: Python scales well as projects and teams grow. Among the widely used languages, Python is a leader in this respect, which keeps opening up more possibilities.
• Libraries and Frameworks: Due to its popularity, Python has hundreds of different libraries and frameworks, which is a great addition to your development process. They save a lot of manual effort and can often replace a whole hand-written solution. As a Data Scientist, you will find that many of these libraries are focused on Data Analytics and Machine Learning, and there is also strong support for Big Data. This alone is a strong reason to learn Python as your first language.
• Huge Community: As mentioned before, Python has a powerful community. You might think that this should not be one of the main reasons to select Python, but the truth is the opposite.
A1.2
An identifier is a user-defined name used to represent the basic building blocks of Python. It can be a variable, a function, a class, a module, or any other object.
Examples of valid identifiers:
• num1
• FLAG
Python is completely object-oriented and dynamically typed. You do not need to declare variables before using them, or declare their type. Every variable in Python is an object. A variable is created the moment we first assign a value to it. A variable is a name given to a memory location; it is the basic unit of storage in a program.
A variable name cannot start with a number. A variable name can only contain alphanumeric characters and underscores. Variable names are case-sensitive, e.g. name, Name and NAME are three different variables. Reserved words, i.e. keywords, cannot be used for naming variables. Leading underscores are used by the Python interpreter and in built-in identifiers, so we avoid beginning a variable name with an underscore.
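These rules can be checked with a short sketch (the names here are just illustrations):

```python
# Variables are created on first assignment; no type declaration is needed.
num1 = 10          # valid identifier
FLAG = True        # valid identifier

# Variable names are case-sensitive: these are three different variables.
name = "a"
Name = "b"
NAME = "c"

# Rebinding a name to a value of a different type is allowed (dynamic typing).
num1 = "ten"
print(type(num1).__name__)  # str
```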
A1.3
Namespaces in Python
A namespace is a collection of currently defined symbolic names along with information about the
object that each name references. You can think of a namespace as a dictionary in which the keys
are the object names and the values are the objects themselves. Each key-value pair maps a name to
its corresponding object.
1. Built-In
2. Global
3. Enclosing
4. Local
The built-in namespace contains the names of all of Python’s built-in objects. These are available at
all times when Python is running. The global namespace contains any names defined at the level of
the main program. Python creates the global namespace when the main program body starts, and it
remains in existence until the interpreter terminates. Strictly speaking, this may not be the only
global namespace that exists. The interpreter also creates a global namespace for any module that
your program loads with the import statement.
Not every Python namespace can be accessed from every part of the program. A name is in scope in a part of a program if you can access it there without having to use a prefix.
The lookup order is: the local namespace, then any enclosing namespace, then the global namespace, then the built-in namespace. Also, a nested function creates a nested variable scope inside the outer function's scope.
Example:
a = 1
def func1():
    b = 2
    def func2():
        c = 3
In this code, 'a' is in the global namespace, 'b' is in the local namespace of func1, and 'c' is in the nested local namespace of func2. To func2, 'c' is local, 'b' is nonlocal, and 'a' is global. By nonlocal, we mean it isn't global, but isn't local either. Inside func2 you can write 'c' and read both 'a' and 'b'; but you can't assign to 'a' or 'b' there without the global or nonlocal keyword, because a plain assignment would create a new local variable instead.
There are four kinds of Python namespaces: built-in, global, enclosing, and local, and the same holds for variable scope in Python. The global keyword lets us refer to a name in the global scope; likewise, the nonlocal keyword lets us refer to a name in an enclosing (nonlocal) scope.
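A minimal sketch of the global and nonlocal keywords in action, reusing the names from the example above:

```python
a = 1

def func1():
    b = 2
    def func2():
        nonlocal b   # refer to b in func1's enclosing scope
        global a     # refer to a in the global scope
        b = 20
        a = 10
    func2()
    return b

result = func1()
print(result, a)  # 20 10
```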
A1.4
An exception is an error which happens at the time of execution of a program. While running a program, Python may raise an exception that should be handled to prevent your program from crashing. In Python, exceptions trigger automatically on errors, or they can be raised and intercepted by your code. An exception indicates that, although the event can occur, it happens infrequently. When a method is not able to handle an exception, the exception is propagated to its caller. Eventually, when an exception propagates out of the main function, the program is terminated abruptly.
Python uses try and except keywords to handle exceptions. Both keywords are followed by
indented blocks.
The try block contains one or more statements which are likely to encounter an exception. If the
statements in this block are executed without an exception, the subsequent except: block is
skipped. If the exception does occur, the program flow is transferred to the except: block. The
statements in the except block are meant to handle the cause of the exception appropriately. For
example, returning an appropriate error message.
Try Except in Python: Try and Except statement is used to handle these errors within our code in
Python. The try block is used to check some code for errors i.e. the code inside the try block will
execute when there is no error in the program. Whereas the code inside the except block will
execute whenever the program encounters some error in the preceding try block.
Syntax:
try:
    # Some Code
except:
    # Executed if an error occurs
    # in the try block
Python provides a keyword finally, which is always executed after try and except blocks. The finally
block always executes after normal termination of try block or after try block terminates due to
some exception.
try-except
statements allow one to detect and handle exceptions. There is even an optional else clause for situations where code needs to run only when no exceptions are detected.
try-finally
statements allow only for detection and processing of any obligatory clean-up (whether or not exceptions occur), but otherwise have no facility for dealing with exceptions.
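The full try/except/else/finally flow can be sketched as follows:

```python
def safe_divide(x, y):
    try:
        result = x / y
    except ZeroDivisionError:
        # Runs only when the try block raises this exception.
        return 'division by zero'
    else:
        # Runs only when no exception occurred in the try block.
        return result
    finally:
        # Always runs, whether or not an exception occurred.
        print('division attempted')

print(safe_divide(10, 2))  # 5.0
print(safe_divide(10, 0))  # division by zero
```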
A1.5
1. Pandas:
The pandas library helps you perform data analysis and data manipulation in Python. Additionally, it provides fast and flexible data structures that make it easy to work with relational and structured data.
• It provides sophisticated indexing functionality to make it easy to reshape, slice and dice, perform aggregations, and select subsets of data.
2. NumPy:
NumPy is mainly used for its support for N-dimensional arrays. Operations on these multi-dimensional arrays can be up to 50 times faster than on Python lists, making NumPy a favourite for data scientists. NumPy is also used by other libraries such as TensorFlow for their internal computation on tensors. NumPy also provides fast precompiled functions for numerical routines which would be hard to implement manually. To achieve better efficiency, NumPy uses array-oriented computation, so working on many values at once becomes easy.
• For numerical data, NumPy arrays are a much more efficient way of storing and
manipulating data
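A minimal sketch of array-oriented computation with NumPy:

```python
import numpy as np

# A 2-dimensional array created from a nested list.
arr = np.array([[1, 2, 3], [4, 5, 6]])
print(arr.shape)  # (2, 3)

# Vectorised computation: every element is doubled without an explicit loop.
doubled = arr * 2
print(int(doubled.sum()))  # 42
```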
3. Scikit-learn:
Scikit-learn is arguably the most important library in Python for machine learning. After cleaning and
manipulating your data with Pandas or NumPy, scikit-learn is used to build machine learning models
as it has tons of tools used for predictive modelling and analysis. There are many reasons to use
scikit-learn. To name a few, you can use it to build several types of machine learning models, supervised and unsupervised, cross-validate the accuracy of models, and conduct feature importance analysis.
4. TensorFlow
TensorFlow is one of the most popular Python libraries for implementing neural networks. It uses multi-dimensional arrays, also known as tensors, which allow it to perform several operations on a particular input. Because it is highly parallel in nature, it can train models across multiple CPUs and GPUs for highly efficient and scalable training; this style of feeding data through the system is often called pipelining.
5. SciPy
As the name suggests, SciPy is mainly used for its scientific and mathematical functions, built on top of NumPy. Some useful functionality this library provides includes statistics functions, optimization functions, and signal processing functions. It also includes functions for computing integrals numerically and solving differential equations.
A1.6:
NumPy is extremely fast for binary data loading and storage, including support for memory-mapped arrays. It is a Python library used for working with arrays. It also has functions for working in the domains of linear algebra, Fourier transforms, and matrices. NumPy was created in 2005 by Travis Oliphant. It is an open-source project and you can use it freely. NumPy stands for Numerical Python.
Plotly and pandas can be combined to provide interactive features like zooming and panning. The popular pandas data analysis and manipulation tool provides plotting functions on its DataFrame and Series objects, which have historically produced matplotlib plots. Since version 0.25, pandas has provided a mechanism to use different backends, and as of version 4.8 of Plotly, you can use a Plotly-Express-powered backend for pandas plotting. This means you can produce interactive plots directly from a DataFrame, without even needing to import Plotly.
The pandas library offers a flexible and high-performance group-by facility, enabling you to slice and dice, and summarize data sets in a natural way. It is primarily used for data analysis, and it is one of the most commonly used Python libraries. It provides some of the most useful tools to explore, clean, and analyse your data.
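A small sketch of the group-by facility mentioned above (the data is made up for illustration):

```python
import pandas as pd

df = pd.DataFrame({
    'city':  ['A', 'A', 'B', 'B'],
    'sales': [10, 20, 30, 40],
})

# Summarise sales per city in a single expression.
totals = df.groupby('city')['sales'].sum()
print(int(totals['A']), int(totals['B']))  # 30 70
```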
Part B
B.1
B1.1
The Pandas Series Object:
A Pandas Series is a one-dimensional array of indexed data. It can be created from a list or array as follows:
data = pd.Series([0.25, 0.5, 0.75, 1.0])
A DataFrame object is an analog of a two-dimensional array with both flexible row indices and flexible column names. Just as you might think of a two-dimensional array as an ordered sequence of aligned one-dimensional columns, you can think of a DataFrame as a sequence of aligned Series objects, for example (where population and area are two Series sharing an index):
states = pd.DataFrame({'population': population,
                       'area': area})
Parsing a JSON dataset using pandas is much more convenient. Pandas allows you to convert a list of lists into a DataFrame and to specify the column names separately. A JSON parser, which transforms JSON text into another representation, must accept all texts that conform to the JSON grammar; it may also accept non-JSON forms or extensions.
Working with large JSON datasets can be challenging, particularly when they are too large to fit into memory. In cases like this, a combination of command-line tools and Python can make for an efficient way to explore and analyse the data.
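For JSON files too large for memory, pandas can also iterate over newline-delimited JSON in chunks; a sketch (the file name and contents are made up):

```python
import json
import pandas as pd

# Write a small newline-delimited JSON file to iterate over.
with open('records.json', 'w') as f:
    for i in range(10):
        f.write(json.dumps({'id': i, 'value': i * 2}) + '\n')

# chunksize returns an iterator of DataFrames instead of one large frame.
total = 0
for chunk in pd.read_json('records.json', lines=True, chunksize=4):
    total += int(chunk['value'].sum())

print(total)  # 0 + 2 + ... + 18 = 90
```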
Now you can read the JSON and save it as a pandas data structure, using the command read_json.
import pandas as pd
data = pd.read_json('http://api.population.io/1.0/population/India/today-and-tomorrow/?format=json')
print(data)
pandas.ExcelFile.parse() is also a parsing function.
The pandas.read_csv() function reads data in text format. Passing header=None reads data that does not have a header row, and the names argument lets you initialise the header yourself.
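A sketch of both read_csv cases (the file name and column names are illustrative):

```python
import pandas as pd

# Create a small headerless text file to read.
with open('scores.txt', 'w') as f:
    f.write('18ETCS002112,36\n18ETCS002122,32\n')

# No header row: header=None assigns numeric column labels.
df1 = pd.read_csv('scores.txt', header=None)

# Initialising the header ourselves with the names argument.
df2 = pd.read_csv('scores.txt', header=None, names=['reg_no', 'marks'])

print(list(df1.columns))  # [0, 1]
print(list(df2.columns))  # ['reg_no', 'marks']
```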
B1.2
An SQLite database is normally stored in a single ordinary disk file. However, in certain
circumstances, the database might be stored in memory.
The most common way to force an SQLite database to exist purely in memory is to open the
database using the special filename ":memory:". In other words, instead of passing the name of a
real disk file into one of the sqlite3_open(), sqlite3_open16(), or sqlite3_open_v2() functions, pass in
the string ":memory:". For example:
rc = sqlite3_open(":memory:", &db);
When this is done, no disk file is opened. Instead, a new database is created purely in memory. The
database ceases to exist as soon as the database connection is closed. Every :memory: database is
distinct from every other. So, opening two database connections each with the filename ":memory:"
will create two independent in-memory databases.
The special filename ":memory:" can be used anywhere that a database filename is permitted, for example as the filename in an ATTACH command.
Note that in order for the special ":memory:" name to apply and to create a pure in-memory
database, there must be no additional text in the filename. Thus, a disk-based database can be
created in a file by prepending a pathname, like this: "./:memory:".
The special ":memory:" filename also works when using URI filenames. For example:
rc = sqlite3_open("file::memory:", &db);
In-memory databases are allowed to use shared cache if they are opened using a URI filename. If the unadorned ":memory:" name is used to specify the in-memory database, then that database always has a private cache and is thus only visible to the database connection that originally opened it.
However, the same in-memory database can be opened by two or more database connections as
follows:
rc = sqlite3_open("file::memory:?cache=shared", &db);
This allows separate database connections to share the same in-memory database. Of course, all
database connections sharing the in-memory database need to be in the same process. The
database is automatically deleted and memory is reclaimed when the last connection to the
database closes.
If two or more distinct but shareable in-memory databases are needed in a single process, then the
mode=memory query parameter can be used with a URI filename to create a named in-memory
database:
rc = sqlite3_open("file:memdb1?mode=memory&cache=shared", &db);
When an in-memory database is named in this way, it will only share its cache with another
connection that uses exactly the same name.
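The same behaviour can be observed from Python's built-in sqlite3 module; a sketch mirroring the C calls above:

```python
import sqlite3

# Two plain ":memory:" connections are two independent databases.
con1 = sqlite3.connect(':memory:')
con2 = sqlite3.connect(':memory:')

con1.execute('CREATE TABLE t (x INTEGER)')
con1.execute('INSERT INTO t VALUES (1)')

# con2 cannot see con1's table: the databases are distinct.
try:
    con2.execute('SELECT * FROM t')
    shared = True
except sqlite3.OperationalError:
    shared = False

print(shared)  # False
```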
B1.3
Tasks are the building blocks of Celery applications. A task is a class that can be created out of any
callable. It performs dual roles in that it defines both what happens when a task is called (sends a
message), and what happens when a worker receives that message. Every task class has a unique
name, and this name is referenced in messages so the worker can find the right function to execute.
A task message is not removed from the queue until that message has been acknowledged by a
worker. A worker can reserve many messages in advance and even if the worker is killed – by power
failure or some other reason – the message will be redelivered to another worker.
A minimal task can be defined with the @shared_task decorator:

@shared_task
def add(x, y):
    return x + y

You can also easily create a task from any callable by using the app.task() decorator:

@app.task
def add(x, y):
    return x + y

There are also many options that can be set for the task; these can be specified as arguments to the decorator.
Python decorators allow you to change the behaviour of a function without modifying the function itself. We use a decorator when we need such a change in behaviour; a few good examples are when you want to add logging, test performance, perform caching, verify permissions, and so on.
Decorators are usually placed before the definition of the function you want to decorate. As an example, we can create a simple decorator that converts a sentence to uppercase, by defining a wrapper inside an enclosing function. Python provides an easy way for us to apply decorators: we simply use the @ symbol before the function we'd like to decorate. We can also apply multiple decorators to a single function.
def my_function():
    print('I am a function.')

# Assign the function to a variable without parentheses. We don't want to execute the function.
description = my_function
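The uppercase decorator described above can be sketched as:

```python
def uppercase(func):
    # The wrapper changes behaviour without modifying func itself.
    def wrapper():
        return func().upper()
    return wrapper

@uppercase
def greet():
    return 'hello there'

print(greet())  # HELLO THERE
```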
• At this point, we have a ready environment. Let's test it by sending a task that will calculate the square root of a value and return a result. First, we must define our task module tasks.py inside the server. In the following chunk of code, we have the imports necessary for our function that will calculate the square root:

from math import sqrt
from celery import Celery

• Now, let's create the instance of Celery which will represent our client application, informing the broker:

app = Celery('tasks', broker='redis://192.168.25.21:6379/0')

• Then, we have to set up our result backend, which will also be in Redis, as follows:

app.conf.CELERY_RESULT_BACKEND = 'redis://192.168.25.21:6379/0'
• With the basics ready, let's define our task with the @app.task decorator:
@app.task
def square_root(value):
    return sqrt(value)
• At this point, since we have our tasks.py module defined, we need to initiate our workers
inside the server, where Redis and Celery (with support to Redis) are installed.
• Now, we have a Celery server waiting to receive tasks and send them to workers. The next
step is to create an application on the client side to call tasks.
• In the machine that represents the client, we have our virtual environment celery_env already set up as we did earlier. So, now it is simple to create, step by step, a module task_dispatcher.py, as follows:
1. We import the logging module to exhibit information referring to the execution of the program, and the Celery class inside the celery module, as follows:

import logging
from celery import Celery

2. The next step is to create an instance of the Celery class, informing the module containing the tasks and then the broker, as done on the server side, and to set the result backend. This is done with the following code:

app = Celery('tasks', broker='redis://192.168.25.21:6379/0')
app.conf.CELERY_RESULT_BACKEND = 'redis://192.168.25.21:6379/0'
3. Let us create a function to encapsulate the sending of the square_root(value) task. We will create the manage_sqrt_task(value) function as follows:

def manage_sqrt_task(value):
    result = app.send_task('tasks.square_root', args=(value,))
    logging.info(result.get())

if __name__ == '__main__':
    manage_sqrt_task(4)
Broker
• The broker is definitely a key component in Celery. Through it, we get to send and receive messages and communicate with workers.
• The most complete brokers in terms of functionality are RabbitMQ and Redis. We will use Redis as the broker as well as the result backend.
B1.4
Celery has an architecture based on pluggable components and a mechanism of message exchange
that uses a protocol according to a selected message transport (broker).
• The client components, as presented in the previous diagram, have the function of creating
and dispatching tasks to the brokers.
• A task is defined by using the @app.task decorator, which is accessible through an instance of the Celery application that, for now, will be called app.
• There are several types of tasks: synchronous, asynchronous, periodic, and scheduled. When
we perform a task call, it returns an instance of type AsyncResult.
• The AsyncResult object allows the task status to be checked, as well as its result when it exists. However, to make use of this mechanism, another component, the result backend, has to be active.
• The message transport (broker) is definitely a key component in Celery. Through it, we get to send and receive messages and communicate with workers.
• The most complete brokers in terms of functionality are RabbitMQ and Redis. We will use Redis as the broker as well as the result backend.
• Workers are responsible for executing the tasks they have received. Celery provides a series of mechanisms so that we can find the best way to control how workers behave, such as: concurrency mode, remote control, and revoking tasks.
• The result backend component has the role of storing the status and result of the task to return to the client application. Among the result backends supported by Celery, we can highlight RabbitMQ, Redis, MongoDB, and Memcached, among others.
Virtual environment:
We will set up two machines in Linux. The first one, hostname foshan, will perform the client role, where the Celery app will dispatch the tasks to be executed. The other machine, hostname Phoenix, will perform the role of the broker and result backend, and host the queues consumed by workers.
Client Machine
• We will set up a virtual environment with Python 3.3, using the tool pyvenv. The goal of
pyvenv is to not pollute Python present in the operating system with additional modules, but
to separate the developing environments necessary for each project.
• Now that we have a virtual environment, starting from the point where you already have setuptools or pip installed, we will install the necessary packages for our client. Let's install the Celery framework with the following command:

$ pip install celery

• To set up the server machine, we will start by installing Redis, which will be our broker and result backend. Once installed, the Redis server is started with the following command:

$ redis-server
B.2
B2.1
B2.2
import sqlite3

# The table and column names below are illustrative.
query = """
CREATE TABLE marks
(reg_no VARCHAR(20), sub1 INTEGER, sub2 INTEGER, sub3 INTEGER
);"""

con = sqlite3.connect(':memory:')
con.execute(query)
con.commit()

stmt = "INSERT INTO marks VALUES(?, ?, ?, ?)"
data = [('18ETCS002112', 36, 44, 29),
        ('18ETCS002122', 32, 36, 34)]
con.executemany(stmt, data)
con.commit()

import pandas as pd
cursor = con.execute('SELECT * FROM marks')
df = pd.DataFrame(cursor.fetchall(), columns=[x[0] for x in cursor.description])
df
B2.3
Code for a task that will calculate the square root of a value: we import the Celery class from the celery package and sqrt from the math module, and using the @app.task decorator we register the task (the broker address is illustrative).

from math import sqrt
from celery import Celery

app = Celery('tasks', broker='redis://localhost:6379/0')

@app.task
def square(n):
    return sqrt(n)

square(9)

Calling square(9) directly runs the function locally; square.delay(9) would send it to a worker instead.