A Very Basic Introduction To Scientific Python Programming
A Very Basic Introduction To Scientific Python Programming
Python programming
Hans Petter Langtangen1,2 (hpl@simula.no)
Leif Rune Hellevik3,1 (leif.r.hellevik@ntnu.no)
1
Nov 1, 2015
Contents.
This note introduces very basic programming elements, such as
variables for numbers, lists, and arrays
while loops and for loops
functions
if tests
plotting
through examples involving a mathematical formula. A glimpse of vectorization of code is also given.
Contents
1 Variables, loops, lists, and arrays
1.1 Getting access to Python . . . . . . . . .
1.2 Mathematical example . . . . . . . . . . .
1.3 A program for evaluating a formula . . . .
1.4 Formatted output with text and numbers
1.5 While loops . . . . . . . . . . . . . . . . .
1.6 Lists . . . . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
2
2
3
3
4
5
7
1.7
1.8
1.9
1.10
For loops . . . . . . . .
Arrays . . . . . . . . . .
Mathematical functions
Plotting . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
8
10
11
12
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
14
14
16
17
19
20
3 Files
3.1 File reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2 File writing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
21
21
22
4 Classes
23
4.1 A very simple class . . . . . . . . . . . . . . . . . . . . . . . . . . 23
4.2 A class for representing a mathematical function . . . . . . . . . 24
5 Exercises
25
Index
33
1.1
Simple mathematical calculations can be done in plain Python, but for more
advanced scientific computing, as we do here, several add-on packages are needed.
Getting all software correctly installed used to be quite challenging, but today
there are several easy-to-use approaches to get access to Python.
Mac and Windows. We recommend to download and install Anaconda1 ,
which is a free distribution of Python that comes with most of the packages you
need for advanced scientific computing.
Ubuntu. Debian-based Linux systems, such as Ubuntu, can also use Anaconda,
but it is more common to use the apt-get command-line tool or the Ubuntu
installer to install a set of Debian packages. Here is a list of the packages you
need to install for this introduction:
1 https://store.continuum.io/cshop/anaconda/
In addition, run
Terminal>
Terminal>
Terminal>
Terminal>
Terminal>
pip
pip
pip
pip
pip
install
install
install
install
install
nose
pytest
ipython --upgrade
tornado --upgrade
pyzmq --upgrade
Web access. You can also access Python directly through a web browser
without having it installed on your local computer. We refer to the document
How to access Python for doing scientific computing2 for more details on such
tools and also installations based on Anaconda and Ubuntu.
1.2
Mathematical example
1.3
The program is pure text and must be placed in a pure text file using a text editor.
Popular text editors are Gedit, Nano, Emacs, and Vim on Linux, TextWrangler
on Mac OS X, and Notepad++ on Windows. Save the text to a program file
whose name ends in .py, say distance.py.
The program is run in a terminal window (Command Prompt on Windows,
Terminal application on Mac OS X, xterm or gnome-terminal on Linux):
Terminal> python distance.py
1.025
2 http://hplgit.github.io/edu/accesspy/accesspy.html
The result of the print statement is the number 1.025 in the terminal window.
As an alternative to writing programs in a text editor and executing them in
a terminal window, you may use the Spyder3 graphical interface which gives a
more Matlab-style environment to work in. Anaconda installations come with
Spyder.
The distance.py program first contains four assignment statements where
we assign numbers or the results of arithmetic expressions to variables. The
variables have names coinciding with the mathematical notation used in the
formula: t, v0, a, and s. You may think of variables in this programs just as
variables in mathematics.
More technical details.
A statement like t = 0.5 works as follows. First, the right-hand side is
interpreted by Python as a real number and a float object containing
the value 0.5 is created. Then the name t is defined as a reference to this
object.
In the statement s = v0*t + 0.5*a*t**2, Python will first go to the
right-hand side and observe that it is an arithmetic expression involving
variables with known values (or names referring to existing objects). The
arithmetic expression is calculated, resulting in a real number that is saved
as a float object with value 1.025 in the computers memory.
Everything in Python is an object of some type. Here, t, a, and s
are float objects, representing real (floating-point) numbers, while v0 is
an int object, representing the integer 2. There are many other types of
objects: strings, lists, tuples, dictionaries, arrays, files, ...
1.4
For any object s in Python, print s will (normally) print the contents of s.
However, sometimes we want to combine the content of an object with some
text. Say we want to print s=1.025 rather than just the number. This is easily
accomplished using printf syntax:
print s=%g % s
print s=%.2f % s
where the f signifies a decimal number and the preceding .2 means 2 decimals.
Scientific notation, as in s=1.03E+00 (1.03100 ), is specified as %.2E (2 decimals).
The printf syntax is available in numerous programming languages. Python
also offers a related variant, called format string syntax:
print s={s:.2f}.format(s=s)
1.5
While loops
Suppose we want to make a table with two columns, one with t values and one
with the corresponding s values. Now we have to repeat a lot of calculations
with the formula (1). This is easily done with a loop. There are two types of
loops in Python: while loops and for loops.
Let the t values go from 0 to 2 in increments of 0.1. The following program
applies a while loop:
v0 = 2
a = 0.2
dt = 0.1 # Increment
t = 0
# Start value
while t <= 2:
s = v0*t + 0.5*a*t**2
print t, s
t = t + dt
So, how do we interpret the contents of this program? First we initialize four
variables: v0, a, dt, and t. Everything after # on a line is a comment and does
not affect what happens in the program, but is meant to be of help for a human
reading the program. Then comes the while loop:
5
while condition:
<intented statement>
<intented statement>
<intented statement>
Observe the colon at the end of the while line. The set of indented statements
are repeated as long as the expression or variable condition evaluates to True.
In the present case, the condition is the boolean expression t <= 2, so as long
as the value is less than or equal to 2, t <= 2 evaluates to True, otherwise it
evaluates to False.
Error in the code.
According to the code, the last pass in the while loop should correspond
to t = 2, but looking at the output, we see that the last print statement
has t = 1.9. The next test in the while condition involves t = 2 and the
boolean condition is expected to be 2 == 2. However, it seems that the
condition is False since the computations for t = 2 are not printed. Why
do we experience this behavior?
The Python Online Tutor4 is a great tool to examine the program flow.
Consider this little loop run in the Python Online Tutor:
a = 0
da = 0.4
while a <= 1.2:
print a
a = a + da
The Python Online Tutor executes each statement and displays the contents of
variables such that you can track the program flow and the evolution of variables.
So, why is not a printed for 1.2? That is, why does a == 1.2 evaluate to
True when a is (expected to be) 2? We must look at the accuracy of a to
investigate this question and write it out with 16 decimals in scientific notation
(printf specification %.16E):
a = 0
da = 0.4
while a <= 1.2:
print a
a = a + da
# Inspect all decimals in da and a
print da=%.16E\na=%.16E % (da, a)
print a <= 1.2: %g % (a <= 1.2)
The problem is that da is not exactly 0.4, but contains a small round-off
error. Doing a = a + da then results in a slightly inaccurate a. When the
exact a should reach 1.2, the a in the program has a an error and equals
1.2000000000000002. Obviously the test for exact equality, a == 1.2, becomes
False, and the loop is terminated.
Rule: use exact equality == when comparing real numbers.
Always compare real numbers using the difference and a tolerance:
a = 1.2
b = 0.4 + 0.4 + 0.4
boolean_condition1 = a == b
tol = 1E-14
boolean_condition2 = abs(a - b) < tol
# False
# True
The Python Online Tutor is ideal for demonstrating program flow and
contents of variables in small and simple programs. However, you should use a
real debugger instead when searching for errors in real programs.
1.6
Lists
The table created in the previous section has two columns of data. We could
store all the numbers in each column in a list object. A list is just a collection
of objects, here numbers, in a given sequence. For example,
L = [-1, 1, 8.0]
is a list of three numbers. Lists are enclosed in square brackets and may contain
any type of objects separated by commas. Here we mix a filename (string), a
real number, and an integer:
L = [mydata.txt, 3.14, 10]
The different list elements can be reached via indexing: L[0] is the first
element, L[1] is the second, and L[-1] is the last element. Here is an interactive
Python shell where we can write Python statements and examine the contents
of variables as we perform various operations:
>>> L = [mydata.txt, 3.14, 10]
>>> print L[0]
mydata.txt
>>> print L[1]
3.14
>>> del L[0] # delete the first element
>>> print L
[3.14, 10]
# length of L
# add -1 at the end of the list
Let us store the numbers in the previous table in lists, one for each column.
We can start with empty lists [] and use append to add a new element to the
lists in each pass of the while loop. Thereafter, we can run a new while loop and
print the contents of the lists in a nice, tabular fashion:
v0 = 2
a = 0.2
dt = 0.1 # Increment
t = 0
t_values = []
s_values = []
while t <= 2:
s = v0*t + 0.5*a*t**2
t_values.append(t)
s_values.append(s)
t = t + dt
print s_values # Just take a look at a created list
# Print a nicely formatted table
i = 0
while i <= len(t_values)-1:
print %.2f %.4f % (t_values[i], s_values[i])
i += 1
# Same as i = i + 1
Note that print s_values here leads to output with many decimals and small
round-off errors. To get complete control of the formatting of real numbers in
the table, we use the printf syntax.
Lists come with a lot of functionality. See the Python Tutorial5 for many
more examples.
1.7
For loops
>>> L = [1, 4, 8, 9]
>>> for e in L:
...
print e
...
1
4
8
9
The variable e is successively set equal to the elements in the list, in the right
order. Note the colon at the end of the for line. The statements to be executed
in each pass of the loop (here only print e) must be indented. When e is set
equal to the last element and all indented statements are executed, the loop
is over, and the program flow continues with the next statement that is not
indented. Try the following code out in the Python Online Tutor:
list1 = [0, 0.1, 0.2]
list2 = []
for element in somelist:
p = element + 2
list2.append(p)
print list2
We can now rewrite our program that used lists and while loops to use for
loops instead:
v0 = 2
a = 0.2
dt = 0.1 # Increment
t_values = []
s_values = []
n = int(round(2/dt)) + 1 # No of t values
for i in range(n):
t = i*dt
s = v0*t + 0.5*a*t**2
t_values.append(t)
s_values.append(s)
print s_values # Just take a look at a created list
# Make nicely formatted table
for t, s in zip(t_values, s_values):
print %.2f %.4f % (t, s)
# Alternative implementation
for i in range(len(t_values)):
print %.2f %.4f % (t_values[i], s_values[i])
Observe that we have here used a slightly different technique for computing the
t values inside the first loop: now we set t as it, where t (dt in the code) is
the increment (0.1) between each t value. The computation of n, the number
of t values, makes use of round to make a correct mathematical rounding to
the nearest integer (and int makes an integer object out of the rounded real
number). (In an interval [a, b] divided into subintervals of equal length t, there
will be 1 + (b a)/t points in total.)
Running through multiple lists simultaneously is done with the zip construction:
for e1, e2, e3, ... in zip(list1, list2, list3, ...):
One may instead create a for loop over all the legal index values instead and
index each array,
for i in range(len(list1)):
e1 = list1[i]
e2 = list2[i]
...
1.8
Arrays
Lists are useful for collecting a set of numbers or other objects in a single variable.
Arrays are much like lists, but tailored for collection of numbers. The primary
advantage of arrays is that you can use them very efficiently and conveniently in
mathematical computations, but the downside is that an array has (in practice)
a fixed length and all elements must be of the same type. This is usually no
important restriction in scientific computations.
To use arrays, you must import the numpy module:
>>> import numpy
>>> L = [1, 4, 10.0]
>>> a = numpy.array(L)
# List of numbers
# Make corresponding array
10
>>> print a
[ 1.
4. 10.]
>>> print a[1]
4.0
>>> print a.dtype
float64
>>> b = 2*a + 1
>>> print b
[ 3.
9. 21.]
Note that all elements in the a array are of float type (because one element in
L was float). Arithmetic expressions such as 2*a+1 work with a as array, but
not as list. In fact, we can pass arrays to mathematical functions:
>>> c = numpy.log(a)
>>> print c
[ 0.
1.38629436
2.30258509]
The numpy module has a lot of very useful utilities. To create n + 1 uniformly
distributed coordinates in an interval [a, b], stored in an array, one can use
linspace:
t = numpy.linspace(a, b, n+1)
This construction makes it easy to create arrays for the t and s values in our
tables:
import numpy
v0 = 2
a = 0.2
dt = 0.1 # Increment
n = int(round(2/dt)) + 1
# No of t values
1.9
Mathematical functions
Python offers access to all standard mathematical functions such as sin x, cos x,
tan x, sinh x, cosh x, tanh x, all their inverses (called asin(x), asinh(x), and
so forth), ex (exp(x)), ln x (log(x)), and x! (factorial(x)). However, one has
to import a module to get access to these functions. For scalars (single numbers)
the relevant module is math:
>>> import math
>>> print math.sin(math.pi)
1.2246467991473532e-16
which shows that the sine function is only approximate (to 16 digits). Many
prefer to write mathematical expressions without the math prefix:
11
The numpy module contains sine, cosine, and other mathematical functions
that work on scalars as well as arrays.
Import of numpy.
To get Python code that is as similar to Matlab as possible, one would do
a start import of everything,
from numpy import *
x = linspace(0, 1, 101)
y = exp(-x)*sin(pi*x)
Our convention is to use the np prefix for typical Matlab functions, but
skip the prefix when working with mathematical functions like exp(x)*sin(pi*x)to
get a one-to-one correspondence between formulas in the program and in
the mathematical description of the problem.
import numpy as np
from numpy import sin, exp, pi
x = np.linspace(0, 1, 101)
y = exp(-x)*sin(pi*x)
1.10
Plotting
We can easily make a graph of a function s(t) using the module matplotlib.
The technique is to compute an array of t values and a corresponding array of
function values s(t). Plotting programs will draw straight lines between the
points on the curve, so a sufficient number of points are needed to give the
impression of a smooth curve. Our s(t) function is plotted by the following code:
import numpy as np
import matplotlib.pyplot as plt
v0 = 0.2
a = 2
n = 21 # No of t values for plotting
12
t = np.linspace(0, 2, n+1)
s = v0*t + 0.5*a*t**2
plt.plot(t, s)
plt.savefig(myplot.png)
plt.show()
such that they can use linspace and plot without any prefix, just as in Matlab.
Two curves can easily be plotted, this time also with labels on the axis and a
box with legends such that we can distinguish the two curves:
import numpy as np
import matplotlib.pyplot as plt
v0 = 0.2
a = 2
n = 21 # No of t values for plotting
t = np.linspace(0, 2, n+1)
s = v0*t + 0.5*a*t**2
plt.plot(t, s)
plt.savefig(myplot.png)
plt.show()
13
2.1
Functions
Note that
functions start with the keyword def
statements belonging to the function must be indented
function input is represented by arguments (separated by comma if more
than one)
function output is returned to the calling code
In this program, v0 and a are global variables, which must be defined before
calling the s(t) function, otherwise v0 and a becomes undefined variables in
the expression inside the function.
Instead of having v0 and a as global variables, we may let them be function
arguments:
14
=
=
=
=
=
=
s(3, 0.2, 4)
s(3)
s(3, a=2)
s(3, v0=2)
s(t=3, v0=2, a=2)
s(a=2, t=3, v0=2)
#
#
#
#
#
#
Notice.
Arguments without the argument name are called positional arguments.
Positional arguments must always be listed before the keyword arguments in the function and in any call.
The sequence of the keyword arguments can be arbitrary.
If argument names are used for all arguments (as in the last line
above) the sequence of the arguments is not important.
Vectorized functions.
Applying the function s(t, v0, a) to an array t can be done in two ways:
# Scalar code: work with one element at a time
for i in range(len(t)):
s_values[i] = s(t_values[i], v0, a)
# Vectorized code: apply s to the entire array
s_values = s(t_values, v0, a)
For the last line to work, the function s must contain statements that work
correctly when t is an array argument.
15
When n values are returned, we list n variables on the left-hand side in the call.
Python functions return only one object.
Even when we return several values, as in return s, v, actually only one
object is returned. The s and v values are packed together in a tuple object
(which is very similar to a list).
>>> def f(x):
...
return x+1, x+2, x+3
...
>>> r = f(3)
# Store all three return values in one object r
>>> print r
(4, 5, 6)
>>> type(r)
# What type of object is r?
<type tuple>
>>> print r[1]
5
Tuples are constant lists, so you can index them as lists, but you cannot
change the contents (append or del is illegal).
2.2
The formula (1) arises from the basic differential equations in kinematics:
ds
, s(0) = s0 ,
dt
dv
a=
, v(0) = v0 .
dt
v=
(2)
(3)
Given any acceleration a(t), we can solve for s(t) through integration. First, we
integrate to find v(t):
16
Z
a(t)dt =
0
dv
dt,
dt
which gives
Z
v(t) = v0 +
a(t)dt .
0
(4)
2.3
If tests
The else part can be omitted when not needed. Several branches are also
possible:
if condition1:
<statements when condition1 is True>
elif condition2:
<statements when condition1 is False and condition2 is True>
elif condition3:
<statements when condition1 and conditon 2 are False
and condition3 is True>
else:
<statements when condition1/2/3 all are False>
17
To plot this s(t), we need to compute points on the curve. Trying to call
s_func with an array t as argument will not work because of the if test. In
general, functions with if tests will not automatically work with arrays because
a test like if t <= t1 evaluates to an if applied to a boolean array (t <= t1
becomes a boolean array, not just a boolean).
One solution is to compute the function values one by one in a loop such that
the s function is always called with a scalar value for t. Appropriate code is
n = 201 # No of t values for plotting
t1 = 1.5
t = np.linspace(0, 2, n+1)
s = np.zeros(n+1)
for i in range(len(t)):
s[i] = s_func(t=t[i], v0=0.2, a0=20, t1=t1)
18
Note that t <= t1 with array t and scalar t1 results in a boolean array b
where b[i] = t[i] <= t1.
Using array indexing. It is possible to index a subset of indices in an
array s using a boolean array b: s[b]. This construction picks out all the
elements s[i] where b[i] is True. On the right-hand side we can then
assign some array expression expr of the same length as s[b]:
s[b] = (expr)[b]
Our example can utilize this technique with b as t <= t1 and t > t1:
s = np.zeros_like(t) # Make s as zeros, same size & type as t
s[t <= t1] = (v0*t + 0.5*a0*t**2)[t <= t1]
s[t > t1] = (v0*t + 0.5*a0*t1**2 + a0*t1*(t-t1))[t > t1]
2.4
Arrays are usually large and most array libraries have carefully designed functionality for avoiding unnecessary copying of large of amounts of data. If you do
a simple assignment,
a = np.linspace(1, 5, 5)
b = a
b becomes a just view of a: the variables b and a point to the same data. In
other languages one may say that b is a pointer or reference to the array. This
means that changing a changes b and vice versa:
array([ 1., 2.,
>>> b[0] = 5
>>> a
array([ 5., 2.,
>>> a[1] = 9
>>> b
array([ 5., 9.,
3.,
4.,
5.])
# changes a[0] to 5
3.,
4.,
5.])
# changes b[1] to 9
3.,
4.,
5.])
4.,
5.])
3.,
4.,
5.])
3.,
4.,
5.])
Note that b remains unchanged by the change in c since the b and c variables
now refer to different data. Copying of the elements of a sub-array is also done
by the copy() method: b = a[1:-1].copy().
19
2.5
Linear systems
Solving linear systems of the form Ax = b with a matrix A and vectors x and b
is a frequently encountered task. A sample 2 2 system can be coded as
import numpy as np
A = np.array([[1., 2],
[4, 2]])
b = np.array([1., -2])
x = np.linalg.solve(A, b)
20
Files
This section outlines basic file handling in Python, including file utilities in the
numpy package.
3.1
File reading
Suppose we create a file with typical input data to our little demo program for
evaluating the formula s(t) = v0 t + 12 at2 :
v0 = 2
a = 0.2
dt = 0.1
interval = [0, 2]
We want to read the parameters in the file into Python variables and create a
table with columns of t and S(t) for t [0, 2] with steps of t = 0.1 (as specified
by dt and interval in the file).
The code for reading the lines in the file, interpreting them, and assigning
values to variables is given next.
infile = open(.input.dat, r)
for line in infile:
# Typical line: variable = value
variable, value = line.split(=)
variable = variable.strip() # remove leading/traling blanks
if variable == v0:
v0 = float(value)
elif variable == a:
a = float(value)
elif variable == dt:
dt = float(value)
elif variable == interval:
interval = eval(value)
infile.close()
21
The name of the file is here .input.dat, and it is opened with the parameter
r for reading. The for loop for line in infile will read the lines from the
file, one by one, and in the current line is available in the string line. To split a
line into words separated by a character =, we use line.split(=), resulting
in a list of the words. For example,
>>> line = v0 = 5.3
>>> variable, value = line.split(=)
>>> variable
v0
>>> value
5.3
Note that there are blanks in the strings. Leading and trailing blanks can be
removed by variable.strip(). We do this before comparing the variable name
with v0, a, etc.
It is important to note that value (the text to the right of = on each line) is
a string variable. We need to convert to a float object to get a variable that
we can compute with. The assignment interval = eval(value) does some
magic and deserve an explanation: eval(s) interprets the text in the string s
as Python code. In the present example, value is [0, 2] and this is interpreted
as Python code, i.e., as a list, and interval becomes a name for a list object
with content [0, 2].
A more modern Python way of opening files is
with open(.input.dat, r) as infile:
for line in infile:
...
Now it is not necessary to close the file as it will be automatically done after the
with block.
3.2
File writing
Suppose we generate t and s(t) values in two lists, t_values and s_values, and
want to write these as a nicely formatted table to file. The following code does
the work:
outfile = open(table1.dat, w)
outfile.write(# t
s(t)\n) # write table header
for t, s in zip(t_values, s_values):
outfile.write(%.2f %.4f\n % (t, s))
Files intended for writing must be opened with a w parameter. The key
statement is outfile.write(s) for writing a string to a file (recall that while
print automatically adds a newline at the end of the string to be printed,
outfile.write(s) does not append a newline so s must contain the newline).
The numpy package contains a convenient function savetxt for saving tabular
data. The data must be stored in a two-dimensional numpy array. The savetxt
function enables control of the format of the numbers in each column (fmt)
through the printf syntax, a header can be added (header) and the header lines
begin with a comment character (comment). The code reads
22
import numpy as np
# Make two-dimensional array of [t, s(t)] values in each row
data = np.array([t_values, s_values]).transpose()
# Write data array to file in table format
np.savetxt(table2.dat, data, fmt=[%.2f, %.4f],
header=t
s(t), comments=# )
The tabular data in the file can be read back into a numpy array by the
loadtxt function:
data = np.loadtxt(table2.dat, comments=#)
Lines beginning with the comment character are skipped in the reading. The
resulting object data is a two-dimensional array: data[i,j] contains row number
i and column number j in the table, i.e., data[i,0] holds the t value and
data[i,1] the s(t) value in the i-th row.
Classes
All objects in Python are in fact implemented as classes, but you can program
with objects without knowing about classes. Nevertheless, the class concept is a
powerful tool and some basic knowledge will make it easier to understand much
useful Python information that is floating around.
4.1
A class packs together a set of variables and a set of functions. All functions
can access all variables.7 The idea is to encapsulate variables and functions in
logical units such that a larger program can be composed by combing such units
(classes).
Here is a trivial class that has one variable a and one function dump that
writes the contents of a:
class Trivial:
def __init__(self, a):
self.a = a
def dump(self):
print self.a
How can we use this class? First, we must make an instance (object) of the
class:
t = Trivial(a=4)
classes, the functions are called methods and the variables are called attributes.
23
class, all of them could then access self.a (as if a were some global variable
in the class). The __init__ function is called a constructor since it is used to
construct an instance (object) of the class.
Having an instance t of class Trivial, we can call the dump method as
follows:
t.dump()
Even though both __init__ and dump have self as first argument, this argument
is not used in a call.
The self argument.
It takes time and experience to understand the self argument in class
methods.
1. self must always be the first argument.
2. self is never used in calls.
3. self is used to access attributes and methods inside methods.
We refer to a more comprehensive text on classesa for better explanation
of self.
a http://hplgit.github.io/primer.html/doc/pub/class/index.html
4.2
24
Dissection. The class has two methods (functions). The name of a method
can be freely chosen by the programmer, say dump as we used above, but here
we have used three special names, starting and ending with double underscores,
which allows us to use special attractive syntax in the calls (such methods are
actually known as special methods).
The constructor __init__ has one purpose: storing data in class attributes,
here self.v0 and self.a. We can then access these data in class methods.
The __call__ method is used to evaluate s(t). It has one argument (self
does not count since it is never used in calls). This special methods allows us to
view a class instance as if were a function. Let us illustrate by some code:
s = Distance(v0=2, a=0.5)
v = s(t=0.2)
# create instance
# runs s.__call__(t=0.2)
The last line has some magic: s is a class instance, not a function, but still we
can write s(t=0.2) or s(0.2) as if s were a function. This is the purpose of
the special method __call__: to allow such syntax. Actually, s(0.2) means
s.__call__(0.2). The nice result is that s looks like a function, it takes one
argument t, as the mathematical function s(t), but it also contains values of the
two parameters v0 and a.
In general, for a function f (x, y, z; p1 , p2 , . . . , pn ), here with three independent
variables and n parameters p1 , p2 , . . . , pn , we can pack the function and the
parameters together in a class:
class F:
def __init__(self, p1, p2, ...):
self.p1 = p1
self.p2 = p2
...
def __call__(self, x, y, z):
# return formula involving x, y, z and self.p1, self.p2 ...
Exercises
25
b) Make a Python function that takes x as argument and returns y. Call the
function for x = 2 and print the answer.
Solution. Code:
def f(x):
return 6*x**2 + 3*x + 2
y = f(2)
print y
if x = 2.55. The values of x and y should be written with three decimals. Run
the program for x = too (the value if is available as the variable pi in the
math module).
Solution. Here is the code:
x = 2.55
y = x**2
print y(%.3f)=%.3f % (x, y)
Changing x = 2.55 to x = ,
from math import pi
x = pi
y = x**2
print y(%.3f)=%.3f % (x, y)
26
One can also make the code shorter using a list comprehension:
x = [n**2 + 1 for n in range(21)]
print x
27
as expected.
28
0.8
0.6
g( y )
0.4
0.2
0.0
0.2
0.4
0.0
0.5
1.0
1.5
2.0
y
2.5
3.0
3.5
4.0
As Exercise 9, but add a black dashed curve for the function h(y) = e 2 y sin(4y).
Include a legend for each curve (with names g and h).
Solution. Here is the program:
import numpy as np
import matplotlib.pyplot as plt
from numpy import exp, sin # avoid np. prefix in g(y) and h(y)
def g(y):
return exp(-y)*sin(4*y)
def h(y):
return exp(-(3./2)*y)*sin(4*y)
y = np.linspace(0, 4, 501)
plt.figure()
plt.plot(y, g(y), r-, y, h(y), k--)
plt.xlabel($y$); plt.ylabel($g(y)$)
plt.title(Damped sine wave)
plt.legend([g, h])
plt.savefig(tmp.png); plt.savefig(tmp.pdf)
plt.show()
29
0.8
g
h
0.6
g( y )
0.4
0.2
0.0
0.2
0.4
0.0
0.5
1.0
1.5
2.0
y
2.5
3.0
3.5
4.0
Here, %timeit ran our function once, but the vectorized function 10 times. The
most relevant CPU times measured are listed, and we realize that the vectorized
code is 2.68/(40.1/1000) 67 times faster than the loop-based scalar code.
8 http://ipython.org/
30
Use the recipe above to investigate the speed up of the vectorized computation
of the s(t) function in Section 2.1.
31
32
Index
array, 10
assignment statement, 4
boolean expression, 5, 17
branching, 17
default arguments, 15
False, 5
float, 4
for loop, 8
format string syntax, 5
formatting of numbers, 4
function (in Python), 14
text editor, 3
tridigonal matrix, 20
True, 5
variable, 4
vectorization, 15
vectorized functions, 15
vectorizing if tests, 18
visualization, 12
while loop, 5
zip, 10
global variables, 14
graphics, 12
if test, 17
import of modules, 12
int, 4
keyword arguments, 15
linear systems, 20
linspace, 11
list, 7
list comprehension, 9
math, 11
mathematical functions, 11
matplotlib, 12
multiple lists traversal with zip, 10
multiple return values, 16
numpy, 10
objects, 4
plotting, 12
print statement, 4
printf syntax, 4
program file, 3
Python Online Tutor, 6
sparse matrix, 20
33