Python Notes
Python Notes
Using Python 3
Module 1:Python Data Types
String Data Type
• String are identified as a contiguous set of characters represented in the
quotation marks.
• Python allows for either pairs of single or double quotes.
• Strings are immutable sequence data type, i.e each time one makes any
changes to a string, completely new string object is created.
a_str = 'Hello World'
print(a_str) #output will be whole string. Hello World
print(a_str[0]) #output will be first character. H
print(a_str[0:5]) #output will be first five characters. Hello
Set Data Types
Sets are unordered collections of unique objects, there are two types of set:
1. Sets - They are mutable and new elements can be added once sets are
defined
basket = {'apple', 'orange', 'apple', 'pear', 'orange', 'banana'}
print(basket) # duplicates will be removed
> {'orange', 'banana', 'pear', 'apple'}
a = set('abracadabra')
print(a) # unique letters in a
> {'a', 'r', 'b', 'c', 'd'}
a.add('z')
print(a)
> {'a', 'c', 'r', 'b', 'z', 'd'}
2. Frozen Sets - They are immutable and new elements cannot added after its
defined.
b = frozenset('asdfagsa')
print(b)
> frozenset({'f', 'g', 'd', 'a', 's'})
cities = frozenset(["Frankfurt", "London", "New York"])
print(cities)
frozenset({'Frankfurt', ‘London', 'Freiburg'})
Numbers data type
Numbers have four types in Python. Int, float, complex, and long.
int_num = 10 #int value
float_num = 10.2 #float value
complex_num = 3.14j #complex value
long_num = 1234567L #long value
List Data Type
• A list contains items separated by commas and enclosed within square brackets
[].
• lists are almost similar to arrays in C.
• One difference is that all the items belonging to a list can be of different data
type.
list = [123,'abcd',10.2,'d'] #can be an array of any data type or single data type.
list1 = ['hello','world']
print(list) #will output whole list. [123,'abcd',10.2,'d']
print(list[0:2]) #will output first two element of list. [123,'abcd']
print(list1 * 2) #will gave list1 two times. ['hello','world','hello','world']
print(list + list1) #will gave concatenation of both the lists.
[123,'abcd',10.2,'d','hello','world']
Dictionary Data Type
• Dictionary consists of key-value pairs.
• It is enclosed by curly braces {} and values can be assigned and accessed using
square brackets[].
dic={'name':'red','age':10}
print(dic) #will output all the key-value pairs. {'name':'red','age':10}
print(dic['name']) #will output only value with 'name' key. 'red'
print(dic.values()) #will output list of values in dic. ['red',10]
print(dic.keys()) #will output list of keys. ['name','age']
Tuple Data Type
• Lists are enclosed in brackets [ ] and their elements and size can be changed,
while tuples are enclosed in parentheses ( ) and cannot be updated.
• Tuples are immutable.
tuple = (123,'hello')
tuple1 = ('world')
print(tuple) #will output whole tuple. (123,'hello')
print(tuple[0]) #will output first value. (123)
print(tuple + tuple1) #will output (123,'hello','world')
tuple[1]='update' #this will give you error.
Module 2: Date and Time
Python 3.x Version ≥ 3.2
import datetime
dt = datetime.datetime.strptime("2016-04-15T08:27:18-0500", "%Y-%m-
%dT%H:%M:%S%z")
For other versions of Python, you can use an external library such as dateutil,
which makes parsing a string with timezone into a datetime object is quick.
import dateutil.parser
dt = dateutil.parser.parse("2016-04-15T08:27:18-0500")
The dt variable is now a datetime object with the following value:
datetime.datetime(2016, 4, 15, 8, 27, 18, tzinfo=tzoffset(None, -18000))
Constructing timezone-aware datetimes
• By default all datetime objects are naive.
• To make them timezone-aware, you must attach a tzinfo object, which provides the UTC offset and
timezone abbreviation as a function of date and time.
Fixed Offset Time Zones
For time zones that are a fixed offset from UTC, in Python 3.2+, the datetime module provides the
timezone class, a concrete implementation of tzinfo, which takes a timedelta and an (optional) name
parameter:
Python 3.x Version ≥ 3.2
from datetime import datetime, timedelta, timezone
JST = timezone(timedelta(hours=+9))
dt = datetime(2015, 1, 1, 12, 0, 0, tzinfo=JST)
print(dt)
# 2015-01-01 12:00:00+09:00
print(dt.tzname())
# UTC+09:00
dt = datetime(2015, 1, 1, 12, 0, 0, tzinfo=timezone(timedelta(hours=9), 'JST'))
print(dt.tzname)
# 'JST'
For Python versions before 3.2, it is necessary to use a third party library, such
as dateutil. dateutil provides an equivalent class, tzoffset, which (as of version
2.5.3) takes arguments of the form dateutil.tz.tzoffset(tzname,offset), where
offset is specified in seconds:
Python 3.x Version < 3.2
Python 2.x Version < 2.7
from datetime import datetime, timedelta
from dateutil import tz
JST = tz.tzoffset('JST', 9 * 3600) # 3600 seconds per hour
dt = datetime(2015, 1, 1, 12, 0, tzinfo=JST)
print(dt)
# 2015-01-01 12:00:00+09:00
print(dt.tzname)
# 'JST'
Zones with daylight savings time
For zones with daylight savings time, python standard libraries do not provide a standard
class, so it is necessary to use a third party library. pytz and dateutil are popular libraries
providing time zone classes.
from datetime import datetime
from dateutil import tz
local = tz.gettz() # Local time
PT = tz.gettz('US/Pacific') # Pacific time
dt_l = datetime(2015, 1, 1, 12, tzinfo=local) # I am in EST
dt_pst = datetime(2015, 1, 1, 12, tzinfo=PT)
dt_pdt = datetime(2015, 7, 1, 12, tzinfo=PT) # DST is handled automatically
print(dt_l)
# 2015-01-01 12:00:00-05:00
print(dt_pst)
# 2015-01-01 12:00:00-08:00
print(dt_pdt)
# 2015-07-01 12:00:00-07:00
Computing time di erences
The timedelta module comes in handy to compute differences between times:
from datetime import datetime, timedelta
now = datetime.now()
then = datetime(2019, 6, 15)
#Specifying time is optional when creating a new datetime object
delta = now-then
print(delta.days)
print(delta.seconds)
To get n day's after and n day's before date we could use:
n day's after date:
def get_n_days_after_date(date_format="%d %B %Y", add_days=120):
date_n_days_after = datetime.datetime.now() + timedelta(days=add_days)
return date_n_days_after.strftime(date_format)
n day's before date:
def get_n_days_before_date(self, date_format="%d %B %Y", days_before=120):
date_n_days_ago = datetime.datetime.now() - timedelta(days=days_before)
return date_n_days_ago.strftime(date_format)
Basic datetime objects usage
The datetime module contains three primary types of objects - date, time, and
datetime.
import datetime
# Date object
today = datetime.date.today()
new_year = datetime.date(2017, 01, 01) #datetime.date(2017, 1, 1)
# Time object
noon = datetime.time(12, 0, 0) #datetime.time(12, 0)
# Current datetime
now = datetime.datetime.now()
# Datetime object
millenium_turn = datetime.datetime(2000, 1, 1, 0, 0, 0)
#datetime.datetime(2000, 1, 1, 0, 0)
Arithmetic operations for these objects are only supported within same datatype and
performing simple arithmetic with instances of different types will result in a
TypeError.
# subtraction of noon from today
noon-today
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
TypeError: unsupported operand type(s) for -: 'datetime.time' and 'datetime.date'
However, it is straightforward to convert between types.
# Do this instead
print('Time since the millenium at midnight: ',
datetime.datetime(today.year, today.month, today.day) - millenium_turn)
# Or this
print('Time since the millenium at noon: ',
datetime.datetime.combine(today, noon) - millenium_turn)
Switching between time zones
To switch between time zones, you need datetime objects that are timezone-
aware.
from datetime import datetime
from dateutil import tz
utc = tz.tzutc()
local = tz.tzlocal()
utc_now = datetime.utcnow()
utc_now # Not timezone-aware.
utc_now = utc_now.replace(tzinfo=utc)
utc_now # Timezone-aware.
local_now = utc_now.astimezone(local)
local_now # Converted to local time.
SETS
Operations on sets with other sets
# Intersection
{1, 2, 3, 4, 5}.intersection({3, 4, 5, 6}) # {3, 4, 5}
{1, 2, 3, 4, 5} & {3, 4, 5, 6} # {3, 4, 5}
# Union
{1, 2, 3, 4, 5}.union({3, 4, 5, 6}) # {1, 2, 3, 4, 5, 6}
{1, 2, 3, 4, 5} | {3, 4, 5, 6} # {1, 2, 3, 4, 5, 6}
# Difference
{1, 2, 3, 4}.difference({2, 3, 5}) # {1, 4}
{1, 2, 3, 4} - {2, 3, 5} # {1, 4}
# Symmetric difference with
{1, 2, 3, 4}.symmetric_difference({2, 3, 5}) # {1, 4, 5}
{1, 2, 3, 4} ^ {2, 3, 5} # {1, 4, 5}
# Superset check
{1, 2}.issuperset({1, 2, 3}) # False
{1, 2} >= {1, 2, 3} # False
# Subset check
{1, 2}.issubset({1, 2, 3}) # True
{1, 2} <= {1, 2, 3} # True
# Disjoint check
{1, 2}.isdisjoint({3, 4}) # True
{1, 2}.isdisjoint({1, 4}) # False
with single elements
# Existence check
2 in {1,2,3} # True
4 in {1,2,3} # False
4 not in {1,2,3} # True
# Add and Remove
s = {1,2,3}
s.add(4) # s == {1,2,3,4}
s.discard(3) # s == {1,2,4}
s.discard(5) # s == {1,2,4}
s.remove(2) # s == {1,4}
a = {1, 2}
a.update({3, 4}) # a == {1, 2, 3, 4}
Get the unique elements of a list
restaurants = ["McDonald's", "Burger King", "McDonald's", "Chicken Chicken"]
unique_restaurants = set(restaurants)
print(unique_restaurants)
# prints {'Chicken Chicken', "McDonald's", 'Burger King'}
• Note that the set is not in the same order as the original list; that is because
sets are unordered, just like dicts.
• This can easily be transformed back into a List with Python's built in list
function, giving another list that is the same list as the original but without
duplicates:
list(unique_restaurants)
# ['Chicken Chicken', "McDonald's", 'Burger King']
It's also common to see this as one line:
# Removes all duplicates and returns another list
list(set(restaurants))
Set Operations using Methods and Builtins
a = {1, 2, 2, 3, 4}
b = {3, 3, 4, 4, 5}
• NOTE: {1} creates a set of one element, but {} creates an empty dict. The
correct way to create an empty set is set().
Intersection
a.intersection(b)
# returns a new set with elements present in both a and b
Union
a.union(b) returns a new set with elements present in either a and b
a.union(b)
{1, 2, 3, 4, 5}
Difference
a.difference(b) returns a new set with elements present in a but not in b
a.difference(b)
{1, 2}
b.difference(a)
{5}
Symmetric Difference
a.symmetric_difference(b) returns a new set with elements present in either a or
b but not in both
a.symmetric_difference(b)
{1, 2, 5}
b.symmetric_difference(a)
{1, 2, 5}
NOTE: a.symmetric_difference(b) == b.symmetric_difference(a)
Subset and superset
c.issubset(a) tests whether each element of c is in a.
a.issuperset(c) tests whether each element of c is in a.
c = {1, 2}
c.issubset(a)
True
a.issuperset(c)
True
Disjoint sets
Sets a and d are disjoint if no element in a is also in d and vice versa.
d = {5, 6}
a.isdisjoint(b) # {2, 3, 4} are in both sets
False
a.isdisjoint(d)
True
Method Operator
a.intersection(b) a&b
a.union(b) a|b
a.difference(b) a-b
a.symmetric_difference(b) a ^ b
a.issubset(b) a <= b
a.issuperset(b) a >= b