Python Tutorial Basics
Python Tutorial Basics
▪ If you are working on DATA You should know python and python is
commonly for
▪ For Data Engineering
▪ For Machine Learning
▪ For Data Science
▪ For Deep Learning
Benefits Python? pyspark telugu
techlake
Reduce development time
No compile
Basics
Python Data Types
Numbers ,Strings....
Data Structures
Lists, and Tuples ,Dictionaries and Sets
Conditionals and Loop Control Statements
if , for, while,pass,break,continue...
Regular Expressions
Functions
Advanced
Files and Input/Output
Errors and Exceptions
pyspark telugu
What are Python Identifiers? techlake
Python Identifier is the name we give to identify a variable, list, tuple, sets, dictionary, function, class, moduleor other
object. That means whenever we want to give an entity a name, that’s called identifier.
Sometimes variable and identifier are often misunderstood as same but they are not. Well for clarity, let’s see what is a
variable?
What is print()?
The print() function prints the specified message to the screen, or other standard output device.
The message can be a string, or any other object, the object will be converted into a string before written to the screen.
Keywords
The following identifiers are used as reserved words, or keywords of the language, and cannot be used as ordinary
identifiers. They must be spelled exactly as written here (case-sensitive):
Variables : A variable, as the name indicates is something whose value is changeable over time. In fact a
variable is a memory location where a value can be stored
Multiple Assignment pyspark telugu
Python allows you to assign a single value to several variables simultaneously techlake
Strings
Besides numbers, Python can also manipulate strings, which can be expressed in several ways. They can be enclosed in single
quotes ('...') or double quotes ("...") with the same result 2. \ can be used to escape quotes:
Numbers
The interpreter acts as a simple calculator: you can type an expression at it and it will write the value. Expression syntax is
straightforward: the operators +, -, * and / work just like in most other languages (for example, Pascal or C); parentheses (())
can be used for grouping. For example:
Division (/) always returns a float. To do floor division and get an integer result (discarding any fractional result) you can use
the // operator; to calculate the remainder you can use %:
it is possible to use the ** operator to calculate powers pyspark telugu
techlake
int() - constructs an integer number from an integer literal, a float literal (by removing all decimals), or a string literal
(providing the string represents a whole number)
float() - constructs a float number from an integer literal, a float literal or a string literal (providing the string represents a
float or an integer)
str() - constructs a string from a wide variety of data types, including strings, integer literals and float literals
F-strings
F-Strings provide a way to embed expressions inside string literals, using a minimal syntax. It should be noted that an f-string is
really an expression evaluated at run time, not a constant value. In Python source code, an f-string is a literal string, prefixed
with 'f', which contains expressions inside braces. The expressions are replaced with their values.
format() function
* `str.format()` is one of the string formatting methods in Python3, which allows multiple substitutions and value
formatting. This method lets us concatenate elements within a string through positional formatting.
* __Syntax__ : `{ } .format(value)`
Parameters : `(value)` : Can be an integer, floating point numeric constant, string, characters or even variables.
Returntype : Returns a formatted string with the value passed as parameter in the placeholder position.
The placeholders can be identified using named indexes {price}, numbered indexes {0}, or even empty placeholders {}.
String Formatting
Python uses C-style string formatting to create new, formatted strings. The "%" operator is used to format a set of variables
enclosed in a "tuple" (a fixed size list), together with a format string, which contains normal text together with "argument
specifiers", special symbols like "%s" and "%d".
s – strings
d – decimal integers (base-10)
f – floating point display
c – character
b – binary
o – octal
x – hexadecimal with lowercase letters after 9
X – hexadecimal with uppercase letters after 9
e – exponent notation
Python Data Types
There are different types of data types in Python. Some built-in Python data types are:
If you omit the first index, the slice starts at the beginning. If you omit the second, the slice goes to the end. So if
you omit both, the slice is a copy of the whole list.
Slicing starting from minimum index 0
if we want add any value in between we can go with `insert` specifying index value
extend takes a list as an argument and appends all of the elements pyspark telugu
techlake
sort with reverse arranges the elements of the list from high to low
List Length pyspark telugu
To determine how many items a list has, use the len() function techlake
Clearing all values using clear method and making empty list []
COPY
You cannot copy a list simply by typing list2 = list1, because: list2 will only be a reference to list1, and changes made in list1
will automatically also be made in list2.
There are ways to make a copy, one way is to use the built-in List method copy().
copylist will only be a reference to thislist. If we add a value in thislist and same value will be available in copylist. Bcz its referring
Original list.
Tuple
A tuple is a collection which is ordered and unchangeable. In Python tuples are written with round brackets ().
Python tuple is much like a list except that it is immutable or unchangeable once created.
Tuples use parentheses and creating them is as easy as putting different items separated by a comma between
parentheses.
Range of Indexes
You can specify a range of indexes by specifying where to start and where to end the range.
When specifying a range, the return value will be a new tuple with the specified items.
Negative Indexing
Negative indexing means beginning from the end, -1 refers to the last item, -2 refers to the second last item etc.
Slicing
If you omit the first index, the slice starts at the beginning. If you omit the second, the slice goes to the end. So if you omit
both, the slice is a copy of the whole list.
Change Tuple Values
Once a tuple is created, you cannot change its values. Tuples are unchangeable, or immutable as it also is called.
But there is a workaround. You can convert the tuple into a list, change the list, and convert the list back into a tuple.
Remove Items
Tuples are unchangeable, so you cannot remove items from it, but you can delete the tuple completely
The del keyword can delete the tuple completely
Join Two Tuples
To join two or more tuples you can use the + operator
Nested Tuples
It is also possible to create a tuple of tuples or tuple of lists.
Sets
Python also includes a data type for sets. A set is an unordered collection with no duplicate elements. Basic uses include
membership testing and eliminating duplicate entries. Set objects also support mathematical operations like union,
intersection, difference, and symmetric difference.
Curly braces or the set() function can be used to create sets. Note: to create an empty set you have to use set(), not {}; the
latter creates an empty dictionary, a data structure that we discuss in the next section.
To add more than one item to a set use the update() method.
You can also use the pop(), method to remove an item, but this method will remove the last item. Remember that sets are
unordered, so you will not know what item that gets removed.
Intersection : will get common matching data items from both datasets.
A union B, B union A and A intersection B and B intersection A will get same results
Difference (Minus or Subtract) Set Operator pyspark telugu
Subtracting right side data set values in left dataset and displaying remaining left dataset values. techlake
Note: A Difference B and B Difference A will get different result set
Dictionaries
A dictionary is like a list, but more general. In a list, the index positions have to be integers; in a dictionary, the indices can
be (almost) any type.
You can think of a dictionary as a mapping between a set of indices (which are called keys) and a set of values. Each key
maps to a value. The association of a key and a value is called a key-value pair or sometimes an item.
The function dict creates a new dictionary with no items. Because dict is the name of a built-in function, you should avoid
using it as a variable name.
A dictionary is a collection which is unordered, changeable and indexed. In Python dictionaries are written with curly
brackets, and they have keys and values.
Accessing Items
You can access the items of a dictionary by referring to its key name, inside square brackets
There is also a method called get() that will give you the same result pyspark telugu
techlake
Change Values
You can change the value of a specific item by referring to its key name
Loop Through a Dictionary
You can loop through a dictionary by using a for loop.
When looping through a dictionary, the return value are the keys of the dictionary, but there are methods to return the
values as well.
You can also use the values() method to return values of a dictionary
You can also use the items() method to return keys and values of a dictionary pyspark telugu
techlake
Adding Items
Adding an item to the dictionary is done by using a new index key and assigning a value to it
pyspark telugu
Removing Items techlake
There are several methods to remove items from a dictionary
The popitem() method removes the last inserted item (in versions before 3.7, a random item is removed instead)
The del keyword removes the item with the specified key name pyspark telugu
techlake
Nested Dictionaries
A dictionary can also contain many dictionaries, this is called nested dictionaries.
Conditional execution
In order to write useful programs, we almost always need the ability to check conditions and change the behaviour of the
program accordingly. Conditional statements give us this ability. The simplest form is the if statement:
The Boolean expression after the if statement is called the condition. We end the if statement with a colon character (:) and
the line(s) after the if statement are indented.
Alternative execution (If Else)
A second form of the if statement is alternative execution, in which there are two possibilities and the condition determines
which one gets executed. The syntax looks like this:
If the remainder when x is divided by 2 is 0, then we know that x is even, and the program displays a message to that effect.
If the condition is false, the second set of statements is executed.
Chained conditionals
Sometimes there are more than two possibilities and we need more than two branches. One way to express a computation
like that is a chained conditional:
If-Then-Else Logic
Since the condition must either be true or false, exactly one of the alternatives will be executed. The alternatives are called
branches, because they are branches in the flow of execution.
The outer conditional contains two branches. The first branch contains a simple statement. The second branch contains
another if statement, which has two branches of its own. Those two branches are both simple statements, although they
could have been conditional statements as well.
pyspark telugu
Short Hand If
If you have only one statement to execute, you can put it on the same line as the if statement. techlake
`code`
pyspark telugu
For loops with lists techlake
For loops can also be run using Python lists.
If a list is used, the loop will run as many times as there are items in the list.
The general structure is:
` <statements>`
The break Statement In For Loop pyspark telugu
With the break statement we can stop the loop before it has looped through all the items: techlake
Nested functions
pyspark telugu
Python Lambda Function
A lambda function is a small anonymous function. techlake
A lambda function can take any number of arguments, but can have one line expression.
Syntax
`lambda arguments : expression`
Local Vs Global Variables pyspark telugu
if we want to change the Global Variable value inside functions Use `GLOBAL` Keyword.. techlake
If a finally clause is present, the finally clause will execute as the last task before the try statement completes. The finally
clause runs whether or not the try statement produces an exception. The following points discuss more complex cases when
an exception occurs:
If an exception occurs during execution of the try clause, the exception may be handled by an except clause. If the exception is not
handled by an except clause, the exception is re-raised after the finally clause has been executed.
An exception could occur during execution of an except or else clause. Again, the exception is re-raised after the finally clause has been
executed.
If the try statement reaches a break, continue or return statement, the finally clause will execute just prior to the break, continue or
return statement’s execution.
If a finally clause includes a return statement, the returned value will be the one from the finally clause’s return statement, not the value
from the try clause’s return statement.
pyspark telugu
techlake
The try … except statement has an optional else clause, which, when present, must follow all except clauses. It is useful for
code that must be executed if the try clause does not raise an exception. For example:
pyspark telugu
Reading and Writing Files (file I/O) techlake
open() returns a file object, and is most commonly used with two arguments: open(filename, mode).
The key function for working with files in Python is the open() function.
"r" - Read - Default value. Opens a file for reading, error if the file does not exist
"a" - Append - Opens a file for appending, creates the file if it does not exist
"w" - Write - Opens a file for writing, creates the file if it does not exist
"x" - Create - Creates the specified file, returns an error if the file exists
pyspark telugu
techlake
f.readline() reads a single line from the file; a newline character (\n) is left at the end of the string, and is only omitted on the
last line of the file if the file doesn’t end in a newline. This makes the return value unambiguous; if f.readline() returns an
empty string, the end of the file has been reached, while a blank line is represented by '\n', a string containing only a single
newline.
pyspark telugu
File Handling
Reading and Writing Files techlake
`open()` returns a file object, and is most commonly used with two arguments: `open(filename, mode)`.
The regular expression library re must be imported into your program before you can use it. The simplest use of the regular
expression library is the search() function. The following program demonstrates a trivial use of the search function.
findall Returns a list containing all matches
search Returns a Match object if there is a match anywhere in the string
split Returns a list where the string has been split at each match
sub Replaces one or many matches with a string
Examples pyspark telugu
techlake
Thank you For Reading
All The Best