Artificial Intelligence


What is Intelligence?

Intelligence is a complex and multifaceted concept that generally refers to the ability to
learn from experience, adapt to new situations, understand and handle abstract
concepts, and use knowledge to manipulate one's environment. It encompasses a
variety of cognitive processes and abilities.

Key Aspects of Human Intelligence


1. Learning:
● The capacity to acquire new knowledge or skills through experience,
study, or teaching.
● Involves processes such as memory, association, and pattern recognition.
2. Reasoning:
● The ability to think logically, make sense of things, and solve problems.
● Includes deductive reasoning (drawing specific conclusions from general
principles) and inductive reasoning (drawing general conclusions from
specific instances).
3. Problem-Solving:
● The capability to find solutions to difficult or complex issues.
● Involves identifying problems, generating potential solutions, evaluating
those solutions, and implementing the best one.
4. Adaptability:
● The ability to adjust one's thinking and behavior in response to new and
changing circumstances.
● Essential for survival and success in a dynamic environment.
5. Abstract Thinking:
● The capacity to understand complex concepts that are not directly tied to
physical experiences.
● Includes thinking about ideas, principles, and relationships that are not
immediately tangible.
6. Comprehension:
● The ability to understand and make sense of language, symbols, and
information.
● Critical for communication, learning, and applying knowledge.

History
The history of Artificial Intelligence (AI) began in antiquity, with myths, stories and
rumors of artificial beings endowed with intelligence or consciousness by master
craftsmen. The seeds of modern AI were planted by classical philosophers who
attempted to describe the process of human thinking as the mechanical manipulation
of symbols. This work culminated in the invention of the programmable digital
computer in the 1940s, a machine based on the abstract essence of mathematical
reasoning. This device and the ideas behind it inspired a handful of scientists to begin
seriously discussing the possibility of building an electronic brain.

The field of AI research was founded at a workshop held on the campus of Dartmouth
College during the summer of 1956. Those who attended would become the leaders of
AI research for decades. Many of them predicted that a machine as intelligent as a
human being would exist in no more than a generation, and they were given millions of
dollars to make this vision come true. Eventually, it became obvious that they had
grossly underestimated the difficulty of the project. In 1973, in response to the criticism
from James Lighthill and ongoing pressure from Congress, the U.S. and British
governments stopped funding undirected research into artificial intelligence, and the
difficult years that followed would later be known as an "AI winter". Seven years later, a
visionary initiative by the Japanese Government inspired governments and industry to
provide AI with billions of dollars, but by the late 1980s the investors became
disillusioned by the absence of the needed computer power (hardware) and withdrew
funding again. In the 1980s a form of AI program called "expert systems" was adopted
by corporations around the world, and knowledge became the focus of mainstream AI
research. In those same years, the Japanese government aggressively funded AI with
its fifth generation computer project. Another encouraging event in the early 1980s was
the revival of connectionism in the work of John Hopfield and David Rumelhart. Once
again, AI had achieved success.

In the first decades of the 21st century, access to large amounts of data (known as "big
data"), cheaper and faster computers, and advanced machine learning techniques were
successfully applied to many problems throughout the economy. In fact, the McKinsey
Global Institute estimated in their famous paper "Big data: The next frontier for
innovation, competition, and productivity" that "by 2009, nearly all sectors in the US
economy had at least an average of 200 terabytes of stored data". By 2016, the market
for AI-related products, hardware, and software reached more than 8 billion dollars, and
the New York Times reported that interest in AI had reached a "frenzy". The applications
of big data began to reach into other fields as well, such as training models in ecology
and for various applications in economics. Advances in deep learning (particularly deep
convolutional neural networks and recurrent neural networks) drove progress and
research in image and video processing, text analysis, and speech recognition.
Investment and interest in AI boomed as machine learning was successfully applied to
many problems in academia and industry, thanks to new methods, powerful computer
hardware, and the collection of immense data sets.

Foundations of AI
◼ The foundations of AI are drawn from
❑ Mathematics
❑ Neuroscience
❑ Control Theory
❑ Linguistics

4.1. Foundations – Mathematics


◼ More formal logical methods
❑ Boolean logic
❑ Fuzzy logic
◼ Uncertainty
❑ Most modern approaches to handling uncertainty in AI applications rely on
✓ Probability theory
✓ Modal and Temporal logics

4.2. Foundations – Neuroscience


◼ How does the brain work?
❑ Early studies (1824) relied on patients with brain injuries and abnormalities to
understand which parts of the brain do what
❑ More recent studies use accurate sensors to correlate brain activity to
human thought
◼ By monitoring individual neurons, monkeys can now control a
computer mouse using thought alone
❑ Moore’s law suggested that computers would have as many gates as
humans have neurons by 2020
❑ How close are we to having a mechanical brain?
◼ Parallel computation, remapping, interconnections,….

4.3. Foundations – Control Theory


❑ Machines can modify their behavior in response to the
environment (sense/action loop)
◼ Water-flow regulator, steam engine governor, thermostat
❑ The theory of stable feedback systems (1894)
◼ Build systems that transition from initial
state to goal state with minimum energy
◼ In 1950, control theory could only describe
linear systems and AI largely rose as a
response to this shortcoming

4.4. Foundations – Linguistics


◼ Speech demonstrates so much of human intelligence
❑ Analysis of human language reveals thought taking place in ways not
understood in other settings
◼ Children can create sentences they have never heard before
◼ Language and thought are believed to be tightly intertwined
AI problems:

Toy Problem (a simple vacuum-cleaner agent):

Percepts: bump, dirt, home
Actions: shut off, move forward, suck, turn left, turn right
Goal: clean up the dirt
Environment: walls, floor, rooms

Real World problems:


TIC TAC TOE:
The Tic-Tac-Toe problem is a classic example in artificial intelligence (AI) used to
demonstrate concepts in search algorithms, game theory, and decision-making. It
involves creating an AI that can play Tic-Tac-Toe optimally against a human opponent
or another AI. Let's break down the problem using the PAGE (Percepts, Actions, Goals,
Environment) framework:

PAGE Framework for Tic-Tac-Toe AI


1. Percepts:
● Definition: The information the AI agent receives about the current state of
the game.
● In Tic-Tac-Toe: The percepts are the current layout of the game board.
Each position on the 3x3 grid can either be empty, marked with an 'X', or
marked with an 'O'. The agent perceives the board state after each move.
2. Actions:
● Definition: The set of all possible moves the agent can make.
● In Tic-Tac-Toe: The actions are the available moves the agent can make.
These are the empty positions on the board where the agent can place its
mark (either 'X' or 'O'). For example, if the top-left corner is empty, placing
a mark there is one possible action.
3. Goals:
● Definition: The objectives the agent is trying to achieve.
● In Tic-Tac-Toe: The primary goal is to win the game by placing three of the
agent's marks in a horizontal, vertical, or diagonal row. A secondary goal is
to block the opponent from achieving this.
4. Environment:
● Definition: The context within which the agent operates, including any
constraints or rules.
● In Tic-Tac-Toe: The environment is a 3x3 grid game board. The rules are:
● Players take turns to place their marks on an empty spot.
● The game ends when a player achieves three marks in a row or
when all spots are filled, resulting in a draw if no player has won.

Developing a Tic-Tac-Toe AI
1. State Representation:
● Represent the game board as a 3x3 matrix or a 9-element list, where each
element can be 'X', 'O', or empty.
2. Move Generation:
● The AI generates a list of all possible moves by identifying empty spots on
the board.
3. Evaluation Function:
● The AI evaluates board states to determine which move is the best. This
can be done using a scoring system (e.g., +10 for a win, -10 for a loss, 0
for a draw).
4. Search Algorithm:
● Minimax Algorithm: A common algorithm used in game AI. The AI
simulates all possible moves (and the opponent's responses) to a given
depth and chooses the move that maximizes its chances of winning while
minimizing the opponent's chances.
● Alpha-Beta Pruning: An optimization of Minimax that eliminates branches
in the search tree that don't need to be explored because they cannot
affect the final decision.

Example of Minimax in Tic-Tac-Toe


1. Initial Board:

[X, O, X]
[O, X,  ]
[ ,  ,  ]

2. Possible Moves for 'O':
● Place 'O' at (1, 2)
● Place 'O' at (2, 0)
● Place 'O' at (2, 1)
● Place 'O' at (2, 2)
3. Simulate Each Move:
● For each possible move, simulate the resulting board state, then simulate
all possible responses by 'X', and so on.
4. Evaluate Outcomes:
● Calculate the score for each move using the evaluation function. For
example:
● If 'O' can win immediately by placing at (1, 2), that move scores +10.
● If the move leads to a draw after optimal play from both sides, it
scores 0.
● If the move leads to a loss, it scores -10.
5. Choose Best Move:
● Select the move with the highest score (or the least negative score in the
worst-case scenario).
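The steps above can be sketched in Python. This is a minimal illustration rather than a production engine: the board encoding, function names, and the choice of 'O' as the maximizing player are my own assumptions.

```python
# Minimax for Tic-Tac-Toe (illustrative sketch).
# Board: a list of 9 cells, each 'X', 'O', or ' ' (row-major 3x3).

LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),   # rows
         (0, 3, 6), (1, 4, 7), (2, 5, 8),   # columns
         (0, 4, 8), (2, 4, 6)]              # diagonals

def winner(board):
    for a, b, c in LINES:
        if board[a] != ' ' and board[a] == board[b] == board[c]:
            return board[a]
    return None

def minimax(board, player):
    """Return (score, move) with 'O' maximizing (+10) and 'X' minimizing (-10)."""
    w = winner(board)
    if w == 'O':
        return 10, None
    if w == 'X':
        return -10, None
    moves = [i for i, cell in enumerate(board) if cell == ' ']
    if not moves:
        return 0, None                       # board full: draw
    results = []
    for m in moves:
        board[m] = player                    # try the move...
        score, _ = minimax(board, 'X' if player == 'O' else 'O')
        board[m] = ' '                       # ...then undo it
        results.append((score, m))
    return max(results) if player == 'O' else min(results)

# The board from the example above, with 'O' to move:
board = ['X', 'O', 'X',
         'O', 'X', ' ',
         ' ', ' ', ' ']
score, move = minimax(board, 'O')
print(score)  # -10: 'X' already has a double threat, so 'O' loses under optimal play
```

Alpha-beta pruning would add two bounds to this recursion and skip branches that cannot change the result; the interface would stay the same.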

Water Jug Problem


The Water Jug Problem is a classic problem in artificial intelligence and computer
science, often used to illustrate problem-solving and search algorithms. The problem
typically involves two jugs with different capacities and an objective to measure a
specific amount of water using these jugs. Here’s a detailed explanation of the problem
and its solution approach using the PAGE (Percepts, Actions, Goals, Environment)
framework:

Problem Description
You are given two jugs with capacities of a liters and b liters, and you need to measure
exactly c liters of water. The operations you can perform are:

1. Fill a jug completely.


2. Empty a jug completely.
3. Pour water from one jug to another until one is either full or the other is empty.

PAGE Framework for the Water Jug Problem


1. Percepts:
● Definition: The current state of the jugs, represented by the amount of
water in each jug.
● In the Water Jug Problem: The percepts are the current volumes of water
in the two jugs. This state can be represented as a tuple (x, y), where x is
the amount of water in the first jug and y is the amount in the second jug.
2. Actions:
● Definition: The possible operations that can change the state of the jugs.
● In the Water Jug Problem: The actions include:
1. Fill the first jug.
2. Fill the second jug.
3. Empty the first jug.
4. Empty the second jug.
5. Pour water from the first jug to the second jug.
6. Pour water from the second jug to the first jug.
3. Goals:
● Definition: The desired state or outcome that the agent is trying to achieve.
● In the Water Jug Problem: The goal is to reach a state where one of the
jugs contains exactly c liters of water. Mathematically, the goal is to find a
state (x, y) such that x = c or y = c.
4. Environment:
● Definition: The context within which the agent operates, including any
constraints or rules.
● In the Water Jug Problem: The environment consists of the two jugs with
fixed capacities a and b. The actions are constrained by the capacities of
the jugs, and the state changes must adhere to the physical properties of
the jugs (e.g., you cannot pour more water than the current amount in a
jug or more than the capacity of the other jug).

Solving the Water Jug Problem


The Water Jug Problem can be solved using various search algorithms, such as
Breadth-First Search (BFS) or Depth-First Search (DFS). Here’s a step-by-step outline
using BFS, which is particularly suited for finding the shortest path to the solution:

1. Initial State: Start with both jugs empty: (0,0)


2. Goal Test: Check if either jug has c liters of water: x=c or y=c.
3. State Transition: Generate new states by applying all possible actions to the
current state.
4. Search Algorithm:
● Use BFS to explore the state space:
1. Initialize a queue with the initial state (0,0) and a set to keep track
of visited states.
2. While the queue is not empty, dequeue a state (x, y). If it meets the
goal, return the sequence of actions that led to this state.
3. Otherwise, generate all possible new states from (x,y) using the
actions, and enqueue the new states if they have not been visited.
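The BFS outline above can be turned into a short Python sketch. The function name and state encoding are my own; for simplicity it returns the sequence of states rather than the named actions:

```python
from collections import deque

def water_jug_bfs(a, b, c):
    """BFS over states (x, y): x = water in jug A (capacity a), y = jug B
    (capacity b). Returns the shortest sequence of states from (0, 0) to a
    state holding exactly c liters, or None if c is unreachable."""
    start = (0, 0)
    parent = {start: None}          # also serves as the visited set
    queue = deque([start])
    while queue:
        x, y = queue.popleft()
        if x == c or y == c:        # goal test
            path = [(x, y)]
            while parent[path[-1]] is not None:
                path.append(parent[path[-1]])
            return path[::-1]
        pour_ab = min(x, b - y)     # how much A can pour into B
        pour_ba = min(y, a - x)     # how much B can pour into A
        successors = [
            (a, y), (x, b),                  # fill jug A / jug B
            (0, y), (x, 0),                  # empty jug A / jug B
            (x - pour_ab, y + pour_ab),      # pour A into B
            (x + pour_ba, y - pour_ba),      # pour B into A
        ]
        for s in successors:
            if s not in parent:
                parent[s] = (x, y)
                queue.append(s)
    return None                     # state space exhausted: c unreachable

path = water_jug_bfs(3, 5, 4)
print(path)
# [(0, 0), (0, 5), (3, 2), (0, 2), (2, 0), (2, 5), (3, 4)]
```

Because BFS explores states level by level, the first goal state it dequeues is reached by a shortest action sequence; here it recovers the same six actions as the worked example that follows.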

Example Solution
Consider two jugs with capacities 3 liters and 5 liters, and the goal is to measure exactly
4 liters:

1. Initial state: (0,0)
2. Actions:
● Fill the 5-liter jug: (0,5)
● Pour from the 5-liter jug to the 3-liter jug: (3,2)
● Empty the 3-liter jug: (0,2)
● Pour the 2 liters from the 5-liter jug into the 3-liter jug: (2,0)
● Fill the 5-liter jug again: (2,5)
● Pour from the 5-liter jug to the 3-liter jug until the 3-liter jug is full: (3,4)

In this sequence of actions, we reach the state (3,4), where the second jug contains
exactly 4 liters, achieving the goal.

Agents:
In artificial intelligence, an agent is an entity that perceives its environment through
sensors and acts upon that environment through actuators to achieve specific goals.
Agents are a fundamental concept in AI and can range from simple programs to
complex systems with advanced reasoning capabilities. Here’s a detailed overview of
agents in AI:

Types of Agents
1. Simple Reflex Agents:
● Behavior: Act based on a set of predefined rules or conditions. They do not
consider the history of past states.
● Example: A thermostat that turns on the heater if the temperature drops
below a certain threshold.
2. Goal-Based Agents:
● Behavior: Act to achieve specific goals. They consider the future
consequences of their actions to decide the best course of action.
● Example: A chess-playing program that selects moves based on their
potential to lead to a winning position.
3. Utility-Based Agents:
● Behavior: Choose actions based on a utility function that measures how
desirable each outcome is. This allows them to handle trade-offs and
uncertainties.
● Example: A recommendation system that suggests products to users
based on a balance of relevance and diversity.

Propositional Logic

Propositional logic, also known as sentential logic, is a fundamental branch of logic
used in artificial intelligence and computer science. It deals with propositions, which are
statements that are either true or false. Propositional logic provides a formal framework
for reasoning about the truth or falsehood of these propositions and the relationships
between them. Here's an explanation of propositional logic in AI:

Components of Propositional Logic:


1. Propositions:
● Basic units of knowledge represented as statements that can be true or
false.
● Denoted by capital letters (e.g., P, Q, R).
2. Logical Connectives:
● Operators used to combine propositions to form compound propositions.
● Common logical connectives include:
● Negation (¬): Represents "not" or "negation" of a proposition.
● Conjunction (∧): Represents "and."
● Disjunction (∨): Represents "or."
● Implication (→): Represents "if...then" or "implies."
● Biconditional (↔): Represents "if and only if."
3. Truth Values:
● Each proposition can have one of two truth values: True (T) or False (F).
● The truth value of compound propositions is determined based on the
truth values of their component propositions and the rules of logical
connectives.
Example:
Consider the following propositions:

● P: It is raining.
● Q: The ground is wet.

Now, let's form compound propositions using logical connectives:

1. Conjunction (P∧Q): It is raining and the ground is wet.
● True only if both P and Q are true.
2. Disjunction (P∨Q): It is raining or the ground is wet.
● True if at least one of P or Q is true.
3. Negation (¬P): It is not raining.
● True if P is false.
4. Implication (P→Q): If it is raining, then the ground is wet.
● False only if P is true and Q is false.
5. Biconditional (P↔Q): It is raining if and only if the ground is wet.
● True if both P and Q have the same truth value (either both true or both false).
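These rules can be checked mechanically by enumerating every truth assignment of P and Q. A small Python sketch (the helper name `implies` is my own):

```python
from itertools import product

def implies(p, q):
    # P → Q is false only when P is true and Q is false
    return (not p) or q

# Print a truth table for the five connectives.
print("P      Q      P∧Q    P∨Q    ¬P     P→Q    P↔Q")
for p, q in product([True, False], repeat=2):
    cells = [p, q, p and q, p or q, not p, implies(p, q), p == q]
    print("  ".join(f"{str(c):5}" for c in cells))
```

Note that the biconditional P↔Q is simply equality of the two truth values, which is why `p == q` implements it.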

Applications in AI:
● Knowledge Representation: Propositional logic is used to represent knowledge in
AI systems, allowing them to reason about the world based on logical rules and
facts.
● Inference: AI systems use propositional logic to derive new knowledge from
existing knowledge, infer relationships between different propositions, and make
logical deductions.
● Problem Solving: Propositional logic provides a foundation for solving problems
using logical reasoning, search algorithms, and constraint satisfaction
techniques in AI applications.

Propositional Theorem

Think of a propositional theorem as a rule in a game that always holds true, no matter
what. It's like saying, "If you have a red ball, and someone asks you if you have a red
ball, you can always say yes, because it's always true."

Here's a simple breakdown:

Propositional Theorem in Simple Terms:


1. Always True:
● A propositional theorem is like a rule that's always true, no matter what.
● Imagine the rule "Either it's raining outside or it's not raining." This rule
holds true all the time, regardless of the actual weather.
2. Logical Consequence:
● It's a statement that logically follows from basic rules of reasoning.
● Just like in a game, if you follow the rules, you always end up with the right
answer. For example, if you have a red ball and someone asks if you have
a red ball, you'll always say yes because it's a logical consequence of
having the ball.
3. Example:
● Think of the statement "Either it's sunny or it's not sunny." This is always
true because it covers all possibilities: either it's sunny or it's not, there's
no other option.

Importance in Everyday Life:


● Reliable Rules: Propositional theorems give us reliable rules to follow, like how
we know certain things are always true no matter the situation.
● Clear Thinking: They help us think clearly and make sense of the world by
providing logical guidelines to follow.
● Decision Making: In AI, they're crucial for making decisions based on logical
rules, helping machines understand and navigate the world around them.

Proving Inference:
Inference refers to the process of deriving new knowledge or conclusions from existing
knowledge or premises. In propositional logic, proving inference involves showing that a
conclusion logically follows from a set of premises using valid rules of inference. Here's
how it works:

1. Premises:
● Premises are statements or propositions assumed to be true. They serve
as the starting point for logical reasoning.
2. Rules of Inference:
● These are logical rules or principles that allow us to derive new
conclusions from premises. Examples include modus ponens, modus
tollens, and hypothetical syllogism.
3. Logical Steps:
● To prove an inference, we apply the rules of inference step by step to the
premises until we reach the desired conclusion.
4. Validity:
● An inference is considered valid if the conclusion logically follows from
the premises according to the rules of inference.

Example of Proving Inference:


Premises:

● P→Q
● P

Conclusion:

● Q

Proof:

1. Given P→Q and P.
2. By modus ponens, we conclude Q.
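Modus ponens can itself be verified by brute force: in every truth assignment where both premises hold, the conclusion holds too. A small illustrative sketch:

```python
from itertools import product

# Enumerate all (P, Q) assignments and keep those where both premises hold:
# P→Q (i.e. ¬P ∨ Q) and P.
rows = [(p, q) for p, q in product([True, False], repeat=2)
        if ((not p) or q) and p]
print(rows)                      # [(True, True)]: the only such assignment
print(all(q for _, q in rows))   # True: Q holds whenever the premises do
```

Since Q is true in every row where the premises are true, the inference is valid.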

Proofs:
Proofs are formal demonstrations or arguments that establish the truth of a proposition
or theorem based on logical reasoning and valid inference. In mathematics and logic,
proofs are used to demonstrate the validity of mathematical statements or logical
propositions. Here's how they work:

1. Hypothesis:
● A proof starts with a hypothesis, which is the statement or proposition to
be proved.
2. Logical Steps:
● The proof proceeds through a series of logical steps, each justified by
valid rules of inference or previously proven statements (lemmas).
3. Conclusion:
● The proof concludes with a statement that logically follows from the initial
hypothesis and the logical steps taken.
4. Rigor:
● Proofs are expected to be rigorous and complete, leaving no room for
doubt about the validity of the conclusion.

Example of a Proof:
Theorem: The sum of two even integers is always even.

Proof:

1. Let n and m be even integers.
2. By definition, n = 2k and m = 2l for some integers k and l.
3. The sum n + m = 2k + 2l = 2(k + l).
4. Since k + l is an integer, n + m is divisible by 2 and therefore even.
5. Thus, the sum of two even integers is always even.
Importance of Proofs:
● Rigor and Certainty: Proofs provide a rigorous and systematic way to establish
the truth of mathematical and logical statements, ensuring certainty and
reliability.
● Foundation of Mathematics: Proofs form the foundation of mathematical
knowledge, allowing mathematicians to build upon established results and
theories.
● Verification: Proofs enable others to verify and validate mathematical results,
ensuring correctness and accuracy in mathematical research and practice.

Proof by Resolution

Imagine you're trying to figure out if a bunch of statements are true or not. Proof by
resolution is like playing a game where you have to find out if all the statements make
sense together or if there's a problem.

Here's how it works:

1. Setting Up the Game:


● You start by writing down all the statements in a special way so it's easier to
work with them. This special way is called "conjunctive normal form" (CNF). It's
like putting each statement in a box and then putting all the boxes together.
2. Finding Clues:
● You look for pairs of statements that seem to disagree with each other. For
example, if one statement says "It's raining" and another says "It's not raining,"
they're clearly arguing!
3. Resolving Conflicts:
● You use a special rule called "resolution" to resolve these arguments. If one
statement says something is true and another says it's not, you can combine
them to see what makes sense.
● If the arguments don't contradict each other too much, you can find a way to
resolve them. But if they're totally opposite, you'll end up with a problem.
4. Detecting Problems:
● If you keep resolving conflicts and end up with a statement that says both
something is true and not true at the same time, that's a big problem! It means
the original statements can't all be true together.
● When you see this problem, you've proven that the original statements can't all be
true at once.
5. Game Over:
● If you can't find any more conflicts to resolve and you haven't run into any
problems, that means the original statements could all be true together.
Simple Example:
Imagine you have these statements:

1. It's raining or it's sunny.


2. If it's not raining, then it's cloudy.
3. It's not sunny or it's not cloudy.

You use resolution to resolve conflicts between these statements. If you end up with a
statement like "It's both raining and not raining," that's a big problem! It means
something went wrong with the original statements.

In essence, proof by resolution is like playing detective with logical statements. You're
trying to find out if they all fit together nicely or if there's a contradiction that shows they
can't all be true at the same time.
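The refutation game can be sketched as a tiny propositional resolution loop in Python. Clauses are sets of string literals with '~' marking negation; the function names and the (deliberately simpler) example knowledge base are my own:

```python
def negate(lit):
    """Flip a literal: 'R' <-> '~R'."""
    return lit[1:] if lit.startswith('~') else '~' + lit

def resolve(c1, c2):
    """All resolvents of two clauses (clauses are frozensets of literals)."""
    out = []
    for lit in c1:
        if negate(lit) in c2:
            out.append((c1 - {lit}) | (c2 - {negate(lit)}))
    return out

def unsatisfiable(clauses):
    """Refutation loop: resolve pairs until the empty clause (a contradiction)
    appears, or until no new clauses can be derived."""
    clauses = set(map(frozenset, clauses))
    while True:
        new = set()
        for a in clauses:
            for b in clauses:
                if a == b:
                    continue
                for r in resolve(a, b):
                    if not r:             # empty clause: contradiction found
                        return True
                    new.add(r)
        if new <= clauses:                # nothing new: no contradiction
            return False
        clauses |= new

# "raining or sunny", "not raining", "not sunny": these cannot all hold.
print(unsatisfiable([{'R', 'S'}, {'~R'}, {'~S'}]))  # True
print(unsatisfiable([{'R', 'S'}, {'~R'}]))          # False
```

Deriving the empty clause is exactly the "both true and not true at the same time" problem described above: it proves the original statements are inconsistent.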

Horn Clauses and Definite Clauses

Horn Clauses in Simple Terms:


1. Simple Rules:

● Horn clauses are like basic rules or statements written in a special way for

logical thinking.

2. Fewer Options:

● They have at most one positive statement. It's like saying "I'll do this if that

happens, but not if this other thing happens."

3. Examples:

● Imagine a rule like "If it's raining, then I'll take an umbrella." This is a Horn

clause because it has one positive statement ("I'll take an umbrella") and

maybe some negative ones ("It's not raining").


● Another example is "If it's not snowing, then I'll go for a walk." This is also

a Horn clause because it has one positive statement ("I'll go for a walk")

and one negative one ("It's not snowing").

4. Usage:

● Horn clauses are used in logical reasoning, especially in computer science

and artificial intelligence, to represent rules or conditions.

Importance of Horn Clauses:


● Simple Logic: They help to express logical rules or conditions in a straightforward

way, making it easier to understand and work with them.

● Problem Solving: Horn clauses are often used in solving logical puzzles or

making decisions based on certain conditions.

● Programming: In computer science, they're used in programming languages like

Prolog to represent rules or facts.

Definite clauses are a specific type of logical statement that's used in various areas like

logic programming and automated reasoning. Let's simplify what definite clauses are

and how they work:

Definite Clauses in Simple Terms:


1. Special Kind of Rule:

● Definite clauses are like rules you might follow in a game or in real life, but

they're written in a specific way to make them easier to work with.

2. Clear Structure:

● Each definite clause has two parts: a condition and a conclusion.


● The condition is a combination of facts or statements, and the conclusion

is what happens if the condition is true.

3. Only One Positive Thing:

● Written in clause form, a definite clause has exactly one positive

statement: its conclusion. The statements of the condition all appear in

negated ("not") form.

4. Example:

● Imagine a rule that says "If it's raining and you are outside, then you get

wet." This rule is a definite clause: the condition is "It's raining and you

are outside," and the conclusion is "You get wet."

Importance of Definite Clauses:


● Simple Rules: They provide a clear and structured way to express rules or

conditions.

● Logic Programming: Definite clauses are commonly used in programming

languages like Prolog to represent knowledge or rules.

● Easy to Work With: Their clear structure makes them easier for computers to

understand and use in automated reasoning.

Forward Chaining:
1. Starting Point:

● Forward chaining is like starting with what you know and moving forward

step by step to reach a conclusion.

2. Building Up:
● You begin with the facts or rules you have and keep adding new

conclusions based on those facts or rules.

3. Example:

● Imagine you're trying to figure out if it's going to rain today. You start with

what you know: "If the clouds are dark, it might rain." Then you look

outside. If the clouds are dark, you conclude, "It might rain." Then, you

might check if you feel any drops, and if you do, you conclude, "It's

raining."

4. Continuing Until Done:

● You keep following these steps until you reach the conclusion you're

looking for or until you can't go any further.

Backward Chaining:
1. Goal-Oriented:

● Backward chaining is like starting with a goal in mind and working

backward to see how you can achieve it.

2. Working Backward:

● You start with the goal you want to achieve and then look for the facts or

rules that would allow you to reach that goal.

3. Example:

● Using the same example of figuring out if it's going to rain, if your goal is

to know if it's going to rain, you might start by asking, "What would make it

rain?" Then you think, "Dark clouds might make it rain." So, you check if the

clouds are dark.

4. Repeating Until Satisfied:


● You keep repeating this process, working backward from one step to the

previous one, until you either find the information you need or can't go any

further.

Comparison:
● Forward Chaining:

● Starts with known facts and moves forward to reach a conclusion.

● Works well when you have lots of data or when you need to explore all

possibilities.

● Backward Chaining:

● Starts with a goal and works backward to find the information needed to

achieve that goal.

● Works well when you have a specific goal in mind or when you need to find

a solution efficiently.
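Both strategies can be sketched over simple Horn-style rules in Python. The rule format, fact names, and function names below are my own illustrations, loosely following the rain example above:

```python
def forward_chain(facts, rules):
    """Forward chaining: start from known facts and keep firing any rule whose
    premises are all known, until no new conclusions appear."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in facts and set(premises) <= facts:
                facts.add(conclusion)
                changed = True
    return facts

def backward_chain(goal, facts, rules):
    """Backward chaining: a goal holds if it is a known fact, or some rule
    concludes it and all of that rule's premises hold (assumes acyclic rules)."""
    if goal in facts:
        return True
    return any(all(backward_chain(p, facts, rules) for p in premises)
               for premises, conclusion in rules if conclusion == goal)

# Rules: (set of premises, conclusion)
rules = [({'dark_clouds'}, 'might_rain'),
         ({'might_rain', 'feel_drops'}, 'raining')]

print(forward_chain({'dark_clouds', 'feel_drops'}, rules))
print(backward_chain('raining', {'dark_clouds', 'feel_drops'}, rules))  # True
```

Forward chaining derives everything it can from the data; backward chaining only explores the rules relevant to the one goal it was asked about, which is why it can be more efficient for a specific query.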

First-Order logic:

○ First-order logic is another way of knowledge representation in artificial


intelligence. It is an extension to propositional logic.

○ FOL is sufficiently expressive to represent natural language statements in a


concise way.

○ First-order logic is also known as Predicate logic or First-order predicate logic.


First-order logic is a powerful language that expresses information about
objects in a natural way and can also express the relationships between those
objects.
○ First-order logic (like natural language) assumes not only that the world
contains facts, as propositional logic does, but also that the world contains
the following:

a. Objects: A, B, people, numbers, colors, wars, theories, squares, pits,


wumpus, ......

b. Relations: It can be a unary relation such as red, round, is adjacent; or an

n-ary relation such as the sister of, brother of, has color, comes between

c. Function: Father of, best friend, third inning of, end of, ......

○ As a natural language, first-order logic also has two main parts:

a. Syntax

b. Semantics

Syntax of First-Order logic:


The syntax of FOL determines which collections of symbols are well-formed expressions
in first-order logic. The basic syntactic elements of first-order logic are symbols. We
write statements in short-hand notation in FOL.

Basic Elements of First-order logic:

Following are the basic elements of FOL syntax:

Constant      1, 2, A, John, Mumbai, cat, ...
Variables     x, y, z, a, b, ...
Predicates    Brother, Father, >, ...
Function      sqrt, LeftLegOf, ...
Connectives   ∧, ∨, ¬, ⇒, ⇔
Equality      =
Quantifiers   ∀, ∃

Atomic sentences:
○ Atomic sentences are the most basic sentences of first-order logic. These
sentences are formed from a predicate symbol followed by a parenthesis with a
sequence of terms.

○ We can represent atomic sentences as Predicate (term1, term2, ......, term n).

Example: Ravi and Ajay are brothers: => Brothers(Ravi, Ajay).

Chinky is a cat: => cat(Chinky).

Complex Sentences:

○ Complex sentences are made by combining atomic sentences using


connectives.

First-order logic statements can be divided into two parts:

○ Subject: Subject is the main part of the statement.

○ Predicate: A predicate can be defined as a relation, which binds two atoms


together in a statement.

Consider the statement "x is an integer." It consists of two parts: the first part, x, is the
subject of the statement, and the second part, "is an integer," is known as the predicate.

Quantifiers in First-order logic:


○ A quantifier is a language element which generates quantification, and
quantification specifies the quantity of specimens in the universe of discourse.

○ These are the symbols that permit to determine or identify the range and scope
of the variable in the logical expression. There are two types of quantifier:

a. Universal Quantifier ∀ (for all, everyone, everything)

b. Existential Quantifier ∃ (for some, at least one)

First-Order Logic in Simple Terms:

1. Building Blocks:

● First-order logic is like a language for expressing relationships and facts


about the world.

● It has basic building blocks like objects, properties, and relationships.

2. Quantifiers:

● It uses quantifiers like "for all" ( ∀) and "there exists" ( ∃) to talk about
things in general or specific instances.
3. Variables and Constants:

● It has variables (like x or y) that can stand for any object, and constants
(like a or b) that represent specific objects.
4. Predicates:

● Predicates are statements about objects, like "is blue" or "is taller than."
They can be true or false depending on the objects involved.

5. Connectives:

● Connectives like "and" ( ∧), "or" ( ∨), and "not" ( ¬) are used to combine
statements and build more complex expressions.

Example:
Let's say we want to express the statement: "All dogs bark."

● We can start by defining:

● D(x): x is a dog.
● B(x): x barks.
● Using quantifiers and predicates, we can write:

● ∀x(D(x)→B(x)).
● This means "For all x, if x is a dog, then x barks."
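As a minimal sketch, a universally quantified sentence like this can be checked mechanically over a small finite universe. The domain, the sets of dogs and barkers, and all names below are made up purely for illustration:

```python
# Hypothetical finite domain; D(x) and B(x) are the predicates from the text.
domain = ["rex", "milo", "tweety"]
dogs = {"rex", "milo"}               # D(x): x is a dog
barkers = {"rex", "milo", "tweety"}  # B(x): x barks

def D(x): return x in dogs
def B(x): return x in barkers

# ∀x (D(x) → B(x)): "for all x, if x is a dog, then x barks".
# An implication p → q is equivalent to (not p) or q.
all_dogs_bark = all((not D(x)) or B(x) for x in domain)
print(all_dogs_bark)  # True: every dog in this domain barks
```

Note that tweety barks without being a dog; the implication is still satisfied, since it only constrains the dogs.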

Importance of First-Order Logic:

● Expressive Power: FOL allows us to express complex relationships and


properties in a formal and precise way.

● Foundation of Mathematics: It serves as a foundation for mathematical


reasoning and formal methods in various fields.

● AI and Knowledge Representation: FOL is widely used in artificial intelligence for


knowledge representation, reasoning, and automated theorem proving.

Inference in Simple Terms:


Inference in first-order logic (FOL) is the process of deriving new logical conclusions

from existing knowledge or premises using the rules of logic. It involves making logical

deductions based on the relationships and properties defined within the logical

framework. Let's break down inference in FOL in simple terms:

1. Starting Point:

● Inference starts with a set of premises, which are statements or facts that

are assumed to be true.


2. Logical Rules:

● Using logical rules and principles, we analyze the premises to derive new

conclusions that logically follow from them.

3. Logical Steps:

● We apply rules of inference such as modus ponens, universal

instantiation, existential instantiation, and universal generalization to

make deductions.

4. Example:

● Suppose we have the premises:

● All humans are mortal: ∀x(Human(x)→Mortal(x)).


● Socrates is human: Human(Socrates).
● From these premises, we can infer that Socrates is mortal:
● Mortal(Socrates).
5. Validity:

● The validity of an inference is determined by whether the conclusion

logically follows from the premises according to the rules of logic.
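The Socrates example can be sketched as a tiny forward-chaining loop combining universal instantiation with modus ponens. The tuple-based encoding of facts and rules is an illustrative simplification, not a full FOL prover:

```python
# Facts are (predicate, argument) tuples; this encoding is illustrative.
facts = {("Human", "Socrates")}
# Rule ∀x Human(x) → Mortal(x), stored as (premise_pred, conclusion_pred).
rules = [("Human", "Mortal")]

def infer(facts, rules):
    """Apply each rule to each matching fact until nothing new is derived."""
    derived = set(facts)
    changed = True
    while changed:
        changed = False
        for premise, conclusion in rules:
            for pred, arg in list(derived):
                if pred == premise and (conclusion, arg) not in derived:
                    derived.add((conclusion, arg))  # modus ponens step
                    changed = True
    return derived

print(("Mortal", "Socrates") in infer(facts, rules))  # True
```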

Importance of Inference in FOL:


● Reasoning: Inference allows us to reason about the world and draw logical

conclusions based on existing knowledge.

● Knowledge Representation: It is essential for representing and manipulating

knowledge in artificial intelligence systems.

● Automated Reasoning: Inference plays a crucial role in automated theorem

proving, logical reasoning, and problem-solving.

Unit-II
Uncertain Knowledge and Reasoning:

Uncertain knowledge and reasoning deal with situations where we don't have complete or precise

information, leading to uncertainty about the truth of statements or outcomes. This uncertainty can arise

due to various factors such as incomplete information, noise in data, or probabilistic events.

Basic probability

Probability Theory: Probability is defined as the chance of occurrence of an event.

Generally, analyzing the possibility of an event's occurrence with respect to previous data is

called probability. For example, if a fair coin is tossed, what is the chance that it lands on

heads? These types of questions are answered under probability.

Probability measures the likelihood of an event’s occurrence. In situations where the outcome of an event

is uncertain, we discuss the probability of specific outcomes to understand their chances of happening.

The study of events influenced by probability falls under the domain of statistics.

Basic Probability in Simple Terms:


1. Dealing with Uncertainty:

● In real-world scenarios, we often encounter uncertainty where outcomes

are not entirely predictable.

● Basic probability helps AI systems understand and quantify this

uncertainty.

2. Likelihood of Events:

● Probability is a measure of how likely an event is to occur. It's expressed

as a number between 0 and 1, where 0 means impossible, 1 means

certain, and values in between represent degrees of likelihood.

3. Representing Uncertainty:
● In AI, we use probabilities to represent uncertain information, such as the

likelihood of different outcomes or the likelihood of a hypothesis being

true given observed evidence.

4. Making Decisions:

● AI systems use probabilities to make decisions under uncertainty. They

calculate the likelihood of different actions leading to desirable outcomes

and choose the action with the highest expected utility or probability of

success.

Importance of Basic Probability in AI:


● Uncertainty Modeling: Probability theory provides a formal framework for

modeling and reasoning about uncertainty in AI systems.

● Decision Making: Probabilistic methods enable AI systems to make informed

decisions even when faced with incomplete or uncertain information.

● Machine Learning: Probability is fundamental to various machine learning

algorithms, such as Bayesian methods, which use probabilistic models to learn

from data and make predictions.

Bayes' theorem

Bayes' theorem allows updating the probability prediction of an event by observing new

information from the real world.


Bayes' Rule in Simple Terms:
1. Updating Probabilities:

● Bayes' rule helps us update our beliefs about the likelihood of different

events or hypotheses given new evidence.

2. Prior Beliefs:

● We start with our initial beliefs or probabilities about the events or

hypotheses before observing any new evidence. These are called prior

probabilities.

3. New Evidence:

● When new evidence or observations become available, Bayes' rule allows

us to incorporate this evidence into our prior beliefs to obtain updated

probabilities, known as posterior probabilities.

4. Probabilistic Reasoning:

● Bayes' rule formalizes the process of probabilistic reasoning, enabling AI

systems to make informed decisions and draw conclusions based on

observed evidence.

Bayes' Rule Equation:


Bayes' rule is expressed mathematically as:

P(H∣E) = P(E∣H) × P(H) / P(E)

Where:
● P(H∣E) is the posterior probability of hypothesis H given evidence E.
● P(E∣H) is the likelihood of observing evidence E given hypothesis H.
● P(H) is the prior probability of hypothesis H.
● P(E) is the probability of observing evidence E (also known as the marginal likelihood or evidence).
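As a minimal numeric sketch of Bayes' rule, the figures below are made up for a hypothetical diagnostic-test scenario; P(E) is expanded using the law of total probability:

```python
# All numbers are illustrative, not real test statistics.
p_h = 0.01              # prior P(H): probability of having the condition
p_e_given_h = 0.95      # likelihood P(E|H): test positive if condition present
p_e_given_not_h = 0.05  # false-positive rate P(E|¬H)

# Total probability of the evidence: P(E) = P(E|H)P(H) + P(E|¬H)P(¬H)
p_e = p_e_given_h * p_h + p_e_given_not_h * (1 - p_h)

# Bayes' rule: P(H|E) = P(E|H) * P(H) / P(E)
posterior = p_e_given_h * p_h / p_e
print(round(posterior, 3))
```

Even with a fairly accurate test, the posterior stays modest here because the prior is small — exactly the kind of update Bayes' rule formalizes.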

Belief Networks
Belief networks, also known as Bayesian networks or probabilistic graphical models, are

a type of representation used in artificial intelligence to model and reason about

uncertainty in complex systems. They provide a structured way to capture the

relationships between variables and make probabilistic inferences. Let's break down

belief networks in AI in simple terms:

Belief Networks in Simple Terms:


1. Modeling Relationships:

● Belief networks represent relationships between variables in a system,

capturing how one variable influences or depends on others.

2. Graphical Representation:

● They are represented graphically as a network of nodes (variables)

connected by directed edges (dependencies).

● Nodes represent variables, such as events or states, and edges represent

probabilistic dependencies between them.

3. Probabilistic Inference:

● Belief networks use probability theory to quantify the relationships

between variables and make probabilistic inferences about them.

4. Conditional Independence:
● The structure of a belief network captures conditional independence

relationships between variables, allowing for efficient probabilistic

reasoning.

Components of Belief Networks:


1. Nodes:

● Nodes represent variables or events in the system, such as weather

conditions, medical symptoms, or financial indicators.

2. Edges:

● Directed edges between nodes represent probabilistic dependencies or

causal relationships between variables.

● An edge from node A to node B indicates that B depends probabilistically

on A.

3. Conditional Probability Tables (CPTs):

● Each node has associated conditional probability tables that specify the

probability distribution of the node given its parent nodes.

● These tables quantify the probabilistic dependencies represented by the

edges.
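A minimal sketch of these components: a two-node network Rain → WetGrass with hypothetical CPTs, queried by summing over the joint distribution. The probabilities are invented for illustration:

```python
# CPT for Rain (no parents), and the CPT row P(Wet=True | Rain).
p_rain = {True: 0.2, False: 0.8}
p_wet_given_rain = {True: 0.9, False: 0.1}

# Marginal: P(Wet) = sum over Rain of P(Rain) * P(Wet | Rain)
p_wet = sum(p_rain[r] * p_wet_given_rain[r] for r in (True, False))
print(p_wet)  # 0.2*0.9 + 0.8*0.1 = 0.26

# Diagnostic inference via Bayes: P(Rain=True | Wet=True)
p_rain_given_wet = p_rain[True] * p_wet_given_rain[True] / p_wet
```

Real belief networks have many nodes and use smarter inference than brute-force enumeration, but the CPT lookups and summations are the same in spirit.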

Default Reasoning
Default reasoning is a type of reasoning in artificial intelligence (AI) and logic that deals

with making assumptions or drawing conclusions based on default or typical cases

when complete information is not available. It allows AI systems to fill in missing


information or make educated guesses when faced with uncertainty. Let's break down

default reasoning in AI in simple terms:

Default Reasoning in Simple Terms:


1. Making Assumptions:

● Default reasoning involves making assumptions or drawing conclusions

based on typical or default cases in the absence of complete information.

2. Fallback Strategy:

● When complete or specific information is not available, default reasoning

provides a fallback strategy to reason about the situation based on

common sense or general knowledge.

3. Common Patterns:

● Default reasoning relies on recognizing common patterns, assumptions,

or defaults that are generally true or likely to be true in similar situations.

4. Inference from Defaults:

● AI systems use default reasoning to infer conclusions or make decisions

based on these default assumptions or defaults.

Importance of Default Reasoning in AI:


● Dealing with Incomplete Information: Default reasoning allows AI systems to

reason effectively when faced with incomplete or uncertain information.

● Efficient Decision Making: It provides a way to make decisions or draw

conclusions in situations where obtaining complete information may be

impractical or impossible.
● Real-World Applications: Default reasoning is used in various real-world

applications of AI, such as natural language understanding, expert systems, and

automated reasoning.

Example:
Consider a smart home system that adjusts the temperature based on the time of day:

● Default Assumption:

● During the daytime, the default assumption may be that people are active

and prefer a cooler temperature.

● During the nighttime, the default assumption may be that people are

resting and prefer a warmer temperature.

● Default Reasoning:

● If the system lacks specific information about occupants' preferences, it

may default to these assumptions and adjust the temperature accordingly.
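The smart-home example can be sketched as a default rule that specific knowledge overrides. The setpoint values and function names are illustrative assumptions:

```python
# Default assumptions: cooler by day (people active), warmer by night (resting).
DEFAULTS = {"day": 21, "night": 24}  # setpoints in °C, made up for illustration

def target_temperature(time_of_day, known_preference=None):
    """Use the occupant's stated preference when available,
    otherwise fall back to the default assumption."""
    if known_preference is not None:
        return known_preference      # specific knowledge overrides the default
    return DEFAULTS[time_of_day]     # default reasoning fills the gap

print(target_temperature("day"))      # 21 — default assumption applies
print(target_temperature("day", 19))  # 19 — overridden by a known preference
```

The key property of default reasoning appears in the override: a default conclusion is retracted as soon as more specific information arrives.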

Fuzzy Sets in Simple Terms:

The word 'fuzzy' refers to things that are not clear or are vague. Sometimes in real
life we cannot decide whether a given problem or statement is true or false. At
such times, this concept provides many values between true and false and gives the
flexibility to find the best solution to that problem.

1. Traditional vs. Fuzzy:

● In traditional sets, things are either in the set or not (like a light switch: it's

either on or off).
● Fuzzy sets are more flexible. They allow things to be partially in the set.

For example, something can be "kind of hot" or "sort of tall."

2. Degrees of Membership:

● Fuzzy sets assign degrees of membership to elements. Instead of just 0

or 1, an element can belong to a set to a certain degree.

● Membership values range from 0 (not in the set at all) to 1 (fully in the

set).

3. Example:

● Take the set of "tall people." In a traditional set, someone is either tall (in

the set) or not tall (not in the set).

● In a fuzzy set, someone might have a membership value of 0.8, meaning

they're mostly tall but not extremely tall.
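The "tall people" example can be sketched as a membership function. The breakpoints (160 cm and 190 cm) are arbitrary illustrative choices, not a standard definition:

```python
def tall(height_cm):
    """Degree of membership in the fuzzy set 'tall', from 0.0 to 1.0."""
    if height_cm <= 160:
        return 0.0                       # not in the set at all
    if height_cm >= 190:
        return 1.0                       # fully in the set
    return (height_cm - 160) / 30.0      # linear ramp between the breakpoints

print(tall(150))  # 0.0 — not tall
print(tall(184))  # 0.8 — mostly tall, but not extremely tall
print(tall(195))  # 1.0 — fully tall
```

Contrast this with a crisp set, which could only return 0 or 1; the ramp in between is what makes the set fuzzy.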

Fuzzy Logic in Simple Terms:


1. Handling Uncertainty:

● Fuzzy logic deals with uncertainty by allowing for imprecise or vague

statements.

● It's like how humans reason. We often use words like "warm," "cold," "fast,"

or "slow," which aren't precise but convey meaning.

2. Linguistic Variables:

● Fuzzy logic uses linguistic variables, which represent qualitative terms like

"hot," "medium," and "cold."

● These terms are fuzzy because they can cover a range of values instead

of a specific one.

3. Fuzzy Rules:
● Fuzzy logic uses rules that are based on these linguistic variables. For

example, "If it's cold, turn the heater up a bit."

4. Fuzzy Inference:

● Fuzzy logic combines these rules and linguistic variables to make

decisions or draw conclusions. It's a bit like how humans reason through

fuzzy concepts.

Importance of Fuzzy Sets and Fuzzy Logic:


● Handling Uncertainty: They're useful in situations where things aren't black and

white, like in weather forecasting or controlling household appliances.

● Human-Like Reasoning: Fuzzy logic allows AI systems to mimic human-like

reasoning by dealing with imprecise information and linguistic terms.

● Real-World Applications: They're used in various real-world applications such as

control systems, pattern recognition, and decision-making processes.

Structured Knowledge

Structured knowledge in AI refers to organized and well-defined information that is represented

in a structured format, allowing AI systems to understand, reason, and manipulate knowledge

effectively. It involves organizing data and information into a format that is easily

understandable and accessible for machines.

Associative Networks
Associative networks, also known as associative memories or semantic networks, are a

type of data structure used in artificial intelligence to represent and retrieve information

based on associations or connections between data items. They are inspired by the way
human memory works, where memories are often triggered or recalled by related

concepts or cues. Let's simplify associative networks in AI:

Associative Networks in Simple Terms:


1. Connecting Concepts:

● Associative networks connect related concepts or items together based

on their associations or similarities.

● Each item in the network is linked to other items that are related to it in

some way.

2. Memory Recall:

● Like how remembering one thing can trigger memories of related things,

associative networks allow AI systems to recall information based on

associations.

● When given a cue or input, the network retrieves related information stored

in its structure.

3. Flexible Retrieval:

● Associative networks provide a flexible way to retrieve information. They

don't rely on rigid structures or exact matches but instead retrieve

information based on similarity or relevance.

4. Applications:

● They are used in various AI applications such as information retrieval,

pattern recognition, and recommendation systems.

Importance of Associative Networks in AI:


● Flexible Retrieval: They enable AI systems to retrieve information based on

associations, allowing for flexible and context-sensitive retrieval.

● Pattern Recognition: Associative networks can recognize patterns or similarities

between data items, aiding in tasks like image recognition or natural language

processing.

● Memory-like Behavior: They mimic the way human memory works, allowing AI

systems to recall information based on related concepts or cues.

Example:
Consider an associative network used for storing information about animals:

● Connections: Each animal (e.g., dog, cat, bird) is connected to related concepts

such as its characteristics (e.g., furry, feathers), habitat (e.g., forest, ocean), and

behavior (e.g., barking, meowing).

● Retrieval: When given a cue like "pet with fur," the network retrieves animals like

dogs and cats that are associated with the concept of fur.
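The animal example can be sketched as a graph of concept links queried by shared associations. The link data and scoring scheme below are illustrative, not a standard algorithm:

```python
# Each concept is linked to its associated cues (all data made up).
links = {
    "dog":  {"fur", "barking", "pet"},
    "cat":  {"fur", "meowing", "pet"},
    "bird": {"feathers", "singing", "pet"},
}

def recall(cues):
    """Return concepts ranked by how many of the cue associations they share."""
    scores = {item: len(assoc & set(cues)) for item, assoc in links.items()}
    return sorted((i for i, s in scores.items() if s > 0),
                  key=lambda i: -scores[i])

print(recall({"pet", "fur"}))  # dog and cat rank first; bird matches only "pet"
```

Retrieval here is by similarity, not exact match — the cue "pet with fur" never names a dog or cat, yet both are recalled.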

Frame Structures
Frame structures are a way to organize knowledge in artificial intelligence (AI) by

representing objects and their properties, relationships, and behaviors in a structured

format. They provide a way to model complex entities and their characteristics, allowing

AI systems to reason about and manipulate knowledge effectively. Let's simplify frame

structures in AI:

Frame Structures in Simple Terms:


1. Organizing Knowledge:
● Frame structures organize knowledge about objects or concepts into a

structured format resembling frames or templates.

● Each frame represents an entity (object or concept) and contains slots for

storing information about its properties, relationships, and behaviors.

2. Attributes and Values:

● Frames consist of attributes (or slots) and their corresponding values.

Attributes represent different aspects or characteristics of the entity, while

values provide specific information about those aspects.

● For example, a frame representing a car might have attributes like "model,"

"color," "engine type," and values such as "Toyota," "blue," "hybrid."

3. Hierarchical Organization:

● Frame structures can be organized hierarchically, with frames inheriting

attributes and behaviors from higher-level frames.

● This hierarchical organization allows for the classification of objects into

categories and the inheritance of common properties.

4. Behavior Representation:

● In addition to properties and relationships, frame structures can also

represent behaviors or methods associated with objects.

● These behaviors describe how objects interact with each other or respond

to stimuli in the environment.
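A minimal sketch of frames with slot inheritance, using nested dicts; the frame names and slot values are illustrative:

```python
# Each frame has a parent (for inheritance) and a dict of slots.
frames = {
    "vehicle": {"parent": None, "slots": {"wheels": 4, "moves": True}},
    "car":     {"parent": "vehicle",
                "slots": {"model": "Toyota", "color": "blue"}},
}

def get_slot(frame_name, slot):
    """Look up a slot, inheriting from ancestor frames when it is missing."""
    while frame_name is not None:
        frame = frames[frame_name]
        if slot in frame["slots"]:
            return frame["slots"][slot]
        frame_name = frame["parent"]     # climb the hierarchy
    raise KeyError(slot)

print(get_slot("car", "color"))   # blue — stored on the car frame itself
print(get_slot("car", "wheels"))  # 4 — inherited from the vehicle frame
```

The hierarchical lookup is what lets categories share common properties while specific frames override or extend them.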

Conceptual Dependencies and Scripts

Conceptual dependencies and scripts are two related concepts used in artificial

intelligence (AI) and cognitive science to represent and understand the structure of

knowledge and events in the world. They provide a way to organize and reason about
complex information in a structured format. Let's break down conceptual dependencies

and scripts in simple terms:

Conceptual Dependencies in Simple Terms:


1. Representing Knowledge:

● Conceptual dependencies are a way of representing knowledge by

capturing the relationships and dependencies between concepts or

events.

● They describe how different pieces of information are related and how

they depend on each other.

2. Semantic Relationships:

● Conceptual dependencies focus on semantic relationships such as agent-

action, action-object, and object-property.

● These relationships help define the meaning and structure of knowledge in

a way that is understandable to AI systems.

3. Hierarchical Organization:

● Conceptual dependencies can be organized hierarchically, with higher-

level concepts depending on more specific or lower-level concepts.

● This hierarchical organization allows for the representation of complex

knowledge structures.

4. Inference and Reasoning:

● AI systems can use conceptual dependencies to perform inference and

reasoning, making logical deductions based on the relationships between

concepts.

Scripts in Simple Terms:


1. Structured Knowledge about Events:

● Scripts are a way of representing structured knowledge about events,

routines, or procedures.

● They capture the typical sequence of actions, roles of participants, and

expected outcomes associated with different types of events.

2. Elements of Scripts:

● Scripts consist of elements such as roles (e.g., actor, object), actions (e.g.,

perform, move), conditions (e.g., prerequisites, outcomes), and temporal

order.

3. Common Scenarios:

● Scripts represent common scenarios or stereotypical sequences of events

that people encounter in daily life.

● For example, a restaurant script might include actions like "entering,"

"ordering," "eating," and "paying."

4. Understanding and Prediction:

● AI systems can use scripts to understand and predict the behavior of

agents or events by recognizing familiar patterns and sequences of

actions.

Unit III

Uninformed Search strategies-


Uninformed search is a class of general-purpose search algorithms which operate in a
brute-force way. Uninformed search algorithms have no additional information about the state
or search space other than how to traverse the tree, so they are also called blind search.

1. Breadth-first Search:
○ Breadth-first search is the most common search strategy for traversing a tree or
graph. This algorithm searches breadthwise in a tree or graph, so it is called
breadth-first search.

○ The BFS algorithm starts searching from the root node of the tree and expands all
successor nodes at the current level before moving to nodes of the next level.

○ The breadth-first search algorithm is an example of a general-graph search


algorithm.

○ Breadth-first search is implemented using a FIFO queue data structure.

Advantages:

○ BFS will provide a solution if any solution exists.

○ If there is more than one solution for a given problem, then BFS will provide the
minimal solution, i.e., the one which requires the least number of steps.

Disadvantages:

○ It requires lots of memory since each level of the tree must be saved into
memory to expand the next level.

○ BFS needs lots of time if the solution is far away from the root node.

Example:

In the below tree structure, we have shown the traversing of the tree using BFS

algorithm from the root node S to goal node K. BFS search algorithm traverse in layers,

so it will follow the path which is shown by the dotted arrow, and the traversed path will

be:

1. S---> A--->B---->C--->D---->G--->H--->E---->F---->I---->K
Time Complexity: The time complexity of the BFS algorithm can be obtained from the number of

nodes traversed in BFS until the shallowest goal node, where d = depth of the shallowest

solution and b = branching factor (number of successors per node):

T(b) = 1 + b + b^2 + b^3 + ... + b^d = O(b^d)

Space Complexity: The space complexity of the BFS algorithm is given by the memory size of

the frontier, which is O(b^d).

Completeness: BFS is complete, which means if the shallowest goal node is at some

finite depth, then BFS will find a solution.

Optimality: BFS is optimal if path cost is a non-decreasing function of the depth of the

node.
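A minimal sketch of BFS with a FIFO queue on an adjacency-list graph. The graph below is a made-up example (not the S-to-K tree in the text):

```python
from collections import deque

# Illustrative graph: node -> list of successors.
graph = {
    "S": ["A", "B"], "A": ["C"], "B": ["D"],
    "C": ["G"], "D": [], "G": [],
}

def bfs(start, goal):
    """Return a shortest path (fewest edges) from start to goal, or None."""
    frontier = deque([[start]])          # FIFO queue of partial paths
    visited = {start}
    while frontier:
        path = frontier.popleft()        # shallowest path leaves first
        node = path[-1]
        if node == goal:
            return path
        for nxt in graph[node]:
            if nxt not in visited:
                visited.add(nxt)
                frontier.append(path + [nxt])
    return None

print(bfs("S", "G"))  # ['S', 'A', 'C', 'G']
```

The FIFO discipline is exactly what makes the search level-by-level: every path of length k is dequeued before any path of length k+1.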
Depth-first Search

○ Depth-first search is a recursive algorithm for traversing a tree or graph data
structure.

○ It is called the depth-first search because it starts from the root node and follows
each path to its greatest depth node before moving to the next path.

○ DFS uses a stack data structure for its implementation.

○ The process of the DFS algorithm is similar to the BFS algorithm.

Note: Backtracking is an algorithm technique for finding all possible solutions using

recursion.

Advantage:

○ DFS requires much less memory, as it only needs to store a stack of the nodes on
the path from the root node to the current node.

○ It takes less time to reach the goal node than the BFS algorithm (if it traverses
along the right path).

Disadvantage:

○ There is the possibility that many states keep reoccurring, and there is no
guarantee of finding the solution.

○ The DFS algorithm searches deep down a path and may sometimes enter an
infinite loop.

Example:
In the below search tree, we have shown the flow of depth-first search, and it will follow

the order as:

Root node--->Left node ----> right node.

It will start searching from root node S, and traverse A, then B, then D and E. After

traversing E, it will backtrack, as E has no other successors and the goal node has

not yet been found. After backtracking it will traverse node C and then G, where it

terminates, as it has found the goal node.

Completeness: DFS search algorithm is complete within finite state space as it will

expand every node within a limited search tree.


Time Complexity: The time complexity of DFS is equivalent to the number of nodes traversed by

the algorithm. It is given by:

T(b) = 1 + b + b^2 + b^3 + ... + b^m = O(b^m)

where m = maximum depth of any node, and this can be much larger than d

(the depth of the shallowest solution).

Space Complexity: The DFS algorithm needs to store only a single path from the root node,

hence the space complexity of DFS is equivalent to the size of the fringe set, which is

O(bm).

Optimal: DFS search algorithm is non-optimal, as it may generate a large number of

steps or high cost to reach the goal node.
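A minimal sketch of DFS with an explicit stack, on an illustrative graph (not the tree in the text). Note the returned path need not be the shortest:

```python
# Illustrative graph: node -> list of successors.
graph = {
    "S": ["A", "C"], "A": ["B"], "B": ["D", "E"],
    "C": ["G"], "D": [], "E": [], "G": [],
}

def dfs(start, goal):
    """Return some path from start to goal (not necessarily the shortest)."""
    stack = [[start]]                    # LIFO stack of partial paths
    visited = set()
    while stack:
        path = stack.pop()
        node = path[-1]
        if node == goal:
            return path
        if node in visited:
            continue
        visited.add(node)
        # Push successors; the last one pushed is explored first, so
        # reversing keeps left-to-right order while still going deepest-first.
        for nxt in reversed(graph[node]):
            if nxt not in visited:
                stack.append(path + [nxt])
    return None

print(dfs("S", "G"))
```

On this graph the search dives S → A → B → D, E first, backtracks, and only then finds G via C — the backtracking behavior described above.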

Depth-Limited Search Algorithm:


A depth-limited search algorithm is similar to depth-first search with a predetermined

depth limit. Depth-limited search removes the drawback of the infinite path in depth-first

search. In this algorithm, the node at the depth limit is treated as if it has no further

successor nodes.

Depth-limited search can be terminated with two Conditions of failure:

○ Standard failure value: It indicates that the problem does not have any solution.

○ Cutoff failure value: It indicates that there is no solution to the problem within the
given depth limit.

Advantages:

Depth-limited search is Memory efficient.


Disadvantages:

○ Depth-limited search also has a disadvantage of incompleteness.

○ It may not be optimal if the problem has more than one solution.

Example:

Completeness: The DLS algorithm is complete if the solution is within the depth

limit.

Time Complexity: The time complexity of the DLS algorithm is O(b^ℓ).

Space Complexity: The space complexity of the DLS algorithm is O(b×ℓ).


Optimal: Depth-limited search can be viewed as a special case of DFS,

and it is also not optimal even if ℓ>d.

Uniform-cost Search Algorithm:


Uniform-cost search is a searching algorithm used for traversing a weighted tree or

graph. This algorithm comes into play when a different cost is available for each edge.

The primary goal of the uniform-cost search is to find a path to the goal node which has

the lowest cumulative cost. Uniform-cost search expands nodes according to their path

costs from the root node. It can be used to solve any graph/tree where the optimal cost

is in demand. A uniform-cost search algorithm is implemented by the priority queue. It

gives maximum priority to the lowest cumulative cost. Uniform cost search is

equivalent to BFS algorithm if the path cost of all edges is the same.

Advantages:

○ Uniform cost search is optimal because at every state the path with the least
cost is chosen.

Disadvantages:

○ It does not care about the number of steps involved in the search and is only
concerned with path cost, due to which this algorithm may get stuck in an
infinite loop.

Example:
Completeness:

Uniform-cost search is complete, such as if there is a solution, UCS will find it.

Time Complexity:

Let C* be the cost of the optimal solution, and ε the cost of each step toward the goal node.

Then the number of steps is C*/ε + 1. Here we have taken +1, as we start from state 0

and end at C*/ε.

Hence, the worst-case time complexity of uniform-cost search is O(b^(1 + ⌊C*/ε⌋)).

Space Complexity:

The same logic applies to space, so the worst-case space complexity of uniform-cost

search is O(b^(1 + ⌊C*/ε⌋)).


Optimal:

Uniform-cost search is always optimal as it only selects a path with the lowest path

cost.
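A minimal sketch of uniform-cost search with a priority queue (`heapq`), on a small weighted graph invented for illustration:

```python
import heapq

# Illustrative weighted graph: node -> list of (successor, edge cost).
graph = {
    "S": [("A", 1), ("B", 4)],
    "A": [("B", 2), ("G", 6)],
    "B": [("G", 2)],
    "G": [],
}

def ucs(start, goal):
    """Return (cost, path) of the cheapest path from start to goal."""
    frontier = [(0, [start])]            # priority queue ordered by path cost
    best = {}                            # cheapest known cost per node
    while frontier:
        cost, path = heapq.heappop(frontier)
        node = path[-1]
        if node == goal:
            return cost, path
        if best.get(node, float("inf")) <= cost:
            continue                     # already reached this node more cheaply
        best[node] = cost
        for nxt, step in graph[node]:
            heapq.heappush(frontier, (cost + step, path + [nxt]))
    return None

print(ucs("S", "G"))  # (5, ['S', 'A', 'B', 'G'])
```

Note the direct edge A → G costs 6, yet UCS returns the longer but cheaper route through B — more steps, lower cumulative cost, exactly as described above.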

Iterative deepening depth-first Search:


The iterative deepening algorithm is a combination of DFS and BFS algorithms. This

search algorithm finds out the best depth limit and does it by gradually increasing the

limit until a goal is found.

This algorithm performs depth-first search up to a certain "depth limit", and it keeps

increasing the depth limit after each iteration until the goal node is found.

This Search algorithm combines the benefits of Breadth-first search's fast search and

depth-first search's memory efficiency.

The iterative search algorithm is useful for uninformed search when the search space is

large, and depth of the goal node is unknown.

Advantages:

○ It Combines the benefits of BFS and DFS search algorithm in terms of fast search
and memory efficiency.

Disadvantages:

○ The main drawback of IDDFS is that it repeats all the work of the previous phase.

Example:
The following tree structure shows the iterative deepening depth-first search. The IDDFS

algorithm performs successive iterations until it finds the goal node. The iterations

performed by the algorithm are:

1st Iteration -----> A

2nd Iteration ----> A, B, C

3rd Iteration ------> A, B, D, E, C, F, G

4th Iteration ------> A, B, D, H, I, E, C, F, K, G

In the fourth iteration, the algorithm finds the goal node.

Completeness:

This algorithm is complete if the branching factor is finite.

Time Complexity:

Let's suppose b is the branching factor and d is the depth of the goal; then the worst-case

time complexity is O(b^d).

Space Complexity:

The space complexity of IDDFS is O(bd).

Optimal:

IDDFS algorithm is optimal if path cost is a non- decreasing function of the depth of the

node.
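IDDFS can be sketched as a depth-limited DFS wrapped in a loop over growing limits. The tree below mirrors the iteration example in the text (goal K at depth 3):

```python
# Tree matching the example above: A's subtree, goal node "K" under F.
tree = {
    "A": ["B", "C"], "B": ["D", "E"], "C": ["F", "G"],
    "D": ["H", "I"], "E": [], "F": ["K"], "G": [],
    "H": [], "I": [], "K": [],
}

def dls(node, goal, limit):
    """Depth-limited DFS: give up once the depth limit is exhausted."""
    if node == goal:
        return [node]
    if limit == 0:
        return None                      # cutoff reached
    for child in tree[node]:
        found = dls(child, goal, limit - 1)
        if found is not None:
            return [node] + found
    return None

def iddfs(start, goal, max_depth=10):
    for limit in range(max_depth + 1):   # limit 0, 1, 2, ... until found
        result = dls(start, goal, limit)
        if result is not None:
            return result
    return None

print(iddfs("A", "K"))  # ['A', 'C', 'F', 'K'] — found on the limit-3 pass
```

Each pass repeats the previous pass's work (the IDDFS drawback noted above), but the memory used is only that of a single DFS.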

Bidirectional Search Algorithm:


The bidirectional search algorithm runs two simultaneous searches, one from the initial state,

called forward search, and the other from the goal node, called backward search, to find

the goal node. Bidirectional search replaces one single search graph with two small

subgraphs, in which one starts the search from the initial vertex and the other starts from the

goal vertex. The search stops when these two graphs intersect each other.

Bidirectional search can use search techniques such as BFS, DFS, DLS, etc.


Advantages:

○ Bidirectional search is fast.

○ Bidirectional search requires less memory

Disadvantages:
○ Implementation of the bidirectional search tree is difficult.

○ In bidirectional search, one should know the goal state in advance.

Example:

In the below search tree, bidirectional search algorithm is applied. This algorithm

divides one graph/tree into two sub-graphs. It starts traversing from node 1 in the

forward direction and starts from goal node 16 in the backward direction.

The algorithm terminates at node 9 where two searches meet.

Completeness: Bidirectional Search is complete if we use BFS in both searches.

Time Complexity: The time complexity of bidirectional search using BFS is O(b^(d/2)).


Space Complexity: The space complexity of bidirectional search is O(b^(d/2)).

Optimal: Bidirectional search is Optimal.

Informed Search: Knowledge-based search algorithms (also known as heuristic search)

incorporate heuristic information, such as estimated costs, to guide the search process

efficiently. Greedy best-first search is one such algorithm, but it does not guarantee an

optimal (shortest) path.

Hill Climbing:

○ Hill climbing algorithm is a local search algorithm which continuously moves in


the direction of increasing elevation/value to find the peak of the mountain or
best solution to the problem. It terminates when it reaches a peak value where no
neighbor has a higher value.

○ Hill climbing algorithm is a technique which is used for optimizing the


mathematical problems. One of the widely discussed examples of Hill climbing
algorithm is Traveling-salesman Problem in which we need to minimize the
distance traveled by the salesman.

○ It is also called greedy local search as it only looks to its good immediate
neighbor state and not beyond that.

○ A node of hill climbing algorithm has two components which are state and value.

○ Hill Climbing is mostly used when a good heuristic is available.

○ In this algorithm, we don't need to maintain and handle the search tree or graph
as it only keeps a single current state.

Features of Hill Climbing:


Following are some main features of Hill Climbing Algorithm:
○ Generate and Test variant: Hill climbing is a variant of the Generate and Test
method. The Generate and Test method produces feedback which helps to decide
which direction to move in the search space.

○ Greedy approach: Hill-climbing algorithm search moves in the direction which


optimizes the cost.

○ No backtracking: It does not backtrack the search space, as it does not


remember the previous states.

State-space Diagram for Hill Climbing:


The state-space landscape is a graphical representation of the hill-climbing algorithm, showing a graph between the various states of the algorithm and the objective function/cost.

On the Y-axis we take the function, which can be an objective function or a cost function, and the state space on the X-axis. If the function on the Y-axis is cost, then the goal of the search is to find the global minimum (while avoiding getting trapped in local minima). If the function on the Y-axis is an objective function, then the goal of the search is to find the global maximum (while avoiding getting trapped in local maxima).
Different regions in the state space landscape:
Local Maximum: Local maximum is a state which is better than its neighbor states, but

there is also another state which is higher than it.

Global Maximum: Global maximum is the best possible state of state space landscape.

It has the highest value of objective function.

Current state: It is a state in a landscape diagram where an agent is currently present.

Flat local maximum: It is a flat space in the landscape where all the neighbor states of

current states have the same value.

Shoulder: It is a plateau region which has an uphill edge.

Types of Hill Climbing Algorithm:

○ Simple hill Climbing:


○ Steepest-Ascent hill-climbing:

○ Stochastic hill Climbing:

1. Simple Hill Climbing:

Simple hill climbing is the simplest way to implement a hill climbing algorithm. It evaluates only one neighbor node state at a time and selects the first one that improves the current cost, setting it as the current state. It checks only one successor state at a time; if that successor is better than the current state, it moves there, else it stays in the same state. This algorithm has the following features:

○ Less time consuming

○ The solution found is often suboptimal, and an optimal solution is not guaranteed

Algorithm for Simple Hill Climbing:

○ Step 1: Evaluate the initial state, if it is goal state then return success and Stop.

○ Step 2: Loop Until a solution is found or there is no new operator left to apply.

○ Step 3: Select and apply an operator to the current state.

○ Step 4: Check new state:

a. If it is goal state, then return success and quit.

b. Else if it is better than the current state then assign new state as a current
state.

c. Else if not better than the current state, then return to step2.

○ Step 5: Exit.
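The steps above can be sketched in Python on a toy problem; the objective function and neighbor generator below are hypothetical choices made for this example.

```python
def simple_hill_climbing(start, objective, neighbors):
    """Move to the FIRST neighbor that improves the objective; stop at a peak."""
    current = start
    while True:
        improved = False
        for candidate in neighbors(current):
            if objective(candidate) > objective(current):
                current = candidate      # take the first better neighbor found
                improved = True
                break
        if not improved:                 # no neighbor is better: a (local) peak
            return current

# Toy problem: maximize f(x) = -(x - 3)^2 over integers, stepping by +/-1.
f = lambda x: -(x - 3) ** 2
step = lambda x: [x - 1, x + 1]

print(simple_hill_climbing(0, f, step))  # climbs 0 -> 1 -> 2 -> 3 and stops at 3
```

Note that on this toy objective there is a single peak, so the local peak found is also the global one; on harder landscapes simple hill climbing can stop at a local maximum.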

2. Steepest-Ascent hill climbing:


The steepest-ascent algorithm is a variation of the simple hill climbing algorithm. It examines all the neighboring nodes of the current state and selects the neighbor node which is closest to the goal state. This algorithm consumes more time because it evaluates multiple neighbors before each move.

Algorithm for Steepest-Ascent hill climbing:

○ Step 1: Evaluate the initial state, if it is goal state then return success and stop,
else make current state as initial state.

○ Step 2: Loop until a solution is found or the current state does not change.

a. Let SUCC be a state such that any successor of the current state will be
better than it.

b. For each operator that applies to the current state:

a. Apply the new operator and generate a new state.

b. Evaluate the new state.

c. If it is goal state, then return it and quit, else compare it to the


SUCC.

d. If it is better than SUCC, then set new state as SUCC.

e. If the SUCC is better than the current state, then set current state to
SUCC.

○ Step 3: Exit.
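A sketch of the steepest-ascent variant on the same kind of toy objective (again a hypothetical function chosen for illustration): unlike simple hill climbing, all neighbors are evaluated and the best one (the SUCC of the algorithm above) is taken.

```python
def steepest_ascent(start, objective, neighbors):
    """At each step move to the BEST neighbor; stop when none improves."""
    current = start
    while True:
        # Evaluate all neighbors and keep the best one (SUCC).
        succ = max(neighbors(current), key=objective)
        if objective(succ) <= objective(current):
            return current               # no neighbor beats the current state
        current = succ

f = lambda x: -(x - 3) ** 2
print(steepest_ascent(-5, f, lambda x: [x - 1, x + 1]))  # -> 3
```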

3. Stochastic hill climbing:

Stochastic hill climbing does not examine all of its neighbors before moving. Rather, this search algorithm selects one neighbor node at random and decides whether to move to it or to examine another candidate.


Problems in Hill Climbing Algorithm:
1. Local Maximum: A local maximum is a peak state in the landscape which is better

than each of its neighboring states, but there is another state also present which is

higher than the local maximum.

Solution: Backtracking technique can be a solution of the local maximum in state space

landscape. Create a list of the promising path so that the algorithm can backtrack the

search space and explore other paths as well.

2. Plateau: A plateau is a flat area of the search space in which all the neighbor states of the current state have the same value. Because of this, the algorithm cannot find a best direction to move, and a hill-climbing search may get lost in the plateau area.

Solution: One solution for a plateau is to take bigger (or much smaller) steps while searching. For example, randomly select a state far away from the current state, so that the algorithm has a chance of landing in a non-plateau region.
3. Ridges: A ridge is a special form of the local maximum. It has an area which is higher

than its surrounding areas, but itself has a slope, and cannot be reached in a single

move.

Solution: With the use of bidirectional search, or by moving in different directions, we

can improve this problem.

Simulated Annealing:

A hill-climbing algorithm that never makes a move toward a lower value is guaranteed to be incomplete, because it can get stuck on a local maximum. If the algorithm instead performs a pure random walk, moving to a successor at random, it may be complete but is very inefficient. Simulated Annealing is an algorithm which yields both efficiency and completeness.

In metallurgy, annealing is the process of heating a metal or glass to a high temperature and then cooling it gradually, which allows the material to reach a low-energy crystalline state. Simulated annealing borrows this idea: the algorithm picks a random move instead of the best move. If the random move improves the state, it is accepted. Otherwise, the algorithm accepts the downhill move only with some probability less than 1 (a probability that shrinks as the "temperature" is lowered) and continues the search from whichever state results.
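A minimal sketch of simulated annealing on a toy problem; the initial temperature, cooling rate, and step count below are hypothetical tuning choices, as is the objective function.

```python
import math
import random

def simulated_annealing(start, objective, neighbor, t0=10.0, cooling=0.95, steps=500):
    """Accept improving moves always; accept worse moves with probability
    exp(delta / T), where the temperature T gradually cools toward zero."""
    current, t = start, t0
    for _ in range(steps):
        candidate = neighbor(current)
        delta = objective(candidate) - objective(current)
        if delta > 0 or random.random() < math.exp(delta / t):
            current = candidate          # downhill moves are allowed early on
        t = max(t * cooling, 1e-9)       # geometric cooling schedule
    return current

random.seed(0)
f = lambda x: -(x - 3) ** 2              # toy objective with its peak at x = 3
move = lambda x: x + random.choice([-1, 1])
print(simulated_annealing(20, f, move))
```

As the temperature drops, the acceptance probability for worse moves goes to zero and the algorithm behaves like plain hill climbing, so the final state settles near the optimum.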

Genetic Algorithm
Before understanding the Genetic algorithm, let's first understand basic terminologies to

better understand this algorithm:

○ Population: Population is the subset of all possible or probable solutions, which
can solve the given problem.

○ Chromosomes: A chromosome is one of the solutions in the population for the


given problem, and the collection of gene generate a chromosome.

○ Gene: A chromosome is divided into a different gene, or it is an element of the


chromosome.

○ Allele: Allele is the value provided to the gene within a particular chromosome.

○ Fitness Function: The fitness function is used to determine the individual's


fitness level in the population. It means the ability of an individual to compete
with other individuals. In every iteration, individuals are evaluated based on their
fitness function.

○ Genetic Operators: In a genetic algorithm, the best individual mate to regenerate


offspring better than parents. Here genetic operators play a role in changing the
genetic composition of the next generation.

○ Selection

After calculating the fitness of every individual in the population, a selection process is used to determine which individuals in the population will get to reproduce and produce the offspring that will form the next generation.

Types of selection methods available:

○ Roulette wheel selection

○ Tournament selection

○ Rank-based selection

So, now we can define a genetic algorithm as a heuristic search algorithm to solve

optimization problems. It is a subset of evolutionary algorithms, which is used in


computing. A genetic algorithm uses genetic and natural selection concepts to solve

optimization problems.

How Genetic Algorithm Work?


The genetic algorithm works on the evolutionary generational cycle to generate high-

quality solutions. These algorithms use different operations that either enhance or

replace the population to give an improved fit solution.

It basically involves five phases to solve the complex optimization problems, which are

given as below:

○ Initialization

○ Fitness Assignment

○ Selection

○ Reproduction

○ Termination

1. Initialization

The process of a genetic algorithm starts by generating a set of individuals, called the population. Here each individual is a candidate solution for the given problem. An individual is characterized by a set of parameters called genes. Genes are joined into a string to form a chromosome, which encodes the solution to the problem. One of the most popular techniques for initialization is the use of random binary strings.

2. Fitness Assignment

The fitness function is used to determine how fit an individual is, that is, the ability of an individual to compete with other individuals. In every iteration, individuals are evaluated based on their fitness function. The fitness function assigns a fitness score to each individual, and this score determines the probability of being selected for reproduction. The higher the fitness score, the greater the chance of being selected for reproduction.

3. Selection

The selection phase involves selecting individuals for the reproduction of offspring. The selected individuals are then arranged in pairs of two for reproduction, and these individuals transfer their genes to the next generation.

There are three types of Selection methods available, which are:

○ Roulette wheel selection


○ Tournament selection

○ Rank-based selection

4. Reproduction

After the selection process, the creation of a child occurs in the reproduction step. In

this step, the genetic algorithm uses two variation operators that are applied to the

parent population. The two operators involved in the reproduction phase are given

below:

○ Crossover: The crossover plays a most significant role in the reproduction phase
of the genetic algorithm. In this process, a crossover point is selected at random
within the genes. Then the crossover operator swaps genetic information of two
parents from the current generation to produce a new individual representing the
offspring.

The genes of the parents are exchanged with each other until the crossover point is reached. The newly generated offspring are added to the population. This process is also called recombination. Types of crossover styles available:

○ One-point crossover

○ Two-point crossover

○ Uniform crossover


○ Mutation
The mutation operator inserts random genes in the offspring (new child) to
maintain the diversity in the population. It can be done by flipping some bits in
the chromosomes.
Mutation helps in solving the issue of premature convergence and enhances diversification.
Types of mutation styles available:

○ Flip bit mutation

○ Gaussian mutation

○ Exchange/Swap mutation

5. Termination

After the reproduction phase, a stopping criterion is applied as a base for termination.

The algorithm terminates after the threshold fitness solution is reached. It will identify

the final solution as the best solution in the population.

General Workflow of a Simple Genetic Algorithm
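This workflow can be sketched end-to-end in Python. The example below evolves a bit string toward all ones (the classic "OneMax" toy problem); the population size, mutation rate, tournament selection, and one-point crossover are illustrative choices, not the only possible ones.

```python
import random

random.seed(42)
GENES, POP, GENERATIONS, MUT_RATE = 20, 30, 200, 0.02

fitness = lambda chrom: sum(chrom)        # OneMax: count of 1-bits

def tournament(pop):                      # selection: best of 3 random individuals
    return max(random.sample(pop, 3), key=fitness)

def crossover(a, b):                      # one-point crossover
    point = random.randrange(1, GENES)
    return a[:point] + b[point:]

def mutate(chrom):                        # flip-bit mutation
    return [g ^ 1 if random.random() < MUT_RATE else g for g in chrom]

# 1. Initialization: a population of random binary strings
population = [[random.randint(0, 1) for _ in range(GENES)] for _ in range(POP)]

for _ in range(GENERATIONS):              # 2-4. fitness, selection, reproduction
    population = [mutate(crossover(tournament(population), tournament(population)))
                  for _ in range(POP)]
    best = max(population, key=fitness)
    if fitness(best) == GENES:            # 5. Termination: threshold fitness reached
        break

print(fitness(best))
```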


Advantages of Genetic Algorithm

○ Genetic algorithms have strong parallel capabilities.

○ It helps in optimizing various problems such as discrete functions, multi-


objective problems, and continuous functions.
○ It provides a solution for a problem that improves over time.

○ A genetic algorithm does not need derivative information.

Limitations of Genetic Algorithms

○ Genetic algorithms are not efficient algorithms for solving simple problems.

○ It does not guarantee the quality of the final solution to a problem.

○ Repetitive calculation of fitness values may generate some computational


challenges.

How Greedy Best-First Search Works?


● Greedy Best-First Search works by evaluating each candidate node with a heuristic function and expanding the node that appears closest to the goal.

● The algorithm uses a heuristic function to determine which path is the most promising.

● The heuristic function estimates how close a node is to the goal; unlike A*, greedy best-first search considers only this estimate, not the cost already incurred on the current path.

● At each step, the node with the lowest heuristic estimate is expanded. This process is repeated until the goal is reached.

An example of the best-first search algorithm is the graph below; suppose we have to find the path from A to G.

The values in red represent the heuristic value of reaching the goal node G from the current node.

1) We start from A. From A there are direct paths to node B (heuristic value 32), to C (heuristic value 25), and to D (heuristic value 35).

2) As per the best-first search algorithm, we choose the path with the lowest heuristic value; currently C has the lowest value among these nodes, so we go from A to C.

3) From C we have direct paths to F (heuristic value 17) and to E (heuristic value 19), so we go from C to F.

4) From F we have a direct path to the goal node G (heuristic value 0), so we go from F to G.

5) The goal node G has now been reached, and the path we follow is A->C->F->G.
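The worked example above can be sketched in Python. The adjacency list and heuristic values mirror the example graph; the code uses a priority queue ordered by heuristic value only, which is what makes the search "greedy".

```python
import heapq

# Graph and heuristic values from the worked example. The heuristic of the
# start node A is not given in the text, so any value works (A is popped first).
graph = {'A': ['B', 'C', 'D'], 'B': [], 'C': ['E', 'F'],
         'D': [], 'E': [], 'F': ['G'], 'G': []}
h = {'A': 40, 'B': 32, 'C': 25, 'D': 35, 'E': 19, 'F': 17, 'G': 0}

def greedy_best_first(start, goal):
    frontier = [(h[start], start, [start])]   # priority queue ordered by h only
    visited = set()
    while frontier:
        _, node, path = heapq.heappop(frontier)
        if node == goal:
            return path
        if node in visited:
            continue
        visited.add(node)
        for nxt in graph[node]:
            heapq.heappush(frontier, (h[nxt], nxt, path + [nxt]))
    return None                               # open list exhausted: no path

print(greedy_best_first('A', 'G'))   # ['A', 'C', 'F', 'G']
```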

Advantages of Greedy Best-First Search:


● Simple and Easy to Implement: Greedy Best-First Search is a

relatively straightforward algorithm, making it easy to implement.

● Fast and Efficient: Greedy Best-First Search is a very fast algorithm,

making it ideal for applications where speed is essential.


● Low Memory Requirements: Greedy Best-First Search requires only

a small amount of memory, making it suitable for applications with

limited memory.

● Flexible: Greedy Best-First Search can be adapted to different types

of problems and can be easily extended to more complex problems.

● Efficiency: If the heuristic function used in Greedy Best-First Search estimates well how close a node is to the solution, the algorithm can be very efficient and find a solution quickly, even in large search spaces.

Disadvantages of Greedy Best-First Search:


● Inaccurate Results: Greedy Best-First Search is not always

guaranteed to find the optimal solution, as it is only concerned with

finding the most promising path.

● Local Optima: Greedy Best-First Search can get stuck in local

optima, meaning that the path chosen may not be the best possible

path.

● Heuristic Function: Greedy Best-First Search requires a heuristic

function in order to work, which adds complexity to the algorithm.

● Lack of Completeness: Greedy Best-First Search is not a complete algorithm, meaning it may not always find a solution if one exists. This can happen if the algorithm gets stuck in a cycle or if the search space is too complex.

The A* search algorithm combines the cost already incurred on a path with a heuristic estimate of the remaining cost, and it is this combination that separates it from other traversal techniques. This makes A* smart and pushes it well ahead of conventional algorithms.

Let’s try to understand Basic AI Concepts and comprehend how does A*


algorithm work. Imagine a huge maze that is too big that it takes hours to
reach the endpoint manually. Once you complete it on foot, you need to go for
another one. This implies that you would end up investing a lot of time and
effort to find the possible paths in this maze. Now, you want to make it less
time-consuming. To make it easier, we will consider this maze as a search
problem and will try to apply it to other possible mazes we might encounter in
due course, provided they follow the same structure and rules.

As the first step to converting this maze into a search problem, we need to
define these six things.

1. A set of prospective states we might be in


2. A beginning and end state
3. A way to decide if we’ve reached the endpoint
4. A set of actions in case of possible direction/path changes
5. A function that advises us about the result of an action
6. A set of costs incurring in different states/paths of movement
To solve the problem, we need to map the intersections to the nodes (denoted
by the red dots) and all the possible ways we can make movements towards
the edges (denoted by the blue lines).
A denotes the starting point, and B denotes the endpoint. We define the
starting and endpoints at nodes A and B, respectively.
If we use an uninformed search algorithm, it would be like finding a path that
is blind, while an informed algorithm for a search problem would take the path
that brings you closer to your destination. For instance, consider Rubik’s cube;
it has many prospective states that you can be in, making the solution very
difficult. This calls for the use of a guided search algorithm to find a solution.
This explains the importance of A*.
Unlike other algorithms, A* decides to take up a step only if it is convincingly
sensible and reasonable as per its functions. This means it never considers
any non-optimal steps. This is why A* is a popular choice for AI systems that
replicate the real world – like video games and machine learning.


A* Search Algorithm Steps

Step 1: Add the beginning node to the open list


Step 2: Repeat the following step

In the open list, find the square with the lowest F cost; this is the current square. Move it to the closed list.
Consider the 8 squares adjacent to the current square, and ignore any that are on the closed list or not workable. For each workable square, do the following.

Check if it is on the open list; if not, add it. Make the current square this square's parent, and record the square's costs: the F, G, and H costs.

If it is already on the open list, use the G cost to check for a better path; the lower the G cost, the better the path. If this path is better, make the current square the parent square, and recalculate the G and F scores of this square.

– You’ll stop when:

The target square has been added to the closed list, in which case the path has been found, or

The open list is empty and you could not find the target square, in which case there is no path.

Step 3. Now you can save the path and work backward, starting from the
target square, going to the parent square from each square you go, till it takes
you to the starting square. You’ve found your path now.

Why is A* Search Algorithm Preferred?


It’s easy to give movement to objects. But pathfinding is not simple. It is a
complex exercise. The following situation explains it.

The task is to take the unit you see at the bottom of the diagram to the top of
it. You can see that nothing indicates that the object should not take the path
denoted with pink lines. So it chooses to move that way. As and when it
reaches the top, it has to change its direction because of the ‘U’ shaped
obstacle. Then it changes direction and goes around the obstacle to reach the
top. In contrast to this, A* would have scanned the area above the object and
found a short path (denoted with blue lines). Thus, pathfinder algorithms like
A* help you plan things rather than waiting until you discover the problem.
They act proactively rather than reacting to a situation. The disadvantage is
that it is a bit slower than the other algorithms. You can use a combination of
both to achieve better results – pathfinding algorithms give a bigger picture
and long paths with obstacles that change slowly, and movement algorithms
for a local picture and short paths with obstacles that change faster.



A* Search Algorithm and Its Basic Concepts

A* algorithm works based on heuristic methods, and this helps achieve


optimality. A* is a different form of the best-first algorithm. Optimality
empowers an algorithm to find the best possible solution to a problem. Such
algorithms also offer completeness; if there is any solution possible to an
existing problem, the algorithm will definitely find it.

When A* enters a problem, it first calculates the cost to travel to the neighboring nodes and chooses the node with the lowest cost. If f(n) denotes the cost, A* chooses the node with the lowest f(n) value, where 'n' denotes a neighboring node. The value is calculated as shown below:

f(n) = g(n) + h(n)

g(n) = the cost of the path found so far from the starting node to node n
h(n) = the heuristic approximation of the remaining cost from node n to the goal

The heuristic value has an important role in the efficiency of the A* algorithm.
To find the best solution, you might have to use different heuristic functions
according to the type of the problem. However, the creation of these functions
is a difficult task, and this is the basic problem we face in AI.

What is a Heuristic Function?

A heuristic function (often simply called a heuristic) helps rank the alternatives available in a search algorithm at each of its steps. It can either produce a result on its own or work in conjunction with a given algorithm to create a result.
Essentially, a heuristic function helps algorithms to make the best decision
faster and more efficiently. This ranking is based on the best available
information and helps the algorithm decide the best possible branch to follow.
Admissibility and consistency are the two fundamental properties of a heuristic
function.

Admissibility of the Heuristic Function

A heuristic function is admissible if it never overestimates the real cost of reaching the end node from a node 'n'; its estimate is always less than or equal to the true remaining cost. Admissibility is what allows A* to guarantee an optimal solution.

Consistency of the Heuristic Function

A heuristic function is consistent if, for every node n and each of its neighbors n', the estimate h(n) is less than or equal to the cost of reaching that neighbor plus the neighbor's own estimate, i.e., h(n) ≤ c(n, n') + h(n'). A consistent heuristic is always admissible.

A* is indeed a very powerful algorithm used to increase the performance of


artificial intelligence. It is one of the most popular search algorithms in AI. The
sky is the limit when it comes to the potential of this algorithm. However, the
efficiency of an A* algorithm highly depends on the quality of its heuristic
function. Wonder why this algorithm is preferred and used in many software
systems? There is no single facet of AI where the A* algorithm has not found
its application. From search optimization to games, robotics, and machine
learning, the A* algorithm is an inevitable part of a smart program.

Implementation with Python


In this section, we are going to find out how the A* search algorithm can be
used to find the most cost-effective path in a graph. Consider the following
graph below.

The numbers written on edges represent the distance between the nodes,
while the numbers written on nodes represent the heuristic values. Let us find
the most cost-effective path to reach from start state A to final state G using
the A* Algorithm.
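Since the original graph figure is not reproduced here, the sketch below uses a small hypothetical graph: the edge costs and (admissible) heuristic values are invented for illustration. The code finds the cheapest path from start state A to final state G by expanding the node with the lowest f(n) = g(n) + h(n).

```python
import heapq

# Hypothetical stand-in for the figure: edge costs and admissible heuristics.
graph = {'A': {'B': 2, 'E': 3}, 'B': {'C': 1, 'G': 9},
         'E': {'D': 6}, 'D': {'G': 1}, 'C': {}, 'G': {}}
h = {'A': 8, 'B': 6, 'C': 99, 'D': 1, 'E': 7, 'G': 0}

def a_star(start, goal):
    # Each frontier entry: (f = g + h, g so far, node, path so far)
    frontier = [(h[start], 0, start, [start])]
    best_g = {start: 0}
    while frontier:
        f, g, node, path = heapq.heappop(frontier)
        if node == goal:
            return path, g
        for nxt, cost in graph[node].items():
            g2 = g + cost
            if g2 < best_g.get(nxt, float('inf')):   # keep only cheaper paths
                best_g[nxt] = g2
                heapq.heappush(frontier, (g2 + h[nxt], g2, nxt, path + [nxt]))
    return None, float('inf')

path, cost = a_star('A', 'G')
print(path, cost)   # ['A', 'E', 'D', 'G'] 10
```

Note how the direct-looking route through B costs 11, while A* settles on A-E-D-G with total cost 10 because f(n) accounts for both the distance traveled and the heuristic estimate.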

Memory-bounded heuristic search algorithms are designed to solve search


problems while keeping memory usage within fixed limits. These algorithms extend traditional heuristic search methods by incorporating mechanisms to manage and bound memory consumption. This is crucial for dealing with large search spaces where memory constraints are significant. The main memory-bounded heuristic search algorithms include IDA* (Iterative Deepening A*), RBFS (Recursive Best-First Search), and SMA* (Simplified Memory-bounded A*).

UNIT IV

Natural language processing

Natural Language Processing (NLP) is a field of artificial intelligence that focuses on the

interaction between computers and humans through natural language. The ultimate

goal of NLP is to enable computers to understand, interpret, and respond to human

language in a way that is both meaningful and useful.

Grammars

In the context of natural language processing (NLP), grammars refer to formal systems

that describe the structure and syntax of languages. Grammars in NLP help define the

rules and patterns that govern how words and phrases are combined to form valid

sentences in a language. These grammars serve as the foundation for various NLP

tasks, such as parsing, text generation, and syntactic analysis. Let's delve deeper into

the concept of grammars in NLP:

Types of Grammars
1. Phrase Structure Grammar (PSG):

● Defines the structure of sentences in terms of constituent phrases (e.g.,

noun phrases, verb phrases) and rules for combining them.

● Each rule specifies how one or more phrases can be combined to form a

larger phrase.

2. Dependency Grammar (DG):


● Focuses on the relationships (dependencies) between words in a

sentence.

● Each word is associated with a head word, and the dependencies

represent syntactic relationships like subject, object, modifier, etc.

3. Transformational Grammar:

● Specifies a set of transformation rules that derive one sentence from

another.

● Useful for analyzing syntactic structures and generating sentences.

Parsing

Parsing in the context of natural language processing (NLP) refers to the process of

analyzing the grammatical structure of a sentence to determine its syntactic

components and their relationships. The primary goal of parsing is to understand the

structure of sentences according to the rules defined by a formal grammar. Let's break

down the parsing process:

Steps in Parsing
1. Tokenization:

● The first step is to tokenize the input text into smaller units, such as words

or tokens. This step breaks the text into individual elements that can be

analyzed further.

2. Part-of-Speech (POS) Tagging:

● Each token is assigned a part-of-speech tag (e.g., noun, verb, adjective)

based on its grammatical role in the sentence. POS tagging helps

disambiguate the meaning of words and assists in syntactic analysis.


3. Parsing Algorithm:

● Various parsing algorithms are used to analyze the syntactic structure of

sentences. One common approach is to use a formal grammar, such as a

context-free grammar (CFG), along with parsing algorithms like recursive

descent, bottom-up parsing, or chart parsing.

4. Tree Representation:

● The output of parsing is typically represented as a syntactic parse tree or

a dependency tree. These trees depict the hierarchical structure of the

sentence, with nodes representing constituents (phrases) and edges

representing syntactic relationships.

5. Semantic Analysis (Optional):

● In some cases, parsing may involve semantic analysis to determine the

meaning of the sentence beyond its syntactic structure. This step may

involve resolving ambiguities, identifying named entities, or extracting

semantic roles
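A miniature pure-Python sketch of steps 1-4, using a hypothetical hand-made lexicon for POS tagging and a toy grammar (S -> NP VP, NP -> Det N, VP -> V NP). Real parsers use trained taggers and broad-coverage grammars rather than fixed dictionaries.

```python
# Toy lexicon for POS tagging (hypothetical; real taggers are trained on corpora).
LEXICON = {'the': 'Det', 'a': 'Det', 'dog': 'N', 'cat': 'N',
           'sees': 'V', 'chases': 'V'}

def tokenize(text):
    return text.lower().split()                  # step 1: tokenization

def pos_tag(tokens):
    return [LEXICON[t] for t in tokens]          # step 2: POS tagging

def parse(tags):
    """Step 3: recursive-descent check of the tag sequence against the toy
       grammar  S -> NP VP,  NP -> Det N,  VP -> V NP."""
    def np(i):                                   # NP -> Det N
        return i + 2 if tags[i:i + 2] == ['Det', 'N'] else None
    def vp(i):                                   # VP -> V NP
        if i < len(tags) and tags[i] == 'V':
            return np(i + 1)
        return None
    end = np(0)                                  # S -> NP VP
    end = vp(end) if end is not None else None
    return end == len(tags)                      # parsed the whole sentence?

tokens = tokenize("The dog chases a cat")
print(pos_tag(tokens))          # ['Det', 'N', 'V', 'Det', 'N']
print(parse(pos_tag(tokens)))   # True
```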

Pattern Recognition

Pattern recognition is the process of identifying and interpreting patterns or regularities

in data. These patterns can exist in various forms, including images, sounds, texts,

signals, and other types of data. The goal of pattern recognition is to make sense of

complex data by recognizing similarities, differences, and underlying structures, which

can then be used for tasks such as classification, prediction, or decision-making.

Here's a simplified explanation of how pattern recognition works:


1. Data Collection: The first step in pattern recognition is to collect data. This could

involve gathering images, recordings, texts, or other types of data from various

sources.

2. Feature Extraction: Once the data is collected, relevant features or characteristics

are extracted from it. These features could be visual attributes like shapes and

colors in images, frequency components in signals, or linguistic properties in

texts.

3. Pattern Representation: The extracted features are then used to represent the

patterns in the data in a meaningful way. This could involve encoding the

features into a mathematical representation or a feature vector that can be

processed by pattern recognition algorithms.

4. Pattern Classification: In pattern classification, the goal is to assign a label or

category to each pattern based on its features. This is typically done using

machine learning algorithms such as neural networks, support vector machines,

or decision trees, which learn from labeled examples to classify new patterns

accurately.

5. Pattern Recognition and Interpretation: Once the patterns are classified, the

results are interpreted to make sense of the data. This could involve identifying

objects in images, recognizing words in speech, detecting anomalies in signals,

or predicting future trends based on historical data.

6. Feedback and Improvement: Pattern recognition systems often incorporate

feedback mechanisms to improve their performance over time. This could

involve retraining the algorithms with new data, refining the feature extraction

process, or adjusting the classification criteria based on user feedback.
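Steps 2-4 (feature extraction, pattern representation, and classification) can be sketched with a tiny nearest-neighbor classifier; the 2-D feature vectors and labels below are invented for illustration.

```python
import math

# Hand-made training data: (feature vector, label). The two features might be,
# for example, "ear roundness" and "snout length" extracted in an earlier step.
TRAIN = [((1.0, 1.2), 'cat'), ((0.8, 1.0), 'cat'),
         ((3.0, 3.5), 'dog'), ((3.2, 2.9), 'dog')]

def classify(x, k=3):
    """k-nearest-neighbor classification: vote among the k closest examples."""
    nearest = sorted(TRAIN, key=lambda fv: math.dist(fv[0], x))[:k]
    labels = [label for _, label in nearest]
    return max(set(labels), key=labels.count)    # majority vote

print(classify((0.9, 1.1)))   # 'cat'
print(classify((3.1, 3.0)))   # 'dog'
```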

Recognition:
a. Recognition is the process of identifying or categorizing an object, pattern,

or entity based on its features or characteristics.

b. It involves matching the observed data (e.g., an image, signal, or

sequence) with known patterns or templates stored in a database or

memory.

c. Recognition can be performed in various domains, including computer

vision (e.g., recognizing objects in images), speech recognition (e.g.,

identifying spoken words), and pattern matching (e.g., recognizing

patterns in time-series data).

d. Recognition often involves a search process to find the best match

between the observed data and the stored patterns, followed by a

decision-making step to determine the identity or category of the

recognized object.

Classification:

e. Classification is a specific type of recognition task that involves assigning

an observed object or data point to one of several predefined categories

or classes.

f. It typically involves training a classification model using labeled examples

(i.e., data points with known categories) and then using this model to

predict the category of new, unseen data points.

g. Classification algorithms can range from simple methods such as nearest

neighbor or decision trees to more complex techniques like support vector

machines or deep neural networks.


h. The goal of classification is to accurately assign each data point to the

correct category based on its features, enabling tasks such as document

categorization, disease diagnosis, or spam detection.

Theoretic classification

Theoretic classification refers to the categorization of objects, concepts, or phenomena

based on theoretical frameworks or models. This type of classification is grounded in

theories that provide a systematic understanding of the attributes and relationships

among the items being classified. In the context of artificial intelligence and machine

learning, theoretic classification involves the use of formal theories to guide the

development and application of classification algorithms.

Syntactic classification

Syntactic classification is a method used in pattern recognition and machine learning

that classifies objects based on their structural or syntactic properties. Unlike other

classification methods that might rely purely on statistical features or numerical data,

syntactic classification focuses on the arrangement and relationships between the

components of the objects being classified.

Learning classification patterns

Learning classification patterns is a way for computers to learn how to sort things into

different categories based on examples. It's like teaching a computer to recognize

whether a picture is of a cat or a dog by showing it many examples of both.

Key Concepts
1. Training Data:

● This is a set of examples that we use to teach the computer. Each

example is labeled with the correct category. For instance, pictures of cats

are labeled "cat" and pictures of dogs are labeled "dog".

2. Features:

● Features are the important pieces of information from each example that

help in making the decision. For a picture, features could be things like the

shape of the ears, the color of the fur, or the presence of a tail.

3. Model:

● The model is what the computer learns from the training data. It’s a set of

rules or a mathematical formula that the computer uses to make

predictions about new examples.

4. Classification:

● This is the process of using the model to assign a new example to one of

the categories. For example, after learning from many pictures, the

computer uses its model to decide whether a new picture is a cat or a dog.
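The four key concepts above can be sketched with a toy nearest-neighbour classifier. The feature values (hypothetical ear-pointiness and snout-length measurements) and the labels are invented for illustration; a real system would extract features from images.

```python
import math

# Training data: each example is (features, label).
# Features are hypothetical measurements: (ear_pointiness, snout_length).
training_data = [
    ((0.9, 0.2), "cat"),
    ((0.8, 0.3), "cat"),
    ((0.3, 0.8), "dog"),
    ((0.2, 0.9), "dog"),
]

def classify(features):
    """Assign the label of the closest training example (1-nearest neighbour)."""
    _, label = min(training_data,
                   key=lambda ex: math.dist(ex[0], features))  # Euclidean distance
    return label

print(classify((0.85, 0.25)))  # lands near the cat examples -> "cat"
```

Here the "model" is simply the stored training data plus the distance rule; more sophisticated learners compress the data into weights or rules instead.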

Recognizing and understanding speech involves the ability of a computer system to

interpret spoken language, just like humans do. Here's a simple breakdown:

Recognizing Speech
1. Listening:

● The system listens to spoken words using a microphone or audio input.

2. Signal Processing:
● The recorded speech is converted into digital data through signal

processing techniques. This includes filtering out background noise and

normalizing the audio.

3. Feature Extraction:

● Important features like the frequency of sound waves and the duration of

sounds are extracted from the audio data. These features help in

identifying words and sounds.

4. Pattern Matching:

● The system matches the extracted features with pre-existing patterns

stored in its database. These patterns represent known words and

phrases.

5. Recognition:

● Based on the closest match between the extracted features and stored

patterns, the system recognizes the spoken words and converts them into

text.
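Steps 4 and 5 (pattern matching and recognition) can be sketched as comparing an extracted feature vector against stored templates. The template values below stand in for real acoustic features and are invented for illustration.

```python
import math

# Hypothetical stored patterns: word -> averaged feature vector
# (standing in for simplified spectral features).
templates = {
    "yes": [0.9, 0.1, 0.4],
    "no":  [0.2, 0.8, 0.5],
}

def recognize(features):
    """Return the stored word whose template is closest to the input features."""
    return min(templates, key=lambda w: math.dist(templates[w], features))

print(recognize([0.85, 0.15, 0.4]))  # closest template is "yes"
```

Real recognizers use far richer features (e.g. spectral coefficients over time) and probabilistic models rather than a single distance comparison, but the match-against-stored-patterns idea is the same.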

Understanding Speech
1. Language Processing:

● After recognizing the words, the system processes the text to understand

the meaning. This involves syntactic and semantic analysis to decipher

the grammar and context of the spoken words.

2. Contextual Understanding:

● Understanding the meaning of words in the context of the conversation or

task at hand. For instance, recognizing that "bank" refers to a financial

institution or the side of a river based on the context of the conversation.


3. Interpretation:

● Making sense of the spoken words to derive intent or take appropriate

action. This could involve answering a question, executing a command, or

providing relevant information.

Example: Virtual Assistant


1. Recognizing Speech:

● When you say, "Hey, virtual assistant," the system recognizes the wake

phrase and starts listening.

2. Understanding Intent:

● After recognizing the wake word, the system processes the following

words to understand the intent. For example, if you say, "What's the

weather today?" it understands that you're asking for weather information.

3. Retrieving Information:

● The system retrieves the relevant information by accessing weather data

and formulates a response.

4. Providing Response:

● Finally, the system converts the response into synthesized speech and

communicates it back to you.
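The four assistant steps can be sketched as a tiny keyword-based dispatcher. The wake phrase, intents, and canned responses are illustrative assumptions; a real assistant would use trained models for each step.

```python
WAKE_PHRASE = "hey assistant"

def handle(utterance):
    """Recognize the wake phrase, infer intent from keywords, return a response."""
    text = utterance.lower()
    if not text.startswith(WAKE_PHRASE):
        return None  # ignore speech that lacks the wake phrase
    command = text[len(WAKE_PHRASE):].strip(" ,.?")
    if "weather" in command:
        return "Today is sunny with a high of 22 degrees."  # stand-in for a data lookup
    if "time" in command:
        return "It is 10:30."  # stand-in for a clock lookup
    return "Sorry, I didn't understand that."

print(handle("Hey assistant, what's the weather today?"))
```

The final step in a real assistant, converting the response string to synthesized speech, is omitted here.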

Challenges
● Noise and Variability: Speech recognition systems must deal with background

noise, accents, and variations in pronunciation.


● Ambiguity: Spoken language often contains ambiguity and context-dependent

meanings, which makes understanding speech challenging.

● Vocabulary and Language: Recognizing and understanding a wide vocabulary

and different languages requires extensive training and data.

What is an Expert System?


An expert system is a computer program that is designed to solve complex problems

and to provide decision-making ability like a human expert. It performs this by extracting

knowledge from its knowledge base using the reasoning and inference rules according

to the user queries.

Expert systems are a part of AI; the first ones were developed in the 1970s and

were among the earliest successful applications of artificial intelligence. An expert

system solves the most complex issues like an expert by extracting the knowledge stored in its

knowledge base. The system helps in decision making for complex problems using both facts and heuristics

like a human expert. It is called so because it contains the expert knowledge of a

specific domain and can solve any complex problem of that particular domain. These

systems are designed for a specific domain, such as medicine, science, etc.

The performance of an expert system is based on the expert's knowledge stored in its

knowledge base. The more knowledge stored in the KB, the more that system improves

its performance. One of the common examples of an ES is a suggestion of spelling

errors while typing in the Google search box.

Below is the block diagram that represents the working of an expert system:
Note: It is important to remember that an expert system is not used to replace the human

experts; instead, it is used to assist the human in making a complex decision. These

systems do not have human capabilities of thinking and work on the basis of the knowledge

base of the particular domain.

Below are some popular examples of the Expert System:

○ DENDRAL: It was an artificial intelligence project that was made as a chemical


analysis expert system. It was used in organic chemistry to detect unknown
organic molecules with the help of their mass spectra and knowledge base of
chemistry.

○ MYCIN: It was one of the earliest backward chaining expert systems that was
designed to find the bacteria causing infections like bacteraemia and meningitis.
It was also used for the recommendation of antibiotics and the diagnosis of
blood clotting diseases.

○ PXDES: It is an expert system that is used to determine the type and level of lung
cancer. It examines an image of the upper body; the lung shadow visible in the
image indicates the type and degree of harm.
○ CaDeT: The CaDet expert system is a diagnostic support system that can detect
cancer at early stages.

Characteristics of Expert System

○ High Performance: The expert system provides high performance for solving any
type of complex problem of a specific domain with high efficiency and accuracy.

○ Understandable: It responds in a way that can be easily understandable by the


user. It can take input in human language and provides the output in the same
way.

○ Reliable: It is highly reliable and generates efficient, accurate output.

○ Highly responsive: ES provides the result for any complex query within a very
short period of time.

Components of Expert System

An expert system mainly consists of three components:

○ User Interface

○ Inference Engine

○ Knowledge Base
1. User Interface

With the help of a user interface, the expert system interacts with the user, takes queries

as an input in a readable format, and passes it to the inference engine. After getting the

response from the inference engine, it displays the output to the user. In other words, it

is an interface that helps a non-expert user to communicate with the expert system to

find a solution.

2. Inference Engine (Rules Engine)

○ The inference engine is known as the brain of the expert system as it is the main
processing unit of the system. It applies inference rules to the knowledge base to
derive a conclusion or deduce new information. It helps in deriving an error-free
solution of queries asked by the user.
○ With the help of an inference engine, the system extracts the knowledge from the
knowledge base.

○ There are two types of inference engine:

○ Deterministic Inference engine: The conclusions drawn from this type of


inference engine are assumed to be true. It is based on facts and rules.

○ Probabilistic Inference engine: This type of inference engine contains


uncertainty in its conclusions, which are based on probability.

Inference engine uses the below modes to derive the solutions:

○ Forward Chaining: It starts from the known facts and rules, and applies the
inference rules to add their conclusion to the known facts.

○ Backward Chaining: It is a backward reasoning method that starts from the goal
and works backward to prove the known facts.
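Forward chaining can be sketched as repeatedly firing any rule whose conditions are already in the fact set, until nothing new can be derived. The medical facts and rules below are illustrative assumptions.

```python
# Rules: (set of condition facts, conclusion fact). Illustrative domain.
rules = [
    ({"has_fever", "has_cough"}, "has_flu"),
    ({"has_flu"}, "needs_rest"),
]

def forward_chain(facts, rules):
    """Apply rules until no new facts can be derived (forward chaining)."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for conditions, conclusion in rules:
            if conditions <= facts and conclusion not in facts:
                facts.add(conclusion)  # fire the rule, adding its conclusion
                changed = True
    return facts

print(forward_chain({"has_fever", "has_cough"}, rules))
```

Backward chaining would run the other way: start from a goal such as "needs_rest" and recursively try to prove each rule's conditions from the known facts.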

3. Knowledge Base

○ The knowledge base is a store of knowledge acquired from the
different experts of the particular domain. It acts as a large repository of
knowledge: the larger and more accurate the knowledge base, the more precise the
Expert System.

○ It is similar to a database that contains information and rules of a particular


domain or subject.

○ One can also view the knowledge base as a collection of objects and their
attributes. For example, a lion is an object whose attributes include being a
mammal and not being a domestic animal.

Components of Knowledge Base


○ Factual Knowledge: The knowledge which is based on facts and accepted by
knowledge engineers comes under factual knowledge.

○ Heuristic Knowledge: This knowledge is based on practice, the ability to guess,


evaluation, and experiences.

Knowledge Representation: It is used to formalize the knowledge stored in the

knowledge base, typically using if-then rules.

Knowledge Acquisition: It is the process of extracting, organizing, and structuring the

domain knowledge, specifying the rules to acquire the knowledge from various experts,

and storing that knowledge in the knowledge base.

Development of Expert System

Here, we will explain the working of an expert system by taking an example of MYCIN

ES. Below are some steps to build MYCIN:

○ Firstly, ES should be fed with expert knowledge. In the case of MYCIN, human
experts specialized in the medical field of bacterial infection, provide information
about the causes, symptoms, and other knowledge in that domain.

○ The KB of the MYCIN is updated successfully. In order to test it, the doctor
provides a new problem to it. The problem is to identify the presence of the
bacteria by inputting the details of a patient, including the symptoms, current
condition, and medical history.

○ The ES will need a questionnaire to be filled by the patient to know the general
information about the patient, such as gender, age, etc.

○ Now the system has collected all the information, so it will find the solution for
the problem by applying if-then rules using the inference engine and using the
facts stored within the KB.
○ In the end, it will provide a response to the patient by using the user interface.

Participants in the development of Expert System

There are three primary participants in the building of Expert System:

1. Expert: The success of an ES depends greatly on the knowledge provided by


human experts. These experts are those persons who are specialized in that
specific domain.

2. Knowledge Engineer: Knowledge engineer is the person who gathers the


knowledge from the domain experts and then codifies that knowledge to the
system according to the formalism.

3. End-User: This is a person or a group of people who may not be


experts themselves, and who use the expert system to obtain solutions or
advice for their complex queries.

Why Expert System?


Before adopting any technology, we should understand why it is needed, and the

same holds for the ES. If we already have human experts in every field, why

develop a computer-based system? The points below describe the need for

expert systems:

1. No memory Limitations: It can store as much data as required and can


memorize it at the time of its application. But for human experts, there are some
limitations to memorize all things at every time.

2. High Efficiency: If the knowledge base is updated with the correct knowledge,
then it provides a highly efficient output, which may not be possible for a human.

3. Expertise in a domain: There are lots of human experts in each domain, and they
all have different skills and different experiences, so it is not easy
to get a single final output for a query. But if we put the knowledge gained from
human experts into the expert system, then it provides an efficient output by
combining all the facts and knowledge.

4. Not affected by emotions: These systems are not affected by human emotions
such as fatigue, anger, depression, anxiety, etc.. Hence the performance remains
constant.

5. High security: These systems provide high security to resolve any query.

6. Considers all the facts: To respond to any query, it checks and considers all the
available facts and provides the result accordingly. But it is possible that a
human expert may not consider some facts due to any reason.

7. Regular updates improve the performance: If there is an issue in the result


provided by the expert systems, we can improve the performance of the system
by updating the knowledge base.

Capabilities of the Expert System

Below are some capabilities of an Expert System:

○ Advising: It is capable of advising a human being on queries within the
domain of the particular ES.

○ Provide decision-making capabilities: It provides the capability of decision


making in any domain, such as for making any financial decision, decisions in
medical science, etc.

○ Demonstrate a device: It is capable of demonstrating any new products such as


its features, specifications, how to use that product, etc.

○ Problem-solving: It has problem-solving capabilities.

○ Explaining a problem: It is also capable of providing a detailed description of an


input problem.

○ Interpreting the input: It is capable of interpreting the input given by the user.
○ Predicting results: It can be used for the prediction of a result.

○ Diagnosis: An ES designed for the medical field is capable of diagnosing a


disease without using multiple components as it already contains various inbuilt
medical tools.

Advantages of Expert System

○ These systems are highly reproducible.

○ They can be used for risky places where the human presence is not safe.

○ Error possibilities are less if the KB contains correct knowledge.

○ The performance of these systems remains steady as it is not affected by


emotions, tension, or fatigue.

○ They provide a very high speed to respond to a particular query.

Limitations of Expert System

○ The response of the expert system may get wrong if the knowledge base
contains the wrong information.

○ Unlike a human being, it cannot produce creative output for novel scenarios.

○ Its maintenance and development costs are very high.

○ Knowledge acquisition for designing the system is very difficult.

○ For each domain, we require a specific ES, which is one of the big limitations.

○ It cannot learn from itself and hence requires manual updates.

Applications of Expert System

○ In designing and manufacturing domain


It can be broadly used for designing and manufacturing physical devices such as
camera lenses and automobiles.
○ In the knowledge domain
These systems are primarily used for publishing the relevant knowledge to the
users. Popular expert systems in this domain include advisory systems such as tax advisors.

○ In the finance domain


In the finance industry, it is used to detect possible fraud and
suspicious activity, and to advise bankers on whether they should provide
loans to a business.

○ In the diagnosis and troubleshooting of devices


In medical diagnosis, the ES system is used, and it was the first area where these
systems were used.

○ Planning and Scheduling


The expert systems can also be used for planning and scheduling some
particular tasks for achieving the goal of that task.

characteristics

1. Knowledge Base
● Comprehensive: Contains a vast amount of domain-specific knowledge.

● Organized: Structured in a way that allows for efficient retrieval and application of

knowledge.

2. Inference Engine
● Logical Reasoning: Applies logical rules to the knowledge base to deduce new

information or make decisions.

● Problem-Solving: Uses techniques such as forward chaining and backward

chaining to solve problems.


3. User Interface
● Interactive: Allows users to interact with the system, inputting data and receiving

advice or decisions.

● User-Friendly: Designed to be accessible to users who may not have expertise in

the underlying technology or domain.

4. Explanation Facility
● Transparent Reasoning: Can explain the reasoning process and how it arrived at

a particular conclusion.

● Educational: Helps users understand the domain and the logic behind the

decisions, useful for learning and training.

5. Knowledge Acquisition
● Adaptable: Capable of integrating new knowledge without significant redesign.

● Updatable: Allows for easy updating and expansion of the knowledge base as

new information becomes available.

6. Performance
● Accuracy: Provides highly accurate solutions, often comparable to or exceeding

human expert performance.

● Efficiency: Delivers solutions quickly, often much faster than a human expert

could.

7. Reliability
● Consistency: Offers consistent advice or decisions, free from human errors or

biases.

● Robustness: Maintains performance even with incomplete or uncertain data.

8. Domain Specificity
● Specialized: Designed for specific domains or fields (e.g., medical diagnosis,

financial advice).

● Focused Expertise: Contains detailed and specialized knowledge pertinent to the

domain.

The rule-based system in AI bases choices or inferences on established rules. These

rules are frequently expressed in human-friendly language, such as "if X is true, then Y is

true," to make them easier for readers to comprehend. Expert and decision support

systems are only two examples of the many applications in which rule-based systems

have been employed.

What is a Rule-based System?

A system that relies on a collection of predetermined rules to decide what to do next is

known as a rule-based system in AI. These rules are based on various

conditions and actions. For instance, if a patient has a fever, the doctor may

recommend antibiotics because the patient may have an infection. Expert systems,

decision support systems, and chatbots are examples of apps that use rule-based

systems.
Characteristics of Rule-based Systems in AI
The following are some of the primary traits of the rule-based system in AI:

● The rules are written simply for humans to comprehend, making rule-based

systems simple to troubleshoot and maintain.

● Given a set of inputs, rule-based systems will always create the same output,

making them predictable and dependable. This property is known as

determinism.

● A rule-based system in AI is transparent because the standards are clear and

open to human inspection, which makes it simpler to comprehend how the

system operates.

● A rule-based system in AI is scalable. When scaled up, large quantities of data

can be handled by rule-based systems.

● Rule-based systems can be modified or updated more easily because the rules

can be divided into smaller components.

How does a Rule-based System Work?


A rule-based system in AI generates an output by using a collection of inputs and a set

of rules. The system first determines which rules apply to the inputs. If a rule is

applicable, the system executes the corresponding steps to generate the output. If no

rule is applicable, the system might generate a default output or ask the user for

more details.
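This decision procedure can be sketched as a minimal rule engine: check each rule's condition against the inputs, collect the actions of the rules that fire, and fall back to a default when none apply. The patient fields, thresholds, and actions below are illustrative assumptions, not medical guidance.

```python
# Each rule: (condition predicate over the inputs, action string). Illustrative.
rules = [
    (lambda p: p["temperature"] > 38.0, "suspect infection; consider antibiotics"),
    (lambda p: p["blood_pressure"] < 90, "check for shock"),
]

def decide(patient):
    """Return the actions of every applicable rule, or a default output."""
    actions = [action for condition, action in rules if condition(patient)]
    return actions or ["no rule applies; ask the user for more details"]

print(decide({"temperature": 39.2, "blood_pressure": 120}))
```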

Main Components of a Rule-based System


Typically, a rule-based system in AI consists of seven fundamental elements:

1. The knowledge base:

It contains the specialized expertise required for problem-solving. The

information is represented as a set of rules in a rules-based system. Every rule

has an IF (condition) THEN (action) structure and defines a relationship,

suggestion, directive, strategy, or heuristic. The rule is activated, and the action

portion is carried out as soon as the conditional portion of the rule is met.

2. The database:

The database contains a collection of facts compared to the knowledge base's

rules IF (condition) clause.

3. The inference engine:

The expert system uses the inference engine to derive the logic and arrive at a

conclusion. The inference engine's task is to connect the facts kept in the

database with the rules specified in the knowledge base. The semantic reasoner

is another name for the reasoning engine. It deduces information or executes

necessary actions based on data and the rule base present in the knowledge

base. For example, the match-resolve-act loop used by the semantic reasoner

goes like this:

○ Match:

A portion of the production rule system is compared to the information in


the working memory to form a conflict set containing every

production whose conditions are currently satisfied.

○ Conflict Resolution:

Following the match step, one production instance from the

conflict set is selected and executed, typically according to a

conflict-resolution strategy such as rule priority or specificity.

○ Act:

The production instance chosen in the step before is carried out, changing

the information in the working memory.

4. Explanations facilities:

The user can use the explanation facilities to question the expert system on how

it came to a particular conclusion or why a particular fact is necessary. The

expert system must be able to defend its logic, recommendations, analyses, and

conclusions.

5. User Interface:

The user interface is the channel through which the user interacts with the expert

system to find a solution to an issue. The user interface should be as simple and

intuitive as possible, and the dialogue should be as helpful and friendly as

possible.

Each of these first five components is essential to any rule-based system in AI. These

form the basis of the rule-based structure. However, the mechanism might also

include a few extra parts: the working memory and the external interface are two

examples of these parts.

6. External interface:

An expert system can interact with external data files and programs written in
traditional computer languages like C, Pascal, FORTRAN, and Basic, thanks to the

external interface.

7. Working memory:

The working memory keeps track of transient data and knowledge.
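The match-resolve-act loop described for the inference engine can be sketched in a few lines. The production rules and the specificity-based conflict-resolution strategy (prefer the rule with the most conditions) are illustrative assumptions.

```python
# Production rules: (name, condition facts, facts to add). Illustrative domain.
productions = [
    ("r1", {"wet_grass"}, {"maybe_rained"}),
    ("r2", {"wet_grass", "sprinkler_off"}, {"rained"}),
]

def match_resolve_act(working_memory):
    working_memory = set(working_memory)
    while True:
        # Match: find all satisfied productions whose conclusions are new.
        conflict_set = [p for p in productions
                        if p[1] <= working_memory and not p[2] <= working_memory]
        if not conflict_set:
            return working_memory
        # Resolve: prefer the most specific rule (the one with most conditions).
        name, _, additions = max(conflict_set, key=lambda p: len(p[1]))
        # Act: execute the chosen production, updating working memory.
        working_memory |= additions

print(match_resolve_act({"wet_grass", "sprinkler_off"}))
```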

Non-Production Systems

Nonproduction system architectures in AI refer to approaches that do not rely on

predefined sets of rules to make decisions or solve problems. These systems often use

other methods such as statistical models, machine learning, neural networks, and other

AI techniques. Let's explore some key nonproduction system architectures:

1. Neural Networks
Neural networks are inspired by the human brain's structure and function. They consist

of layers of interconnected nodes (neurons) that process data and learn patterns.

● Structure:

● Input Layer: Receives raw data.

● Hidden Layers: Intermediate layers where data processing occurs through

weighted connections.

● Output Layer: Produces the final prediction or classification.

● Learning:

● Training: Neural networks learn from large amounts of data by adjusting

weights based on errors in predictions.


● Backpropagation: A common training method where errors are

propagated back through the network to update weights.

● Applications: Image and speech recognition, natural language processing, and

autonomous driving.
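A single forward pass through the layers described above can be sketched in pure Python. The weights and biases here are arbitrary illustrative numbers, not trained values; training would adjust them via backpropagation.

```python
import math

def sigmoid(x):
    """Squash a weighted sum into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def layer(inputs, weights, biases):
    """One fully connected layer: weighted sum per neuron, then sigmoid."""
    return [sigmoid(sum(w * x for w, x in zip(ws, inputs)) + b)
            for ws, b in zip(weights, biases)]

# 2 inputs -> 2 hidden neurons -> 1 output neuron.
hidden = layer([0.5, -1.0], weights=[[0.1, 0.8], [-0.4, 0.2]], biases=[0.0, 0.1])
output = layer(hidden, weights=[[1.2, -0.7]], biases=[0.05])
print(output)  # a single activation between 0 and 1
```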

2. Decision Trees
Decision trees use a tree-like model of decisions and their possible consequences. They

are easy to understand and interpret.

● Structure:

● Nodes: Represent features or attributes.

● Branches: Represent decisions or rules.

● Leaves: Represent outcomes or classifications.

● Learning:

● Trees are constructed by splitting data into subsets based on the feature

that provides the highest information gain.

● Applications: Classification tasks, such as determining whether an email is spam

or not.
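The splitting criterion mentioned above, information gain, can be sketched for one binary split: it is the reduction in entropy from the parent set to the weighted entropies of the two subsets. The spam/ham labels are a toy example.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a list of class labels, in bits."""
    counts = Counter(labels)
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def information_gain(parent, left, right):
    """Entropy reduction achieved by splitting `parent` into two subsets."""
    n = len(parent)
    weighted = (len(left) / n) * entropy(left) + (len(right) / n) * entropy(right)
    return entropy(parent) - weighted

labels = ["spam", "spam", "ham", "ham"]
# A perfect split separates the classes completely, gaining a full bit.
print(information_gain(labels, ["spam", "spam"], ["ham", "ham"]))  # 1.0
```

Tree construction repeatedly picks the feature split with the highest gain and recurses on each subset.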

3. Support Vector Machines (SVMs)


SVMs are supervised learning models used for classification and regression tasks.

● Structure:

● Hyperplane: A decision boundary that separates data points of different

classes.
● Support Vectors: Data points that are closest to the hyperplane and

influence its position.

● Learning:

● SVMs find the hyperplane that maximizes the margin between different

classes.

● Applications: Image classification, text categorization, and bioinformatics.
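Once an SVM is trained, prediction reduces to checking which side of the hyperplane w·x + b = 0 a point falls on. The weight vector and bias below are assumed, already-trained values for illustration.

```python
# Assumed trained parameters of a linear SVM (illustrative values).
w = [2.0, -1.0]   # normal vector of the separating hyperplane
b = -0.5          # offset

def predict(x):
    """Classify by the sign of the decision function w.x + b."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return +1 if score >= 0 else -1

print(predict([1.0, 0.5]))   # 2*1.0 - 0.5 - 0.5 = 1.0  -> +1
print(predict([0.0, 1.0]))   # -1.0 - 0.5 = -1.5        -> -1
```

Finding w and b is the training step: an optimization that maximizes the margin subject to the support-vector constraints.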

4. Bayesian Networks
Bayesian networks are probabilistic graphical models that represent a set of variables

and their conditional dependencies via a directed acyclic graph (DAG).

● Structure:

● Nodes: Represent variables.

● Edges: Represent conditional dependencies.

● Learning:

● Uses Bayesian inference to update the probability of a hypothesis as more

evidence becomes available.

● Applications: Diagnostic systems, risk assessment, and decision support

systems.
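The Bayesian-inference step can be sketched with Bayes' rule on a minimal two-node network (Disease → Test). All the probabilities below are invented for illustration.

```python
# Illustrative prior and conditional probabilities.
p_disease = 0.01                 # P(disease)
p_pos_given_disease = 0.95       # P(test positive | disease)
p_pos_given_healthy = 0.05       # P(test positive | no disease)

# Bayes' rule: P(disease | positive) = P(pos | disease) P(disease) / P(pos)
p_pos = (p_pos_given_disease * p_disease
         + p_pos_given_healthy * (1 - p_disease))
p_disease_given_pos = p_pos_given_disease * p_disease / p_pos

print(round(p_disease_given_pos, 3))  # about 0.161
```

Even with a 95%-sensitive test, the posterior stays low because the prior is small; this is the kind of evidence-driven update a Bayesian network performs across many linked variables.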

5. Genetic Algorithms
Genetic algorithms are optimization algorithms inspired by the process of natural

selection.

● Structure:

● Population: A set of potential solutions.


● Chromosomes: Represent solutions.

● Genes: Represent parts of a solution.

● Learning:

● Selection: Chooses the best solutions.

● Crossover: Combines parts of two solutions to create a new solution.

● Mutation: Introduces random changes to solutions.

● Applications: Optimization problems, such as scheduling and route planning.
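The selection, crossover, and mutation operators can be sketched on a toy problem: evolving bit strings toward all ones. The population size, mutation rate, and fitness function are illustrative choices.

```python
import random

random.seed(0)
TARGET_LEN = 10

def fitness(chrom):
    """Count the 1-genes; the best possible score is TARGET_LEN."""
    return sum(chrom)

def evolve(generations=50, pop_size=20):
    population = [[random.randint(0, 1) for _ in range(TARGET_LEN)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: keep the fitter half as parents.
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]
        children = []
        while len(children) < pop_size - len(parents):
            a, b = random.sample(parents, 2)
            cut = random.randrange(1, TARGET_LEN)   # Crossover point
            child = a[:cut] + b[cut:]
            if random.random() < 0.1:               # Mutation
                i = random.randrange(TARGET_LEN)
                child[i] = 1 - child[i]
            children.append(child)
        population = parents + children
    return max(population, key=fitness)

best = evolve()
print(best, fitness(best))
```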

6. Reinforcement Learning
Reinforcement learning involves learning by interacting with an environment to achieve

a goal.

● Structure:

● Agent: Learns to make decisions.

● Environment: Where the agent operates.

● Actions: Choices the agent makes.

● Rewards: Feedback from the environment based on actions.

● Learning:

● The agent learns to maximize cumulative rewards through trial and error.

● Applications: Game playing (e.g., AlphaGo), robotics, and recommendation

systems.
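The trial-and-error loop can be sketched with tabular Q-learning on a tiny corridor world (states 0 to 3, with a reward for reaching the right end). The learning rate, discount factor, and exploration rate are illustrative choices.

```python
import random

random.seed(1)
N_STATES = 4          # corridor cells 0..3; reaching cell 3 gives reward 1
ACTIONS = [-1, +1]    # move left or right

q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
alpha, gamma, epsilon = 0.5, 0.9, 0.2

for _ in range(500):                       # episodes of trial and error
    s = 0
    while s != N_STATES - 1:
        # Epsilon-greedy: mostly exploit the best-known action, sometimes explore.
        if random.random() < epsilon:
            a = random.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda a: q[(s, a)])
        s_next = min(max(s + a, 0), N_STATES - 1)
        reward = 1.0 if s_next == N_STATES - 1 else 0.0
        # Q-learning update toward reward plus discounted best future value.
        best_next = max(q[(s_next, b)] for b in ACTIONS)
        q[(s, a)] += alpha * (reward + gamma * best_next - q[(s, a)])
        s = s_next

# After training, the greedy policy moves right from every non-terminal state.
print([max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(N_STATES - 1)])
```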

Key Characteristics of Nonproduction Systems


1. Learning from Data:

● Unlike rule-based systems that rely on predefined rules, nonproduction

systems learn patterns and make decisions based on data.


2. Adaptability:

● These systems can adapt and improve over time with new data, making

them more flexible and robust.

3. Complexity Handling:

● Nonproduction systems can handle complex patterns and relationships

that are difficult to define explicitly with rules.

4. Probabilistic Reasoning:

● Many nonproduction systems incorporate probabilistic reasoning,

allowing them to deal with uncertainty and make more nuanced decisions.

Advantages
● Scalability: Can handle large and complex datasets.

● Flexibility: Adaptable to different types of problems and domains.

● Efficiency: Often more efficient in making predictions and decisions once trained.

Challenges
● Data Dependency: Requires large amounts of data for training.

● Interpretability: Some models, like deep neural networks, can be difficult to

interpret.

● Computational Resources: Can require significant computational power,

especially for training complex models.

Knowledge Acquisition and Validation in AI


Knowledge acquisition and validation are crucial steps in building intelligent systems,

especially expert systems. These processes ensure that the system has accurate,

reliable, and applicable knowledge to make decisions and solve problems effectively.

Knowledge Acquisition
Knowledge acquisition is the process of gathering, organizing, and integrating

information from various sources into the system’s knowledge base.

Steps in Knowledge Acquisition

1. Identify Sources of Knowledge:

● Human Experts: Interviewing and consulting with experts in the domain to

extract their knowledge.

● Documents and Databases: Analyzing textbooks, research papers,

technical manuals, and databases.

● Direct Observation: Observing experts at work to understand their

decision-making processes.

2. Elicit Knowledge:

● Interviews: Conduct structured or unstructured interviews with experts to

gather detailed information.

● Questionnaires: Use surveys to collect knowledge from multiple experts

efficiently.

● Workshops: Organize sessions where experts discuss and share their

knowledge collaboratively.

● Protocol Analysis: Record and analyze the thought processes of experts

as they solve problems.

3. Represent Knowledge:

● Rule-Based: Represent knowledge as if-then rules.


● Semantic Networks: Use graphs where nodes represent concepts and

edges represent relationships.

● Frames: Use structured templates to represent stereotypical situations.

● Ontologies: Define concepts and their relationships within a domain

formally.

4. Integrate Knowledge:

● Combine knowledge from various sources, ensuring consistency and

completeness.

● Resolve conflicts and redundancies in the gathered knowledge.

Tools for Knowledge Acquisition

● Knowledge Engineering Tools: Software that assists in capturing, modeling, and

integrating knowledge.

● Expert System Shells: Pre-built frameworks for developing expert systems, which

include tools for knowledge acquisition.

Knowledge Validation
Knowledge validation ensures that the acquired knowledge is correct, reliable, and

applicable. It involves testing and refining the knowledge base to meet the required

standards.

Steps in Knowledge Validation

1. Internal Validation:

● Consistency Checking: Ensure there are no conflicting rules or facts within

the knowledge base.


● Completeness Checking: Verify that the knowledge base covers all

necessary areas and scenarios.

● Logical Validation: Check the logical correctness of the rules and their

execution flow.

2. External Validation:

● Expert Review: Have domain experts review the knowledge base to

confirm its accuracy and relevance.

● Case Testing: Apply the knowledge base to real-world cases and

scenarios to see if it produces correct and useful results.

● Simulation: Simulate various conditions and scenarios to test the

robustness of the knowledge base.

3. Performance Evaluation:

● Accuracy Testing: Measure how accurately the system’s outputs match

the expected results.

● Reliability Testing: Ensure the system produces consistent results under

the same conditions.

● Usability Testing: Check if the system is user-friendly and meets the users'

needs.
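The consistency-checking step of internal validation can be sketched as a scan for rules that share the same conditions but draw contradictory conclusions. The rule format and the example rules are assumptions for illustration.

```python
# Rules as (frozenset of conditions, conclusion). Illustrative knowledge base.
rules = [
    (frozenset({"fever", "cough"}), "flu"),
    (frozenset({"fever", "cough"}), "not flu"),   # conflicts with the rule above
    (frozenset({"rash"}), "allergy"),
]

def find_conflicts(rules):
    """Report rule pairs with identical conditions but different conclusions."""
    seen = {}
    conflicts = []
    for conditions, conclusion in rules:
        if conditions in seen and seen[conditions] != conclusion:
            conflicts.append((conditions, seen[conditions], conclusion))
        seen[conditions] = conclusion
    return conflicts

print(find_conflicts(rules))  # flags the flu / not-flu contradiction
```

Completeness and logical checks extend the same idea: scan the rule base for uncovered input combinations or rules that can never fire.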

Iterative Refinement

● Feedback Loop: Continuously gather feedback from users and experts to

improve and refine the knowledge base.

● Update and Maintenance: Regularly update the knowledge base to reflect new

knowledge and changes in the domain.

Examples of Knowledge Acquisition and Validation


1. Medical Diagnosis System:

● Acquisition: Gather knowledge from doctors, medical textbooks, and

patient records.

● Validation: Test the system with real patient cases and have doctors

review the system’s recommendations.

2. Financial Expert System:

● Acquisition: Elicit knowledge from financial analysts, market reports, and

economic models.

● Validation: Simulate market conditions and compare the system’s advice

with actual market outcomes.

Importance of Knowledge Acquisition and Validation


● Accuracy: Ensures the system provides correct and reliable outputs.

● Relevance: Keeps the system up-to-date with the latest knowledge and practices.

● Trust: Builds confidence among users that the system can be relied upon for

critical decisions.

● Efficiency: Enhances the system’s ability to solve problems quickly and

effectively by using validated knowledge.

Question - Differentiate between general artificial intelligence and narrow

artificial intelligence. Describe the Turing test for artificial intelligence and

justify its validity from a modern standpoint.

General artificial intelligence (AGI) refers to the hypothetical intelligence of a machine

that could successfully perform any intellectual task that a human being can. It's often

depicted in science fiction as sentient, self-aware machines capable of reasoning,


learning, and problem-solving across a wide range of domains. AGI would possess

cognitive abilities comparable to or exceeding those of humans.

Narrow artificial intelligence (ANI), on the other hand, refers to AI systems that are

designed and trained for specific tasks or narrow domains. These systems excel at

performing a single task or a set of closely related tasks but lack the broad cognitive

abilities associated with human intelligence. Examples of ANI include voice assistants

like Siri, recommendation systems, and image recognition algorithms.

The Turing test is a method of inquiry in artificial intelligence (AI) for determining

whether or not a computer is capable of human-like intelligence. It was proposed by

Alan Turing in 1950. The test involves a human evaluator interacting with both a human

and a machine (both hidden from the evaluator) through text-based communication. If the

evaluator cannot reliably distinguish the machine's responses from the human's, then

the machine is said to have passed the Turing test.

The validity of the Turing test from a modern standpoint can be justified in several ways:

1. Behavioral Approach: The Turing test focuses on observable behavior rather than

the internal mechanisms of intelligence. In today's AI systems, which are

increasingly focused on practical applications and outputs rather than mimicking

human cognition, this perspective remains relevant.

2. Practical Benchmark: Passing the Turing test demonstrates a level of

conversational ability and contextual understanding that is indistinguishable

from humans. In practical terms, this benchmark signifies that the AI is capable

of interacting with humans in a natural and meaningful way, which is a crucial

goal for many AI applications.


3. Ethical Considerations: The Turing test indirectly addresses ethical concerns by

emphasizing the importance of human-like behavior in AI. An AI system that can

pass the Turing test is more likely to respect social norms, understand human

emotions, and communicate effectively, reducing the risk of harmful or

inappropriate behavior.

4. Adaptability: The Turing test doesn't specify a fixed set of tasks or criteria for

intelligence, allowing for flexibility in evaluating AI systems across various

domains and contexts. This adaptability makes the test applicable to a wide

range of AI applications, from language processing to robotics.

While the Turing test has its limitations and critics argue that passing it doesn't

necessarily imply true intelligence or consciousness, it remains a foundational concept

in AI research and a practical benchmark for evaluating the capabilities of AI systems.

In simple terms:

General AI: Imagine a robot that's super smart and can do anything a person can do, like

solving puzzles, learning new things, and even having conversations like you and I are

having right now. That's what we call General AI.

Narrow AI: Now, think about your smartphone's voice assistant or a computer program

that recognizes faces in photos. They're really good at specific tasks, but they can't do

everything like a super-smart robot can. These are examples of Narrow AI.

Turing Test: Imagine you're chatting with someone online. You can't see them, and

you're not sure if they're a person or a computer program. If the responses you get are
so human-like that you can't tell if it's a person or a machine, then the machine passes

the Turing Test.

Why it matters: The Turing Test helps us see how close computers are to being as

smart as humans in conversations and understanding. If a computer can pass this test,

it's a big deal because it means it's really good at understanding and talking like

humans.

What is a production rule? Give an example and define the two basic parts of the production rule.

A production rule, also known simply as a production, is a formal representation used in artificial intelligence and expert systems that pairs a set of conditions with the actions to perform when those conditions hold. It is the fundamental building block of rule-based (production) systems.

Here's an example of a production rule:

IF (Condition) THEN (Action)

Let's say we have a simple expert system for diagnosing car problems:

IF the engine won't start

THEN check the battery and the starter motor

In this example:
1. Condition: "the engine won't start" is the condition that triggers the rule. It's the

situation or state that the system checks for.

2. Action: "check the battery and the starter motor" is the action that the system

takes if the condition is met. It's what the system does in response to the

condition being true.

So, a production rule consists of two basic parts:

● Condition: This is the part of the rule that describes the situation or state that

needs to be true for the rule to be activated.

● Action: This is the part of the rule that specifies what the system should do if the

condition is met. It describes the response or outcome of the rule being

triggered.
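The IF–THEN structure above can be sketched as a tiny rule-based system in Python. This is a minimal illustration; the rules and fact names are invented for this example:

```python
# A production rule pairs a condition (a test over the known facts)
# with an action (what to do when the test succeeds).
rules = [
    {
        "condition": lambda facts: "engine won't start" in facts,
        "action": "check the battery and the starter motor",
    },
    {
        "condition": lambda facts: "engine overheats" in facts,
        "action": "check the coolant level and the radiator fan",
    },
]

def fire_rules(facts):
    """Return the action of every rule whose condition holds."""
    return [rule["action"] for rule in rules if rule["condition"](facts)]

print(fire_rules({"engine won't start"}))
# ['check the battery and the starter motor']
```

Each dictionary is one production: the lambda is its condition part, the string its action part.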

What is fuzzy logic? Define main operations of fuzzy sets. Provide examples.

Fuzzy logic is a branch of mathematics and a form of reasoning that deals with

approximate or uncertain information. Unlike classical binary logic, which operates in

terms of strict true or false values (0 or 1), fuzzy logic allows for degrees of truth

between 0 and 1. It's particularly useful in situations where variables can have imprecise

or ambiguous values, making it more suitable for modeling human reasoning and

decision-making processes.

The core concepts and operations of fuzzy sets are:

1. Fuzzy Set Membership Function: A membership function assigns a degree of

membership (a value between 0 and 1) to each element of the fuzzy set,


indicating the extent to which the element belongs to the set.

Example: Consider a fuzzy set representing "tall people" where the membership

function assigns a degree of membership to each person's height. So, someone

who is 6 feet tall might have a membership value of 0.8, indicating a high degree

of belonging to the set of "tall people."

2. Fuzzy Set Union (OR): The union operation combines the membership values of
two fuzzy sets for each element, typically by taking the maximum:
μA∪B(x) = max(μA(x), μB(x)). The result is a new fuzzy set that includes
elements belonging to either or both of the original sets.

Example: Let's say we have two fuzzy sets representing "young" and "old" based

on age. The union operation combines these sets to create a new fuzzy set

representing "young or old" individuals. If someone is 30 years old, they might

have a high membership value in the "young" set and a lower membership value

in the "old" set, resulting in a combined membership value for the new set.

3. Fuzzy Set Intersection (AND): The intersection operation combines the
membership values of two fuzzy sets for each element, typically by taking the
minimum: μA∩B(x) = min(μA(x), μB(x)). The result is a new fuzzy set that
includes elements belonging to both of the original sets.

Example: Consider fuzzy sets representing "tall" and "old" individuals. The

intersection operation combines these sets to create a new fuzzy set

representing "tall and old" individuals. If someone is both 6 feet tall and 70 years

old, they might have high membership values in both sets, resulting in a high

membership value in the new set.
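These operations can be sketched in Python using the standard max/min definitions. The membership values below are illustrative, not measured:

```python
# Membership values (degree of belonging, between 0 and 1) for a few people.
tall = {"Alice": 0.8, "Bob": 0.3, "Carol": 0.6}
old  = {"Alice": 0.2, "Bob": 0.9, "Carol": 0.6}

def fuzzy_union(a, b):
    """Union (OR): take the maximum membership value per element."""
    return {x: max(a[x], b[x]) for x in a}

def fuzzy_intersection(a, b):
    """Intersection (AND): take the minimum membership value per element."""
    return {x: min(a[x], b[x]) for x in a}

print(fuzzy_union(tall, old))         # {'Alice': 0.8, 'Bob': 0.9, 'Carol': 0.6}
print(fuzzy_intersection(tall, old))  # {'Alice': 0.2, 'Bob': 0.3, 'Carol': 0.6}
```

Note how "tall AND old" gives Alice a low value (0.2): she is tall but not old, so she barely belongs to the intersection.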

Design the class-frame for the object Student, determine its attributes and define

several instances for this class.

Here is a simple class-frame for the object "Student", written in Python:

class Student:
    def __init__(self, name, age, grade):
        self.name = name
        self.age = age
        self.grade = grade

    def display_info(self):
        print("Name:", self.name)
        print("Age:", self.age)
        print("Grade:", self.grade)

# Define instances of the Student class
student1 = Student("Alice", 15, 10)
student2 = Student("Bob", 16, 11)
student3 = Student("Charlie", 14, 9)

# Display information for each student
print("Student 1:")
student1.display_info()

print("\nStudent 2:")
student2.display_info()

print("\nStudent 3:")
student3.display_info()

In this class-frame:

● Attributes:

● name: Represents the name of the student.

● age: Represents the age of the student.

● grade: Represents the grade or class level of the student.


● Methods:

● __init__(): Constructor method to initialize the attributes of the Student

object.

● display_info(): Method to display the information of the student.

Then, we create instances (objects) of the Student class: student1, student2, and

student3. Each instance has its own set of attributes representing different students.

Finally, we call the display_info() method for each instance to print out the information

of each student.

Discuss the issues in recognizing and understanding speech

1. Different Ways of Speaking: People speak in many different ways, like with

different accents or speeds. Computers need to understand all these variations,

which can be tricky.

2. Background Noise: Sometimes there's noise in the background, like cars honking

or people talking. Computers have to filter out this noise to focus on what

someone is saying.

3. Words That Sound Alike: Some words sound the same but mean different things,

like "to" and "too." Computers need to figure out which word is being said based

on the context.
4. Stumbling Over Words: Sometimes people pause, repeat themselves, or say "um"

or "uh." Computers have to understand these pauses and not get confused by

them.

5. New Words: If someone says a word that the computer hasn't heard before, it

might not understand what it means.

6. Different People Speaking: Sometimes a computer has to understand different

people speaking, and everyone speaks a bit differently. This can make it harder

for the computer to understand.

7. Understanding Emotion: Computers can't always understand the emotions or

tone of voice behind what someone is saying, like if they're happy, sad, or angry.

What is the use of conceptual dependency in artificial intelligence?

Imagine you're teaching a computer to understand what people say. People use all sorts

of words and sentences, but the computer needs to understand the main ideas behind

those words. That's where conceptual dependency comes in.

1. Understanding Language: Conceptual dependency helps the computer

understand the main ideas in what people say, like who did what to whom and

why, without getting too caught up in the exact words.

2. Making Sense of Information: Instead of focusing on every single word, the

computer looks at the big picture. It figures out the main concepts and

relationships between them. For example, from "John gave Mary the book because
she loves to read," the computer extracts the underlying ideas: a transfer of
the book from John to Mary, motivated by her love of reading, regardless of the
exact wording.
3. Helping Computers Think: By breaking down language into concepts and

relationships, conceptual dependency helps computers think more like humans.

It helps them make logical deductions, answer questions, and have meaningful

conversations with people.

4. Talking Back: Not only does conceptual dependency help computers understand

what people say, but it also helps them talk back in a way that makes sense.

Computers can generate responses that capture the main ideas of the

conversation, even if they're not using the exact same words.
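For illustration, "John gave Mary the book" can be represented as a conceptual dependency structure built on ATRANS, the CD primitive act for transfer of possession. The dictionary layout below is an assumed encoding for this sketch, not a fixed standard:

```python
# "John gave Mary the book" as a conceptual dependency structure.
# ATRANS is the CD primitive for transfer of an abstract relationship
# such as possession or ownership.
sentence = {
    "act": "ATRANS",
    "actor": "John",
    "object": "book",
    "from": "John",
    "to": "Mary",
}

# A paraphrase such as "John handed the book over to Mary" maps to
# the same structure -- which is the point of CD: different wordings,
# one underlying meaning.
paraphrase = {
    "act": "ATRANS",
    "actor": "John",
    "object": "book",
    "from": "John",
    "to": "Mary",
}

print(sentence == paraphrase)  # True
```

Because both sentences reduce to the same structure, a system can recognize them as equivalent even though the surface words differ.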

List and describe the five major players in the expert system development team. What is the role of

the knowledge engineer?

1. Domain Expert: The domain expert is someone who possesses extensive knowledge and

expertise in the specific field or domain that the expert system is being developed for. Their

role is to provide the necessary domain knowledge, rules, and insights to ensure that the

expert system accurately represents the knowledge and reasoning processes used by

human experts in that domain.

2. Knowledge Engineer: The knowledge engineer acts as a liaison between the domain expert

and the technical team responsible for implementing the expert system. Their role is to elicit,

capture, and formalize the domain expert's knowledge into a format that can be understood

and utilized by the expert system. This involves tasks such as interviewing domain experts,

identifying decision-making processes, and structuring knowledge into rules or algorithms.


3. Software Developer: The software developer is responsible for designing, coding, and

implementing the expert system based on the specifications provided by the domain expert

and knowledge engineer. They use programming languages and development tools to build

the software components of the expert system, including the inference engine, knowledge

base, user interface, and integration with other systems if necessary.

4. User Interface Designer: The user interface designer focuses on creating an intuitive and

user-friendly interface for interacting with the expert system. Their role is to design screens,

menus, and interactions that enable users to input queries, receive responses, and navigate

through the system effectively. This involves considering factors such as usability,

accessibility, and user experience.

5. Quality Assurance (QA) Tester: The QA tester is responsible for ensuring the quality and

reliability of the expert system by identifying and addressing any defects, errors, or

inconsistencies. Their role involves testing the system's functionality, performance, and

accuracy against a variety of test cases and scenarios. They work closely with the

development team to troubleshoot issues and verify that the system meets the requirements

and expectations of end users.

The role of the knowledge engineer is to bridge the gap between the domain expert and the technical

team by translating domain knowledge into a format that can be used by the expert system. This

involves:

● Collaborating with domain experts to understand their expertise, decision-making processes,

and problem-solving strategies.

● Eliciting and capturing domain knowledge through interviews, observations, and

documentation.

● Structuring and organizing knowledge into a formal representation, such as rules, frames, or

ontologies, that can be processed by the expert system.

● Validating and refining the knowledge representation through iterative feedback and testing

with domain experts.


● Communicating effectively with software developers to ensure that the knowledge

representation is accurately implemented in the expert system.

What is a production system model? List and define the five basic components of an expert

system.

A production system model is a type of rule-based system used in artificial intelligence

(AI) and expert systems. It consists of a set of rules, known as productions, that guide

the behavior of the system. Each production consists of a condition (or set of

conditions) and an associated action (or set of actions) to be performed if the condition

is satisfied. Production systems are commonly used for problem-solving, decision-making, and control tasks.

The five basic components of an expert system are:

1. Knowledge Base (KB): The knowledge base is where all the relevant information,

rules, facts, and heuristics about a specific domain are stored. It represents the

expertise of human specialists in the domain and serves as the foundation for

the expert system's reasoning and decision-making processes.

2. Inference Engine (IE): The inference engine is responsible for reasoning and

making deductions based on the knowledge stored in the knowledge base. It

uses the rules and facts from the knowledge base to derive new conclusions,

make recommendations, or answer queries posed to the expert system.

3. User Interface (UI): The user interface provides a means for users to interact with

the expert system. It allows users to input queries, provide information, and

receive responses or recommendations from the system. The user interface

should be intuitive, user-friendly, and tailored to the specific needs of the users.
4. Explanation Facility (EF): The explanation facility provides explanations for the

reasoning and decisions made by the expert system. It helps users understand

how the system arrived at a particular conclusion or recommendation by

providing justifications, rationales, or supporting evidence from the knowledge

base.

5. Knowledge Acquisition (KA) Facility: The knowledge acquisition facility is used to

acquire, elicit, and update the knowledge stored in the knowledge base. It

provides tools and methods for domain experts and knowledge engineers to

input, edit, and validate the rules, facts, and heuristics that comprise the

knowledge base. This component is crucial for keeping the expert system up-to-date and relevant to changing domain requirements.
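A minimal sketch of how the knowledge base and inference engine fit together: forward chaining fires IF–THEN rules against known facts until no new facts can be derived. The facts and rules here are invented for this example:

```python
# Knowledge base: a set of facts plus IF-THEN rules.
# Each rule is (set of required conditions, conclusion to add).
facts = {"engine won't start", "lights are dim"}
rules = [
    ({"engine won't start", "lights are dim"}, "battery is flat"),
    ({"battery is flat"}, "recommend: charge or replace the battery"),
]

def forward_chain(facts, rules):
    """Inference engine: repeatedly fire every rule whose conditions
    are all satisfied, until no new facts can be derived."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for conditions, conclusion in rules:
            if conditions <= facts and conclusion not in facts:
                facts.add(conclusion)
                changed = True
    return facts

derived = forward_chain(facts, rules)
print("recommend: charge or replace the battery" in derived)  # True
```

The second rule fires only because the first one added "battery is flat", showing how the inference engine chains rules together to reach conclusions that no single rule states directly.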
