UNIT 2 (Problem Solving Agent)
UNIT 2 (Problem Solving Agent)
Agents solve the problem in 3 phases. They are ijformulate i)soarch iü)execute
Problem formulation is the process of
given a goal.
deciding what actions and states to consider,
Searching the process of looking for a sequence of actions that reaches the goal
Execution is carrying out the solution which is found in
searching process
Problem Fomulation
A problem can be defined formally by five components
Initial State [s]
Actions [Actions(s)]
Transition Model [Result(s,a)]
Goal Test [s]
Path cost function [c(s,a,s")]
Problem formulation for Vacuum Cleaner
No
F10 The Stole paa Y A VGCCwm wmld, AvCy
iMnitial state can be anyof 8.states A w: L =Left , R:R, S-SUCK.
2x2 .
i)Actions are left,right,suck
mostly
specific problem and provide the best result.
used these search strategies or algorithms to
solvea
use atomic Problem-solving agents are the
goal-based agents and
representation. In this topic, we will learn various problem-solving search
algorithms.
Search Algorithm Terminologies:
Search: Searchingis a step by step procedure to solve a
search-problem in a given search space.A
search problem can have three main factors:
a. Search Space: Search space represents a set of
possible solutions, which a system may have.
b. Start State: It is a state from where agent begins the search.
Goal test: It is a function which observe the current state and returns whether the
goal state is
achieved or not.
Search tree: A tree representation of search problem is called Search tree. The root of the search tree is the
root node which is corresponding to the initial state.
Actions: It gives the description of all the available actions to the
agent.
Transition model: A description of what each action do, be
can represented as a transition model.
Path Cost: It is a function which assigns a numeric cost to each path.
Solution: It is an action sequence which leads from the start node to the goal node.
Optimal Solution: If a solution has the lowest cost among all solutions.
Optimality: If a solution found for an algorithm is guaranteed to be the best solution (lowest path
cost) among all other solutions, then such a solution for is said to be an optimal solution.
Time Complexity: Time complexity is a measure of time for an algorithm to complete its task.
Space Complexity: It is the maximum storage space required at any point during the search, as the
complexity of the problem
3
Search Algorithm
Uninformed/Blind Search:
such as closeness, the location of the
The uninformed search does not contain any domain knowledge
includes information about how to traverse the tree
goal. It operates in a brute-force way as it only is
Uninformed search applies a way in which search tree
and how to identify leaf and goal nodes.
like initial state operators and test for the
searched without any information about the search space
each node of the tree until it achieves the goal node.
goal, so it is also called blind search.lt examines
It can be divided into five main types:
oBreadth-first search
o Uniform cost search
o Depth-first search
Informed search can solve much complex problem which could not be solved in another way.
1. Greedy Search
2. A* Search
1. Breadth-first Search
2. Depth-first Search
3. Depth-imited Search
4. Iterative deepening depth-first search
1. Breadth-first Search:
Breadth-first search is the most common search strategy for traversing a tree or graph. This algorithm
Advantages:
Disadvantages:
o It
requires lots of
memory since each level
level of the tree must be
saved into
memory to expand the next
o BFS needs lots of
time if the solution is
far away from the
root node
Example:
In the below
tree structure, we have shown the
root node S to
goal node K. BFS traversing of the tree using BFS
shown by the dotted algorithm traverse in layers, so it will followalgorithm
arrow, and the traversed
search from the
the path which is
path will be
1. S->A->B-C--->D->G-->H->E-->F--->---->K
Level o
B
Level 1
D Level2
Level 3
Level4
Time Complexity{Time Complexity BFS algorithm can be obtained by the number of nodes
of
traversed in BFS until the shaliowest Node.
at every state.
Where the d= depth of shallowest solution and b is a node
T (b) =
1+b*+b'+...+ b°= O (b)
Space Complexity: Space complexity of BFS algorithm is given by the Memory size of frontier which
is Ofb)
Completeness: BFS is complete, which means if the shallowest goal node is at some finite depth, then
BFS will fnd a solution
of the depth of the node.
function
cost is a non-decreasing
is optimal if path
Optimality: BFS
2. Depth-first Search
structure.
a tree or graph data
recursive algorithm for
traversing to its
oDepth-first search
isa
node and follows
each path
from the root
starts
search because it
It is called the depth-first
next path.
before moving to the
node
greatest depth
stack data structure for its implementation.
DFS uses a
similar to the BFS algorithm.
The process of the DFS algorithm is
solutions using recursion.
is an algorithm technique for finding all possible
Note: Becktraoking
Advantage:
nodes on the path from root
it only needs to store a stack of the
o DFS requires very less memory as
Disadvantage:
that many states keep re-occurring, and there is no guarantee of finding the
o There is the possibility
solution
down searching and sometime it may go to the infinite loop.
DFS algorithm goes for deep
Example:
follow the order
depth-first search, and it will
as:
In the below search tree, we have shown the flow of
evelo
Levela
Completeness: DFS search algorithm is complete within finite state space as it will expand every node
within a limited search tree.
Time Complexity: Time complexity of DFS will be equivalent to the node traversed by the algorith
It is given by
T(n)= 1+ n?+ n* *...+ n=O(n")
Where, m= maximum depth of any node and this can be much larger than d (Shallowest
solution depth)
Space Complexity: DFS algorithm needs to store only single path from the root node, hence space
complexity of DFS is equivalent to the size of the fringe set, which is O(bm).
Optimal: DFS search algorithm is non-optimal, as it may generate a large number of steps or high
cost to reach to the goal node.
Advantages:
Depth-limited search is Memory efficient.
Disadvantages:
Depth-limited search also has a disadvantage of incompleteness.
o It may not be optimal if the problem has more than one solution.
Example:
Depth Limited Search
vel o
Level
1evel s
Duajajduo
if the solution is above the depth-limit.
Completeness: DLS search algorithm is complete
if d .
Advantages:
because at every state the path with the least cost is chosen.
o Uniform cost search is optimal
Disadvantages:
concerned about path cost
about the number of steps
involve in searching and only
It does not care
may be stuck in
an infinite loop.
Due to which this algorithm
Level o
B Leveli
Level 2
C
E Level s
5
Level +
G
Completeness:
Uniform-cost search i mplete, such as if there is a solution, UCS will find it.
Time Complexity:
ne
mb optimal solution, and e is each step to get closer to the goal node. Then the
Seps is
=C*/£+1. Here we have taken +1, as we start from state 0 and end to C"/E.
ence, the
worst-case time complexity of Uniform-cost search iso(b' C)
Space Complexity:
Sdme logic is for space complexity so, the worst-case space complexity of Uniform-cost searcn
is O(bC/l)
Optimal:
Onitorm-cost search is always optimal it only selects a path with the lowest
as path cost
5. Iterative deepening depth-first Search:
he iterative deepening algorithm is a combination of DES and BFS algorithms. This search algorithm
finds out the best depth limit and does it by gradually increasing the limit until a goal is found.
This algorithm performs depth-first search up to a certain "depth limit", and it keeps increasing the
depth limit after each iteration until the goal node is found.
This Search algorithm combines the benefits of Breadth-first search's fast search and depth-first
search's memory efficiency.
The iterative search algorithm is useful uninformed search when search space is large, and depth of
goal node is unknown.
Advantages:
o Itcombines the benefits of BFS and DFS search algorithm in terms of fast search and memory efficiency.
Disadvantages:
o The main drawback of IDDFS is that it repeats all the work of the previous phase.
Example:
Following tree structure is showing the iterative deepening depth-first search. IDDFS algorithm
performs various iterations until it does not find the goal node. The iteration performed by the
algorithm is given as:
first search
Iterative deepening depth
Level o
B C Level i
Level
H Level3
1'st Iteration---->
2'nd Iteration-- A B,
3'rd Iteration------>A, B, D C G
4'th Iteration------>A, B, D, H, C F. G
In the fourth iteration, the algorithm will find the goal node.
Completeness:
This algorithm is complete is ifthe branching factor is finite.
Time Complexity:
Let's suppose b is the branching factor and depth is d then the worst-case time complexity is O(b").
Space Complexity:
The space complexity of 1DDFS will be O(bd)
Optimal:
IDDFS algorithm is optimal if path cost is a non- decreasing function of the depth of the node.
Disadvantages:
o Implementation of the bidirectional search tree is difficult.
Example:
divides one
In the below search tree, bidirectional search algorithm is applied. This algorithm
direction and starts
graph/tree into two sub-graphs. It starts traversing from node 1 in the forward
from goal node 16 in the backward direction.
Bidirectional Search
Root node
13
14
2
3
Intersection
Node
5 6
Goal node
all possible solutions of the problem without having any additional knowledge about search space
Dut intormed search algorithm contains an array of knowledge such as how far we are from the goal,
path cost, how to reach to goal node, etc. This knowledge help agents to explore less to the search
space and find more efficiently the goal node.
The informed search algorithm is more useful for large search space. Informed search algorithm uses
the idea of heuristic, so it is also called Heuristic search.
Heuristics function: Heuristic is a function which is used in Informed Search, and it finds the most
promising path. It takes the current state of the agent as its input and produces the estimation of how
close agent is from the
goal. The heuristic method, however, might not always give the best solution,
but it guaranteed to find a
good solution in reasonable time. Heuristic function estimates how close a
state is to the goal. It is
represented by h(n), and it calculates the cost of an optimal path between the
pair of states. The value of the heuristic function is always positive.
Admissibility of the heuristic function is given as:
Here h(n) is heuristic cost, and h*(n) is the estimated cost. Hence
heuristic cost should be less
than or equal to the estimated cost.
On each iteration, each noden with the lowest heuristic value is expanded and generates all its
successors and n is placed to the closed list. The algorithm continues unit a goal state is found.
In the informed search we will discuss two main algorithms which are given below
oBest First Search Algorithm(Greedy search)
o A Search Algorithm
algorithm, we expand the node which is closest to the goal node and the closest cost is estumded uy
heuristic function, i.e.
1. f(n)= h(n).
oStep 3: Remove the node n, from the OPEN list which has the lowestvalue of h(n), and places it in the
CLOSED list
o Step 4: Expand the node n, and generate the successors of node n.
o Step 5: Check each successor of node n, and find whether any node is a goal node or not. If any
successor node is goal node, then return success and terminate the search, else proceed to Step 6.
o Step 6: For each successor node, algorithm checks for evaluation function f(n), and then check if the
node has been in either OPEN or CLOSED list. If the node has not been in both list, then add it to the
OPEN list.
Advantages:
oBest first search can switch between BFS and DFS by gaining the advantages of both the algorithms.
Disadvantages:
oItcan behave as an unguided depth-first search in the worst case scenario.
It can get stuck in a loop as DFS.
Example:
Consider the below search problem, and we will traverse it using greedy best-first search. At each
iteration, each node is expanded using evaluation function f(n)=h(n) , which is given in the below
table
S node H(n)
12
B B 4
D
F E
F
H
H 13
S
G
In this search example, we are using two lists which are OPEN and CLOSED Lists. Following are the
iteration for traversing the above example.
12 A B 4
E F
Iteration 2: Open
E A Closed S
Open [E. A}, Closed [S, B, F]
S
Iteration 3: Open
:Open II, E, A), Closed S, B, G, Al Closed B
F, G]
Hence the final
solution path will be: S---->
B->F---> G
Time
Complexity: The worst case time
complexity of Greedy best first search is
Space Olp)
the
Complexity: The worst case space complexity of
maximum depth of the search Greedy best first search is O(b"). Where, m
space.
Complete: Greedy best-first search is also
incomplete, even if the given state space is finite.
Optimal: Greedy best first search algorithm is not optimal.
2.) A Search Algorithm:
A search is the
most commonly known form of
cost to reach the node n best-first search. It uses heuristic function
from the start state h(n), and
first search, by which it solve g(n). It has combined features of UCS
and greedy best-
the problem efficiently. A* search
through the search space algorithm
using the heuristic function. This search algorithm
finds the shortest path
and provides optimal result faster.
A* algorithm is similar to UCS expands less search tree
of gn). except that it uses g(n) +h(n) instead
In A search
algorithm, we use search heuristic as well as the cost to reach the node. Hence
combine both costs as we can
following, and this sum is called as a fitness number.
fln) = gln)+ hn)
Estimated cost
of the cheapest Cost to reach Cost to reach
solution. node nfrom fromnode n to
start state goalnode
Advantages
A search
o
algorithm is the best algorithm than other search algorithms.
o A search
algorithm is optimal and complete.
o This algorithm can solve
very complex problems
Disadvantages
o It does not always produce the shortest path as it mostly based on heuristics and approximation.
o A search algorithm has some
complexity issues
o The main drawback of A* is
memory requirement as it keeps all generated nodes in the memory, so it is
not practical for various
large-scale problems.
Example
In this example, we will traverse the given graph using the A* algorithm. The heuristic value of all states is given
in the below table will calculate the f(n) of each state
so we
using the formula f(n)= g(n) + h{n), where gln) is
the cost to reach any node from start state.
State h(n)
S
10
D
Solution:
o
teration 4 illgive the final result, as S--A--C-G it provides the optimal path with cos6
Points to remember
A algorithm returns the path which occured first, and it does not search for all renaining paths.
="" li="
A algorithrm expands all nodes which satisty the condition fn)e
Complete: A* algorithm is complete as long as:
Admissible: the first condition requires for optimality is that hin) should be an admissble heuristc for
A tree search. An admisible heuristic is optimistic in nature
Consistency Second required condtion is consistency for only A* graph-search
f t e r i s t c funct.on s adm ssibie teri ree seareh wil al ways fne the leat co pas
Time Complexity: The time complexity of A* search algorithm depends on heuristic function, and the
number of nodes expanded is exponential to the depth of solution d. So the time complexity is
O(bAd), where b is the branching factor
Space Complexity: The space complexity of A* search algorithm is O(b^d)
Objective function
Global maximun
shoulder
Local maximum
"flat" local maximumn
Shoulder: It is a
plateau region which has an uphill edge
20
o Less time
consuming
oLess optimal solution and the solution is not guaranteed
Algorithm for First-Choice Hill Climbing:
o
Step 1: Evaluate the initial state, if it is
goal state then return success and Stop.
Step 2: Loop Until a solution is found or there is no new operator left to apply.
Step 3: Select and apply an operator to the current state.
Local maximum
Plateau/Flat maximum
H+
3. Ridges: A ridge is a special form of the local maximum. It has an area which is higher than its
surrounding areas, but itself has a slope, and cannot be reached in a single move.
Solution: With the use of bidirectional search, or by moving in different directions, we can improve
this problem.
2
Ridge
MM
Simulated Annealing
A hill-climbing algorithm which never makes a move towards a lower value guaranteed to be incomplete
because it can get stuck on a local maximum.
In contrast, a purely random walk, that is, moving to a successor chosen uniformly at random from the set of
Simulated annealing is the algorithm which combines hill-climbing with a random walk. It gives both efficiency
and completeness
In mechanical term Annealing is a process of hardening a metal or glass to a high temperature then cooling
Otherwise, the algorithm follows the path which has a probability of less than 1.
The probability also decreases as the temperature T goes down. It means that "bad moves" are more likely to
be allowed at the start when T is high, and they become more unlikely as T decreases.
If the schedule lowers T slowly enough, the algorithm will find a global optimum
23
Algorithm-Simulated Annealing
function StMULATED ANNEALING
lnpat prolen, a problem returns asolution state
achall a mapping from
enipranun
rument MAKENoDE TAL STATE
foc Tto do
Tachedult
Othen retur en
AFna:VALUE uent,VALL
KAE0then ere
thecumret-t nly wih peobilg
Production of offspringstates
of child states are determined by their "fitness".
states which are participated in production
The parent
A
crossover point is chosen randomly from the positions in the strings.
ne
offsprings themselves are created by crossing over the parent strings at the crossover point.
E
Nondeterministic erratic Vacuum cleaner
I n the erratic vacuum
world, the suck action works as follows:
When applied to a dirty square, the action cleans the
square and sometimes cleans
square up dirt in the
adjacent
When applied to a clean
square, the action deposits the dirt.
AND-OR SEARCH TREES
In a
deterministic environment, the
nodes are called OR nodes. branching is done with the own choices of
agent in each state. Inese
In a
nondeterministic
action.These nodes areenvironment,
called AND
the branching is also done
the choice of environment for each
outcome or
nodes.