MCA2 CA435 M1 Intro Agent Problem Solving
MCA2 CA435 M1 Intro Agent Problem Solving
INTELLIGENCE
Text books:
[T1] Russel Norvig Artificial Intelligence A Modern Approah, 2nd edition, Prentice Hall
[RefT1] https://github.com/aimacode
https://github.com/aimacode/aima-pseudocode/blob/master/algorithms.pdf
                           Slides contain content adapted from the text/reference books for lectures to BIT Students only
                                Syllabus
•   MODULE-I
•   Introduction: Overview of Artificial Intelligence- Problems of AI, AI Technique,
    Tic-Tac-Toe Problem.
•   Intelligent Agents: Agents & Environment, Nature of Environment, Structure of
    Agents, Goal Based Agents, Utility Based Agents, Learning Agents.
•   Problem Solving: Problems, Problem Space & Search: Defining the Problem as
    State Space Search, Production System, Problem Characteristics, Issues In The
    Design Of Search Programs.
                Target Sample Questions
1.   What is AI technique? What is Turing-test? Explain different application areas of AI
2.   With suitable example show the difference between AI programming and conventional
     programming.
3.   Relationship/ contrast between AI, ML, DL, and logic programming
4.   What is an agent? Describe briefly how an agent should act in a computer program
5.   Explain i) goal-based agents ii) utility-based agents iii) simple reflex agents iv) model based reflex
     agents v) learning agents
6.   Differentiate between – i) Deterministic and stochastic environment ii) Episodic and sequential
     environment
7.   How AI technique can be applied for Tic-Tac-Toe problem? Explain in details
8.   Suggest one application of artificial intelligence (AI) in each of the following indicating the
     preceptor, environment, actuator, and performance measure- i) healthcare ii) housekeeping iii) public
     transport iv) banking v) manufacturing and production vi) Defence vii) space mission viii)
     environment care ix) agriculture x) e-Governancexi) Education
9.   What capability the following components add to a computing system to make it act like human -
     natural language processing, knowledge representation, automated reasoning, machine learning,
     computer vision, robotics
                    Module I
                   AI Concepts
    “Can a machine think and behave like humans do?”
• Defining Intelligence
• Introduction to AI and Intelligent Agents
• AI problems and Solution approaches
• Problem solving using Search and Heuristics
• AI Knowledge-base: creation, updation and
  reasoning,
• Broad category of branches in AI and
  intelligent Systems .
    Types of Intelligence [Howard Gardner]
A machine or a system is artificially intelligent when it is equipped with at least one
or all intelligences, as listed below, in it
                      Review Questions
•   Suggest the type of intelligence for following AI systems:
•   i) computer based medical diagnosis assistant ii) automized surgical system iii)
    brain tumor detector iv) vaccum cleaner v) automated washing machine vi)
    automated taxi vii) banking fraud detector viii) manufacturing part picking robot ix)
    automated surveillance drone x) new celestial body detector xi) earthquake
    detector xii) crop growth monitor xiii) geographical information system (GIS)
    based water management system xiv) crime monitoring xv) Intelligent tutoring
    system xvi) Gaming system for children with mental disorder xvii) Airport robot
    like RADA xviii) socially assistive robot like PARO, iCube, NAO and OHMNI
    xix) Pathfinder Game
                 Review Questions
• Examine the AI literature to discover whether the following
    tasks can currently be solved by computers:
a. Playing a decent game of table tennis (ping-pong).
b. Driving in the center of Cairo.
c. Buying a week's worth of groceries at the market.
d. Buying a week’s worth of groceries on the web.
e. Playing a decent game of bridge at a competitive level.
f. Discovering and proving new mathematical theorems.
g. Writing an intentionally funny story.
h. Giving competent legal advice in a specialized area of law.
i. Translating spoken English into spoken Swedish in real time.
j. Performing a complex surgical operation.
       What is Intelligence Composed Of?
• Reasoning - It is the set of processes that enable us to provide
   basis for judgement, making decisions, and prediction.
    – Inductive reasoning/Deductive reasoning
• Learning
   – Auditory/ Episodic/ Motor/ Observational/ Perceptual/
      Relational/ Sensory/
• Problem Solving - includes decision making, which is the process
  of selecting the best suitable alternative out of multiple alternatives
  to reach the desired goal.
• Perception - process of acquiring, interpreting, selecting, and
  organizing sensory information.
• Linguistic Intelligence - ability to use, comprehend, speak,
  and write the verbal and written language.
•   Philosophy
                                Foundations of AI
     –   Can formal rules be used to draw valid conclusions?
     –   How does the mental mind arise from a physical
     –   Where does knowledge come from?
     –   How does knowledge lead to action?
•   Mathematics
     –   Can formal rules be used to draw valid conclusions?
     –   How does the mental mind arise from a physical brain?
     –   Where does knowledge come from?
     –   How does knowledge lead to action?
•   Economics
     –   What are the formal rules to draw valid
     –   What can be computed?
     –   How do we reason with uncertain information?
•   Neuroscience
     –   How should we make decisions so as to maximize payoff?
     –   How should we do this when others may not go along?
     –   How should we do this when the payoff may be in the future?
•   Psychology
     –   How do humans and animals think and act?
•   Computer Engineering
     –   How can we build an efficient computer?
•   Control Theory and Cybernetics
     –   How can artifacts operate under their own control?
•   Linguistics
     –   How does language relate to thought?
[R2. pp-VI]
    Module I : AI Concepts- Introduction to AI
      https://www.cs.utexas.edu/~mooney/cs343/slide-handouts/intro.4.pdf
➢ Definition of AI- Art of designing Systems that { think like humans +
  think rationally + act like humans + act rationally }
➢ Thinking Humanly: Cognitive Modeling- Interdisciplinary field (AI,
  psychology, linguistics, philosophy, anthropology) that tries to form
  computational theories of human cognition
➢ Thinking Rationally : Laws of Thought- Formalize “correct”
  reasoning using a mathematical model (e.g. of deductive reasoning)
➢ Acting Humanly: The Turing Test - If the response of a computer to
  an unrestricted textual natural-language conversation cannot be
  distinguished from that of a human being then it can be said to be
  intelligent
➢ Acting Rationally: Rational Agents- rationality involves maximizing
  goals within the computational and other resources available.
    Module I : AI Concepts- Introduction to AI
      https://www.cs.utexas.edu/~mooney/cs343/slide-handouts/intro.4.pdf
➢ Foundations of AI: Philosophy, Mathematics, Psychology; Computer
  Science; Linguistics
                                                                               20
                 …Rational agents
• Agents can perform actions in order to modify future percepts
  so as to obtain useful information (information gathering,
  exploration)
• An agent is autonomous if its behavior is determined by its own
  experience (with ability to learn and adapt)
                                                              21
             Vacuum-cleaner world
      [R3] Russel Norvig Artificial Intelligence A Modern Approah, 2nd edition, Prentice Hall
                                                                                                22
                                           PEAS
                                                  24
                       PEAS
• Agent: Part-picking robot
   – Performance measure: Percentage of parts in
     correct bins
   – Environment: Conveyor belt with parts, bins
   – Actuators: Jointed arm and hand
   – Sensors: Camera, joint angle sensors
                                                   25
                                PEAS
• Agent: Interactive English tutor
   –   Performance measure: Maximize student's score on test
   –   Environment: Set of students
   –   Actuators: Screen display (exercises, suggestions, corrections)
   –   Sensors: Keyboard
                                                                         26
                           Rational Agent?
Definition of a rational agent: What is rational at any
given time depends on four things:
• The performance measure that defines the criterion of
    success.
• The agent's prior knowledge of the environment.
• The actions that the agent can perform.
• The agent's percept sequence to date.
                                              30
                        Review Questions
2.3 This exercise explores the differences between agent functions and agent
programs.
   a. Can there be more than one agent program that implements a given agent function? Give
   an example, or show why one is not possible.
   b. Are there agent functions that cannot be implemented by any agent program?
   c. Given a fixed machine architecture, does each agent program implement exactly one agent
   function?
   d. Given an architecture with n bits of storage, how many different possible agent programs
   are there?
2.4 Let us examine the rationality of various vacuum-cleaner agent functions.
   a. Show that the simple vacuum-cleaner agent function described in Figure 2.3 is indeed
   rational under the assumptions listed on page 36.
   b. Describe a rational agent function for the modified performance measure that deducts one
   point for each movement. Does the corresponding agent program require internal state?
   c. Discuss possible agent designs for the cases in which clean squares can become dirty and
   the geography of the environment is unknown. Does it make sense for agent to learn from its
   experience in these cases? If so, what should it learn?
                        Review Questions
2.12 The vacuum environments in the preceding exercises have all been deterministic.
Discuss possible agent programs for each of the following stochastic versions:
a. Murphy's law: 25% of the time, the action fails to clean the floor if it is dirty and
deposits dirt onto the floor if the floor is clean. How is your agent program affected if
the dirt sensor gives the wrong answer 10% of the time?
b. Small children: At each time step, each clean square has a 10% chance of becoming
dirty. Can you come up with a rational agent design for this case?
                Agent types
• Four basic types in order of increasing
  generality:
  – Simple reflex agents
  – Model-based reflex agents
  – Goal-based agents
  – Utility-based agents
                                            33
                  Simple reflex agents
                                                                                                 34
       [R3] Russel Norvig Artificial Intelligence A Modern Approah, 2nd edition, Prentice Hall
• Simple reflex agents act only on the basis of the current percept,
  ignoring the rest of the percept history. The agent function is based
  on the condition-action rule: "if condition, then action".
https://aimacode.github.io/aima-exercises/agents-exercises/
                   Agent Design Paradigms
•   [ https://hellotars.com/blog/understanding-ai-agents-and-environments-a-comprehensive-guide ]
•   Learning from environments: Reinforcement learning: Reinforcement
    learning is a method where agents learn from interactions with their
    environment. Key concepts include:
      • Rewards and penalties: Agents receive rewards or penalties based on
        their actions to reinforce desirable behaviors (such as the robot
        vacuum in the previous example).
                                                                                          43
[R3] Russel Norvig Artificial Intelligence A Modern Approah, 2nd edition, Prentice Hall
• Goal-based agents only distinguish between goal states and non-goal
  states.
                                                                                          45
[R3] Russel Norvig Artificial Intelligence A Modern Approah, 2nd edition, Prentice Hall
               Learning Element
• The most important distinction is between the
  – "learning element", which is responsible for making
    improvements: uses feedback from the "critic" on
    how the agent is doing and determines how the
    performance element should be modified to do better
    in the future.
  – "performance element", which is responsible for
    selecting external actions: what we have previously
    considered to be the entire agent: it takes in percepts
    and decides on actions
  – "problem generator“: responsible for suggesting
    actions that will lead to new and informative
    experiences.
           Module I : AI Concepts- Intelligent Agents
                 [Russel] [http://aima.cs.berkeley.edu/algorithms.pdf]
TABLE-DRIVEN-AGENT; REFLEX-VACUUM-AGENT; SIMPLE-REFLEX-AGENT; MODEL-BASED-REFLEX-AGENT
                                                def TableDrivenAgentProgram(table):
                                                  """
                                                  [Figure 2.7]
                                                  This agent selects an action based on the
                                                percept sequence.
                                                  It is practical only for tiny domains.
                                                  To customize it, provide as table a dictionary
                                                of all
                                                  {percept_sequence:action} pairs.
                                                  """
                                                  percepts = []
                                                   def program(percept):
                                                     percepts.append(percept)
                                                     action = table.get(tuple(percepts))
                                                     return action
return program
                                                         def program(percept):
                                                           state = interpret_input(percept)
                                                           rule = rule_match(state, rules)
                                                           action = rule.action
                                                           return action
return program
                 [R3] Russel Norvig Artificial Intelligence A Modern Approah, 2nd edition, Prentice Hall
                  Python code at - https://github.com/aimacode/aima-python/blob/master/agents.py
          Module I : AI Concepts- Intelligent Agents
                  [Russel] [http://aima.cs.berkeley.edu/algorithms.pdf]
SEARCH BASED PROBLEM SOLVING AGENTS
          [R3] Russel Norvig Artificial Intelligence A Modern Approah, 2nd edition, Prentice Hall
                              Implement Agent
              https://github.com/aimacode/aima-python/blob/master/agents.py
An agent program is a callable instance, taking
percepts and choosing actions
                                                    54
• 1. Self-Driving Car
    • Static vs. Dynamic: Dynamic (The environment changes over time, such as
      traffic lights, other vehicles, and pedestrians.)
    • Discrete vs. Continuous: Continuous (The car's position, speed, and steering
      require continuous values.)
    • Deterministic vs. Stochastic: Stochastic (The outcomes depend on other
      drivers, weather, and unpredictable events.)
    • Episodic vs. Sequential: Sequential (Decisions impact future states, such as
      navigating an intersection affecting subsequent paths.)
• 2. Chess-Playing AI
    • Static vs. Dynamic: Static (The board does not change unless the agent or
      opponent moves.)
    • Discrete vs. Continuous: Discrete (The board has finite positions and moves.)
    • Deterministic vs. Stochastic: Deterministic (Outcomes of moves are
      predictable and depend only on the current state and the agent’s actions.)
    • Episodic vs. Sequential: Sequential (Every move impacts the game’s future
      state.)
• 3. Online Shopping Recommender System
   •   Static vs. Dynamic: Dynamic (Customer preferences and inventory may change in real-
       time.)
   •   Discrete vs. Continuous: Discrete (Actions involve selecting from a finite set of
       recommendations.)
   •   Deterministic vs. Stochastic: Stochastic (User behavior can be unpredictable;
       recommendations might not always lead to purchases.)
   •   Episodic vs. Sequential: Episodic (Each recommendation episode is independent of
       previous ones.)
• 4. Robot Vacuum Cleaner
   •   Static vs. Dynamic: Dynamic (The environment may change, such as humans moving
       furniture or dropping objects.)
   •   Discrete vs. Continuous: Discrete (If the cleaner divides the room into a grid) or
       Continuous (If it operates freely in space.)
   •   Deterministic vs. Stochastic: Stochastic (Outcomes like moving to a spot might be
       affected by sensor noise or unexpected obstacles.)
   •   Episodic vs. Sequential: Sequential (Cleaning a particular area impacts the next steps,
       such as avoiding cleaned areas.)
• 5. Medical Diagnosis AI
   •   Static vs. Dynamic: Static (Data remains fixed during analysis.)
   •   Discrete vs. Continuous: Discrete (Symptoms and test results may have categorical
       outcomes) or Continuous (Some results, like blood pressure, are measured on a scale.)
   •   Deterministic vs. Stochastic: Stochastic (Patient conditions and responses to treatment
       can be uncertain.)
   •   Episodic vs. Sequential: Episodic (Each diagnosis is a separate event.)
• 6. Weather Forecasting System
   •   Static vs. Dynamic: Dynamic (Weather conditions change continuously.)
   •   Discrete vs. Continuous: Continuous (Weather parameters like temperature, humidity,
       and wind speed are continuous values.)
   •   Deterministic vs. Stochastic: Stochastic (Weather depends on complex and
       unpredictable factors.)
   •   Episodic vs. Sequential: Sequential (Forecasting today’s weather impacts predictions for
       the coming days.)
       Practical Resources- Agents
• Solution – agent design vaccum cleaner
   – https://github.com/abhin4v/russell-norvig-ai-
     problems/blob/master/chapter2/AI/Vacuum/Cleaner.hs
   – https://github.com/abhin4v/russell-norvig-ai-
     problems/tree/master/chapter2
• Execute on Haskell online editor
   – https://www.tutorialspoint.com/compile_haskell_online.php
• Python Solution – Agent design – “Dog in a Park”
   – https://github.com/aimacode/aima-
     python/blob/master/agents.ipynb
   – GitHub - aimacode/aima-python: Python implementation of
     algorithms from Russell And Norvig's "Artificial Intelligence - A
     Modern Approach"
PROBLEM SOLVING
   Problem Description- Four components
         => State-space description
• Initial state that the agent starts in.
• Successor function - A description of the possible actions available to the
  agent. Given a particular state x, returns a set of (action, successor)
  ordered pairs, where each action is one of the legal actions in state x and
  each successor is a state that can be reached from x by applying the
  action.
    – ➔ State space – The set of all states reachable from the initial state. The state
      space forms a graph in which the nodes are states and the arcs between
      nodes are actions. A path in the state space is a sequence of states connected
      by a sequence of actions.
• The goal test, which determines whether a given state is a goal state.
• A path cost function that assigns a numeric cost to each path. The
  problem-solving agent chooses a cost function that reflects its own
  performance measure.
    – The step cost of taking action a to go from state x to state y is denoted by C(x,
      a , y).
    – an optimal solution has the lowest path cost among all solutions.
                 Problem Solving
•Rational agents need to perform sequences of actions in order
   to achieve goals.
•Intelligent behavior can be generated by having a look-up table
   or reactive policy that tells the agent what to do in every
   circumstance, but:
   -    Such a table or policy is difficult to build
   -    All contingencies must be anticipated
•A more general approach is for the agent to have knowledge of
   the world and how its actions affect it and be able to
   simulate execution of actions in an internal model of the
   world in order to determine a sequence of actions that will
   accomplish its goals.
•This is the general task of problem solving and is typically
   performed by searching through an internally modeled space  61
   of world states.
                Problem Solving Task
(after internal model of the problem has been specified)
•Given:
-An initial state of the world
-A set of possible actions or operators (with constraints) that can be
    performed.
-A goal test that can be applied to a single state of the world to determine if it
    is a goal state.
•Find:
-A solution stated as a path of states and operators that shows how to
    transform the initial state into one that satisfies the goal test.
•The initial state and set of operators implicitly define a space of states of the
    world and operator transitions between them. May be infinite.
                                                                                62
        Measuring Performance
•Path cost: a function that assigns a cost to a path,
  typically by summing the cost of the individual
  operators in the path. May want to find minimum
  cost solution.
                                                        63
                  Problem types
• Deterministic, fully observable → single-state problem
   – Agent knows exactly which state it will be in; solution is a
     sequence
• Non-observable → sensorless problem (conformant problem)
   – Agent may have no idea where it is; solution is a sequence
• Nondeterministic and/or partially observable → contingency
  problem
   – percepts provide new information about current state
   – often interleave {search, execution}
• Unknown state space → exploration problem
                                                             64
 Module I : AI Concepts- AI problems and Solution approaches
 [Russel/ Mooney] [https://www.cs.utexas.edu/~mooney/cs343/slide-handouts/search.4.pdf]
➢Route Finding,
➢8-Puzzle
➢8-Queen
➢Missionaries and cannibals
              Module I : AI Concepts- Solution Approaches
 [Russel/ Mooney] [https://www.cs.utexas.edu/~mooney/cs343/slide-handouts/search.4.pdf]
➢ Search Problem
    ➢ TSP- optimization problem – redefined as a search problem = to search min cost
      from the set of permutations:
         ➢ Given: Internal model: Graph G = <V,E> : Given set V of nodes, connectivity given as set E of
           edges : V={A,B,C,D}, n=|V|=4
         ➢ Given: Initial state: first node is A (=given source node): |{A}|=1
         ➢ Given: Goal state: sequence of nodes ending at A with all nodes included : |{A,_,_,_,A}
           |=n+1=5
         ➢ Find: output O= sequence of vertices with initially O={A} and at last O= {A,_,_,_,A}
         …==(to begin with A, add next connected node until getting a path returning to A )
         ➢ Find : optimization criterion: min cost path
         ➢ Find : constraint: s.t. will be back to Source node A only if |O|=4};
           e.g. if instantaneous |O|=|{A,C,B}|=3 will not get to O={A,C,B,A}. But when
           O={A,C,B,D} then it will get to O={A,C,B,D,A} and return as one possible solution.
               ➢ TSP as problem of search across S= (permutation of nodes in V with A as first node; also
                    as suffix. ➔ {perm({A,B,C,D})} and accept min cost permutation :This constraint is
                    implicit if we are evaluating all possible permutations of ABCD having A as the first
                    node.
➢ Solution (permutation-vector) search space (n) = n!
➢ Approaches
    ➢ Uninformed search
    ➢ Informed search
             Module I : AI Concepts- Solution Approaches
[Russel/ Mooney] [https://www.cs.utexas.edu/~mooney/cs343/slide-handouts/search.4.pdf]
➢Search Problem
   ➢Knapsack- optimization problem – redefined as a search
    problem = to search max profit subset of objects from the set of
    objects with given wt. and profit and capacity:
        ➢ Given: Internal model: Set O = <W,P> : Given set V of objects, wt and
          profit of each
        ➢ Given: Initial state: first object: |{A}|=1
        ➢ Given: Goal state: subject of objects with maximum profit
        ➢ Find: output O= Subset of objects with maximum profit withing
          capacity C.
        ➢ Find : optimization criterion: max profit
        ➢ Find : constraint: s.t. sum of weights <= C
          Knapsack as problem of search across S= (subsets of n objects.).
➢Solution (subset vector) search space (n) = 2^n
➢Approaches
   ➢Uninformed search
   ➢Informed search
    Example: MinCost Path from Single Source=> Multistage
   Graph (internal model) => Informed Search based Solution
                                                                 69
Example: MultiStage Graph Search
                             71
      Example: The 8-puzzle Problem
                                                                              72
         More Realistic Problems
•   Route finding
•   Travelling salesman problem
•   VLSI layout
•   Robot navigation
•   Web searching
                                   73
              Searching Concepts
•A state can be expanded by generating all states that can be
   reached by applying a legal operator to the state.
•State space can also be defined by a successor function that
   returns all states produced by applying a single legal operator.
•A search tree is generated by generating search nodes by
   successively expanding states starting from the initial state as
   the root.
•A search node in the tree can contain
-Corresponding state
-Parent node
-Operator applied to reach this node
-Length of path from root to node (depth)
-Path cost of path from initial state to node                    74
                       Summary
• AI Agent’s process needs
   – internal model of the problem : matrix, graph, statistical
     formulation, linear equations, ….. : suitable definition of
     STATE and STATE-TRANSITIONS
   – Can be represented as optimization problem
   – Problem to be defined in proper format : {Given, FIND,
     Process}
   – Problems can be mapped as SEARCH problem- (simple
     reflex agents have obvious SEARCH task to refer to
     LookUpTable to decide action)
   – Suitable Search Method to be applied
      • Informed Search- If PATH-COST/SEARCH-COST can be determined
      • UnInformed Search- BFS, DFS
                Tic-Tac-Toe Problem
• https://github.com/aimacode/aima-python/blob/master/games4e.ipynb
                           e(p) = 6 - 5 = 1
   Initial State: Board position of 3x3 matrix with 0 and X.
   Operators: Putting 0’s or X’s in vacant positions alternatively
   Terminal test: Which determines game is over
   Utility function:
           e(p) = (No. of complete rows, columns or diagonals are
    still open for player ) – (No. of complete rows, columns or
    diagonals are still open for opponent )
                Minimax Algorithm
• Generate the game tree
• Apply the utility function to each terminal state to get its
  value
• Use these values to determine the utility of the nodes one
  level higher up in the search tree
   – From bottom to top
   – For a max level, select the maximum value of its successors
   – For a min level, select the minimum value of its successors
• From root node select the move which leads to highest value
            Game tree for Tic-Tac-Toe
Figure 6.1 A (partial) search tree for the game of tic-tac-toe. The top node is the initial
state, and MAX moves first, placing an in an empty square. We show part of the search
tree, giving alternating moves by MIN (0) and MAX, until we eventually reach terminal
states, which can be assigned utilities according to the rules of the game.
          Game tree for Tic-Tac-Toe
Figure 6.2 A two-ply game tree. The nodes are "MAX nodes," in which it is turn to
move, and the nodes are "MIN nodes." The terminal nodes show the utility
values for the other nodes are labeled with their minimax values. best move at
the root is because it leads to the successor with the highest minimax value, and
best reply is because it leads to the successor with the lowest minimax value.
Courtesy : Principles of Artificial Intelligence , Nilsson
        Properties of Minimax
                                                  X
                  X                                                             X               X’s Turn
e=8–4=4                                                          e=8–6=2
                                      e=8–5=3
                       X                              X                                  0’s Turn
                0                                     0
           e=5–4=1                        e=5–3=2
  Courtesy : CS621-Artificial Intelligence , 2007, Prof. Pushpak Bhatacharya
      Alpha-Beta Search Algorithm
• If the MAXIMIZER nodes already possess αmin values, then their current
  αmin value = Max (αmin value, α’min); on the other hand, if the
  MINIMIZER nodes already possess βmax values, then their current
  βmax value = Min (βmax value, β’max).