KNOWLEDGE-BASED AGENTS
The central component of a knowledge-based agent is its knowledge base, or KB. A knowledge base is a
set of sentences. (Here “sentence” is used as a technical term. It is related but not identical to the
sentences of English and other natural languages.) Each sentence is expressed in a language called a
knowledge representation language and represents some assertion about the world.
There must be a way to add new sentences to the knowledge base and a way to query what is known. The
standard names for these operations are TELL and ASK, respectively. Both operations may involve
inference—that is, deriving new sentences from old. Inference must obey the requirement that when one
ASKs a question of the knowledge base, the answer should follow from what has been told (or TELLed)
to the knowledge base previously.
The Wumpus World
The wumpus world is a cave consisting of rooms connected by passageways. Lurking somewhere in the
cave is the terrible wumpus, a beast that eats anyone who enters its room. The wumpus can be shot by an
agent, but the agent has only one arrow. Some rooms contain bottomless pits that will trap anyone who
wanders into these rooms (except for the wumpus, which is too big to fall in). The only mitigating feature
of this bleak environment is the possibility of finding a heap of gold.
A sample wumpus world is shown in above figure. The precise definition of the task environment is given
by the PEAS description:
• Performance measure: +1000 for climbing out of the cave with the gold, –1000 for falling into a pit or
being eaten by the wumpus, –1 for each action taken and –10 for using up the arrow. The game ends
either when the agent dies or when the agent climbs out of the cave.
• Environment: A 4×4 grid of rooms. The agent always starts in the square labeled [1,1], facing to the
right. The locations of the gold and the wumpus are chosen randomly, with a uniform distribution, from
the squares other than the start square. In addition, each square other than the start can be a pit, with
probability 0.2.
• Actuators: The agent can move Forward, TurnLeft by 90◦, or TurnRight by 90◦. The agent dies a
miserable death if it enters a square containing a pit or a live wumpus. (It is safe, albeit smelly, to enter a
square with a dead wumpus.) If an agent tries to move forward and bumps into a wall, then the agent does
not move. The action Grab can be used to pick up the gold if it is in the same square as the agent. The
action Shoot can be used to fire an arrow in a straight line in the direction the agent is facing. The arrow
continues until it either hits (and hence kills) the wumpus or hits a wall. The agent has only one arrow, so
only the first Shoot action has any effect. Finally, the action Climb can be used to climb out of the cave,
but only from square [1,1].
• Sensors: The agent has five sensors, each of which gives a single bit of information: – In the square
containing the wumpus and in the directly (not diagonally) adjacent squares, the agent will perceive a
Stench.
– In the squares directly adjacent to a pit, the agent will perceive a Breeze.
– In the square where the gold is, the agent will perceive a Glitter.
– When an agent walks into a wall, it will perceive a Bump.
– When the wumpus is killed, it emits a woeful Scream that can be perceived anywhere in the cave.
The percepts will be given to the agent program in the form of a list of five symbols; for example, if there
is a stench and a breeze, but no glitter, bump, or scream, the agent program will get [Stench, Breeze,
None, None, None].
PROPOSITIONAL LOGIC: A VERY SIMPLE LOGIC
The syntax of propositional logic defines the allowable sentences. The atomic sentences consist of a
single proposition symbol.
Each such symbol stands for a proposition that can be true or false.
We use symbols that start with an uppercase letter and may contain other letters or subscripts, for
example: P, Q, R, W1,3 and North.
The names are arbitrary but are often chosen to have some mnemonic value—we use W1,3 to stand for the
proposition that the wumpus is in [1,3]. (Remember that symbols such as W1,3 are atomic, i.e., W, 1, and
3 are not meaningful parts of the symbol.)
There are two proposition symbols with fixed meanings: True is the always-true proposition and False is
the always-false proposition.
Complex sentences are constructed from simpler sentences, using parentheses and logical connectives.
There are five connectives in common use:
¬ (not). A sentence such as ¬W1,3 is called the negation of W1,3. A literal is either an atomic sentence
(a positive literal) or a negated atomic sentence (a negative literal).
∧ (and). A sentence whose main connective is ∧, such as W1,3 ∧ P3,1, is called a conjunction; its parts
are the conjuncts. (The ∧ looks like an “A” for “And.”)
∨ (or). A sentence using ∨, such as (W1,3∧P3,1)∨W2,2, is a disjunction of the disjuncts (W1,3 ∧ P3,1) and
W2,2. (Historically, the ∨ comes from the Latin “vel,” which means “or.” For most people, it is easier to
remember ∨ as an upside-down ∧.)
⇒ (implies). A sentence such as (W1,3∧P3,1) ⇒ ¬ W2,2 is called an implication (or conditional). Its
premise or antecedent is (W1,3 ∧P3,1), and its conclusion or consequent is ¬W2,2. Implications are also
known as rules or if–then statements. The implication symbol is sometimes written in other books as ⊃
or →.
⇔ (if and only if). The sentence W1,3 ⇔ ¬W2,2 is a biconditional. Some other books write this as ≡.
Fig.1 Fig.2
Fig.3 Fig.4
A simple knowledge base
For now, we need the following symbols for each [x, y] location:
Px,y is true if there is a pit in [x, y].
Wx,y is true if there is a wumpus in [x, y], dead or alive.
Bx,y is true if the agent perceives a breeze in [x, y].
Sx,y is true if the agent perceives a stench in [x, y].
The sentences we write will suffice to derive ¬P1,2 (there is no pit in [1,2]).We label each sentence R i so
that we can refer to them:
There is no pit in [1,1]:
R1 : ¬P1,1 .
A square is breezy if and only if there is a pit in a neighboring square. This has to be stated for each
square; for now, we include just the relevant squares:
R2 : B1,1 ⇔ (P1,2 ∨ P2,1) .
R3 : B2,1 ⇔ (P1,1 ∨ P2,2 ∨ P3,1) .
The preceding sentences are true in all wumpus worlds. Now we include the breeze percepts for the first
two squares visited in the specific world the agent is in, leading up to the situation in Fig.2
R4 : ¬B1,1.
R5 : B2,1.
Propositional Theorem Proving
The first concept is logical equivalence: two sentences α and β are logically equivalent if they are true in
the same set of models. We write this as α ≡ β.
An alternative definition of equivalence is as follows: any two sentences α and β are equivalent only if
each of them entails the other:
α ≡ β if and only if α |= β and β |= α .
The second concept we will need is validity. A sentence is valid if it is true in all models. For example,
the sentence P ∨ ¬P is valid. Valid sentences are also known as tautologies—they are necessarily true.
Because the sentence True is true in all models, every valid sentence is logically equivalent to True. What
good are valid sentences? From our definition of entailment, we can derive the deduction theorem:
For any sentences α and β, α |= β if and only if the sentence (α ⇒ β) is valid.
The final concept we will need is satisfiability. A sentence is satisfiable if it is true in, or satisfied by,
some model. Satisfiability can be checked by enumerating the possible models until one is found that
satisfies the sentence.
Validity and satisfiability are of course connected: α is valid iff ¬α is unsatisfiable; contrapositively, α is
satisfiable iff ¬α is not valid. We also have the following useful result:
α |= β if and only if the sentence (α ∧ ¬β) is unsatisfiable.
Proving β from α by checking the unsatisfiability of (α ∧ ¬ β) corresponds exactly to the standard
mathematical proof technique of reductio ad absurdum (literally, “reduction to an absurd thing”). It is
also called proof by refutation or proof by contradiction. One assumes a sentence β to be false and
shows that this leads to a contradiction with known axioms α. This contradiction is exactly what is meant
by saying that the sentence (α ∧ ¬β) is unsatisfiable.
Inference and Proofs
Inference rules can be applied to derive a proof—a chain of conclusions that leads to the desired goal.
The best-known rule is called Modus Ponens (Latin for MODUS PONENS mode that affirms) and is
written
The notation means that, whenever any sentences of the form α ⇒ β and α are given, then the sentence β
can be inferred. For example, if (WumpusAhead ∧WumpusAlive) ⇒ Shoot and (WumpusAhead ∧
WumpusAlive) are given, then Shoot can be inferred.
Another useful inference rule is And-Elimination, which says that, from a conjunction, any of the
conjuncts can be inferred:
For example, from (WumpusAhead ∧ WumpusAlive), WumpusAlive can be inferred.
All of the logical equivalences can be used as inference rules. Following are some Standard logical
equivalences. The symbols α, β, and γ stand for arbitrary sentences of propositional logic.
Let us see how these inference rules and equivalences can be used in the wumpus world. We start with
the knowledge base containing R1 through R5 and show how to prove ¬ P1,2, that is, there is no pit in
[1,2]. First, we apply biconditional elimination to R2 to obtain
R6: (B1,1 ⇒ (P1,2 ∨ P2,1)) ∧ ((P1,2 ∨ P2,1) ⇒ B1,1) .
Then we apply And-Elimination to R6 to obtain
R7: ((P1,2 ∨ P2,1) ⇒ B1,1) .
Logical equivalence for contrapositives gives
R8: (¬B1,1 ⇒ ¬(P1,2 ∨ P2,1)).
Now we can apply Modus Ponens with R8 and the percept R4 (i.e., ¬B1,1), to obtain
R9 : ¬(P1,2 ∨ P2,1).
Finally, we apply De Morgan’s rule, giving the conclusion
R10 : ¬P1,2 ∧ ¬P2,1.
That is, neither [1,2] nor [2,1] contains a pit.
Proof by Resolution
The current section introduces a single inference rule, resolution, that yields a complete inference
algorithm when coupled with any complete search algorithm.
We begin by using a simple version of the resolution rule in the wumpus world. Let us consider the steps
leading up to Fig.3: the agent returns from [2,1] to [1,1] and then goes to [1,2], where it perceives a
stench, but no breeze. We add the following facts to the knowledge base:
R11 : ¬B1,2 .
R12 : B1,2 ⇔ (P1,1 ∨ P2,2 ∨ P1,3) .
By the same process that led to R 10 earlier, we can now derive the absence of pits in [2,2] and [1,3]
(remember that [1,1] is already known to be pitless):
R13 : ¬P2,2 .
R14 : ¬P1,3 .
We can also apply biconditional elimination to R 3, followed by Modus Ponens with R 5, to obtain the fact
that there is a pit in [1,1], [2,2], or [3,1]:
R15 : P1,1 ∨ P2,2 ∨ P3,1 .
Now comes the first application of the resolution rule: the literal ¬P2,2 in R13 resolves with the literal P2,2
in R15 to give the resolvent
R16 : P1,1 ∨ P3,1 .
In English; if there’s a pit in one of [1,1], [2,2], and [3,1] and it’s not in [2,2], then it’s in [1,1] or [3,1].
Similarly, the literal ¬P1,1 in R1 resolves with the literal P1,1 in R16 to give
R17 : P3,1 .
These last two inference steps are examples of the unit resolution inference rule,
where each l is a literal and li and m are complementary literals (i.e., one is the negation of the other).
Thus, the unit resolution rule takes a clause—a disjunction of literals—and a literal and produces a new
clause. Note that a single literal can be viewed as a disjunction of one literal, also known as a unit clause.
The unit resolution rule can be generalized to the full resolution rule,
where li and mj are complementary literals. This says that resolution takes two clauses and produces a new
clause containing all the literals of the two original clauses except the two complementary literals.
Conjunctive Normal Form (CNF)
The resolution rule applies only to clauses (that is, disjunctions of literals), so it would seem to be
relevant only to knowledge bases and queries consisting of clauses.
Every sentence of propositional logic is logically equivalent to a conjunction of clauses. A sentence
expressed as a conjunction of clauses is said to be in conjunctive normal form or CNF
We now describe a procedure for converting to CNF. We illustrate the procedure by converting the
sentence B1,1 ⇔ (P1,2 ∨ P2,1) into CNF. The steps are as follows:
1. Eliminate ⇔, replacing α ⇔ β with (α ⇒ β) ∧ (β ⇒ α).
(B1,1⇒(P1,2 ∨ P2,1)) ∧ ((P1,2 ∨ P2,1) ⇒B1,1) .
2. Eliminate ⇒, replacing α ⇒ β with ¬α ∨ β:
(¬B1,1 ∨ P1,2 ∨ P2,1) ∧ (¬(P1,2 ∨ P2,1) ∨ B1,1) .
3. CNF requires ¬ to appear only in literals, so we “move ¬ inwards” by repeated application of the
following equivalences:
¬(¬α) ≡ α (double-negation elimination)
¬(α ∧ β) ≡ (¬α ∨ ¬β) (De Morgan)
¬(α ∨ β) ≡ (¬α ∧ ¬β) (De Morgan)
In the example, we require just one application of the last rule:
(¬B1,1 ∨ P1,2 ∨ P2,1) ∧ ((¬P1,2 ∧ ¬ P2,1) ∨ B1,1) .
4. Now we have a sentence containing nested ∧ and ∨ operators applied to literals. We
apply the distributivity law from Figure 7.11, distributing ∨ over ∧ wherever possible.
(¬B1,1 ∨ P1,2 ∨ P2,1) ∧ (¬P1,2 ∨ B1,1) ∧ (¬P2,1∨ B1,1) .
The original sentence is now in CNF, as a conjunction of three clauses (or Conjunction of disjunction of
literals).
A Resolution Algorithm
Inference procedures based on resolution work by using the principle of proof by contradiction. That is, to
show that KB |= α, we show that (KB ∧ ¬α) is unsatisfiable. We do this by proving a contradiction.
First, (KB ∧ ¬α) is converted into CNF.
Then, the resolution rule is applied to the resulting clauses.
Each pair that contains complementary literals is resolved to produce a new clause, which is
added to the set if it is not already present.
The process continues until one of two things happens:
o there are no new clauses that can be added, in which case KB does not entail α; or,
o two clauses resolve to yield the empty clause, in which case KB entails α.
The empty clause—a disjunction of no disjuncts—is equivalent to False because a disjunction is true only
if at least one of its disjuncts is true. Another way to see that an empty clause represents a contradiction is
to observe that it arises only from resolving two complementary unit clauses such as P and ¬P.
We can apply the resolution procedure to a very simple inference in the wumpus world. When the agent is
in [1,1], there is no breeze, so there can be no pits in neighboring squares.
The relevant knowledge base is
KB = R2 ∧ R4 = (B1,1 ⇔ (P1,2 ∨ P2,1)) ∧ ¬B1,1
and we wish to prove α which is, say, ¬P1,2.
(¬P1,2 is shown to follow from the first four clauses in the top row.)
When we convert (KB ∧ ¬α) into CNF, we obtain the clauses shown at the top of the above figure. The
second row of the figure shows clauses obtained by resolving pairs in the first row.
Then, when P1,2 is resolved with ¬P1,2, we obtain the empty clause, shown as a small square.
Inspection of the above figure reveals that many resolution steps are pointless. For example, the clause
B1,1∨¬B1,1∨P1,2 is equivalent to True ∨ P1,2 which is equivalent to True. Deducing that True is true is not
very helpful.
Therefore, any clause in which two complementary literals appear can be discarded.
Horn clauses and definite clauses
One such restricted form is the definite clause, which is a disjunction of literals of which exactly one is
positive. For example, the clause (¬L1,1 ∨¬Breeze ∨B1,1) is a definite clause, whereas (¬B1,1 ∨ P1,2 ∨
P2,1) is not.
Slightly more general is the Horn clause, which is a disjunction of literals of which at most one is
positive. So all definite clauses are Horn clauses, as are clauses with no positive literals; these are called
goal clauses. Horn clauses are closed under resolution: if you resolve two Horn clauses, you get back a
Horn clause.
Knowledge bases containing only definite clauses are interesting for three reasons:
1. Every definite clause can be written as an implication whose premise is a conjunction of positive
literals and whose conclusion is a single positive literal.
For example, the definite clause ( ¬ L1,1 ∨ ¬ Breeze ∨B1,1) can be written as the implication (L 1,1 ∧
Breeze) ⇒ B1,1. In the implication form, the sentence is easier to understand: it says that if the agent is in
[1,1] and there is a breeze, then [1,1] is breezy.
In Horn form, the premise is called the body and the conclusion is called the head.
A sentence consisting of a single positive literal, such as L 1,1, is called a fact. It too can be written in
implication form as True ⇒ L1,1, but it is simpler to write just L1,1.
2. Inference with Horn clauses can be done through the forward- chaining and backward- chaining
algorithms. Both of these algorithms are natural, in that the inference steps are obvious and easy for
humans to follow. This type of inference is the basis for logic programming.
3. Deciding entailment with Horn clauses can be done in time that is linear in the size of the knowledge
base—a pleasant surprise.
Forward and backward chaining
The forward-chaining algorithm determines if a single proposition symbol q—the query—is entailed by
a knowledge base of definite clauses. It begins from known facts (positive literals) in the knowledge base.
If all the premises of an implication are known, then its conclusion is added to the set of known facts. For
example, if L1,1 and Breeze are known and (L1,1 ∧ Breeze) ⇒ B1,1 is in the knowledge base, then B1,1 can
be added. This process continues until the query q is added or until no further inferences can be made.
The best way to understand the algorithm is through an example and a picture. Consider the following
two figures (a) and (b).
Figure (a) shows a simple knowledge base of Horn clauses with A and B as known facts. Figure (b)
shows the same knowledge base drawn as an AND–OR graph.
In AND–OR graphs, multiple links joined by an arc indicate a conjunction—every link must be proved—
while multiple links without an arc indicate a disjunction—any link can be proved.
It is easy to see how forward chaining works in the graph. The known leaves (here, A and B) are set, and
inference propagates up the graph as far as possible.
Wherever a conjunction appears, the propagation waits until all the conjuncts are known before
proceeding.
Forward chaining is an example of the general concept of data-driven reasoning—that is, reasoning in
which the focus of attention starts with the known data. It can be used within an agent to derive
conclusions from incoming percepts, often without a specific query in mind.
The backward-chaining algorithm, as its name suggests, works backward from the query. If the query q
is known to be true, then no work is needed. Otherwise, the algorithm finds those implications in the
knowledge base whose conclusion is q. If all the premises of one of those implications can be proved
true(by backward chaining), then q is true. When applied to the query Q in the above Figure, it works
back down the graph until it reaches a set of known facts, A and B, that forms the basis for a proof.
Backward chaining is a form of goal-directed reasoning. It is useful for answering specific questions
such as “What shall I do now?” and “Where are my keys?” Often, the cost of backward chaining is much
less than linear in the size of the knowledge base, because the process touches only relevant facts.