Notes: A* Algorithm

CPSC 352 -- Artificial Intelligence
Notes: A* Algorithm

Admissibility, Monotonicity, Informedness

Three questions we ask about heuristic algorithms?

Is it guaranteed to find the shortest path?
Is it better than another heuristic?
Does it make steady progress toward a goal?

Admissibility

A search algorithm is admissible if it is guaranteed to find a minimal path to a solution whenever such a solution exists. Breadth first search is admissible, because it looks at every state at level n before considering any state at level n+1.

The A^* (A star) Algorithm

Let's characterize a class of admissible heuristic search strategies, using the evaluation function:

f(n) = g(n) + h(n)

As we saw in previous notes, g(n) represents the actual distance at which the state n has been found in the graph, and h(n) is the heuristic estimate of the distance from n to a goal state. So f(n) represents an estimate of the total cost of the path from the start, through n to the goal.

Let's define the evaluation function f^*:

f^*(n) = g^*(n) + h^*(n)

where g^*(n) is the cost of the shortest path from the start to n and h^*(n) returns the actual cost of the shortest path from n to the goal. So f^* is the actual cost of the optimal path from the start to the goal through n.

We call the function f^* an oracle -- an ideal evaluation function that can see the shortest path from the start to the goal. Of course, oracles of this type don't exist for most search problems. For real problems, we want an evaluation function, f, that approaches f^*

Algorithm A: In algorithm A, g(n), the cost of the current path from start to n, is a reasonable estimate of g^*:

g(n) >= g^*(n)

g(n) and g^*(n) will only be equal if the search has found the optimal path to n. Similarly, in algorithm A, we replace h^*(n) with h(n). Although we can't actually compute h^*(n), we can often determine whether h(n) is bounded by h^*(n) -- that is, we can often determine that h(n) is less than or equal to the actual cost of a minimal path (h^*(n)).

Algorithm A: Algorithm A is the best-first search algorithm plus the evaluation function f(n) = g(n) + h(n).

Algorithm A^*: is algorithm A in which

h(n) <= h^*(n)

for all states n. In other words, if an algorithm uses an evaluation function that underestimates the cost to the goal it is an A^* algorithm.

Key Point: All A^* algorithms are admissible.

Breadth-first Search is an A^* algorithm in which

f(n) = g(n) + 0

In other words, bread-first search uses a trivial estimate of the distance to the goal.

Route Finding Example: For route-finding problems, the straight-line distance from city n to a goal city is a lower bound on the distance of an optimal route from n to the goal.

Eight puzzle Example: The heuristic of number of tiles out-of-place is certainly less than the actual number of moves to the goal state. So this heuristic (combined with best-first search) is an admissible algorithm. So is the heuristic sum of the distances of out-of-place tiles, because it too underestimates the actual number of moves required to reach a goal state.

Monotonicity

This property asks if an algorithm is locally admissible---that is, it always underestimates the cost between any two states in the search space. Recall that A^* does not require that g(n) = g^*(n). A heuristic function, h is monotone if:

For all states n_i and n_j, where n_j is a descendant of n_i, h(n_i) - h(n_j) <= cost(n_i,n_j).
The heuristic evaluation of the goal state is 0: h(Goal) = 0.

In other words, for any two states in the search space, a monotonic heuristic always underestimates the cost of going from n_i to n_j. The heuristic is everywhere admissible.

If best-first search is used with a monotonic heuristic, you can skip checking for shortest path when a state is encountered a second time. The second occurrence will not be shorter because the heuristic finds the shortest path to any state the first time the state is found. Thus, for a monotone heuristic, searching a graph is no different from searching a tree.

Any monotonic heuristic is admissible. For some sequence of states, s₁, s₂, s₃,...,s_g, from start to goal, if the heuristic underestimates the cost of going from s₁ to s₂ and from s₂ to s₃ and so on, then it underestimates the cost of going from any state to the goal. So, it follows that h(n) <= cost(s_n, s_g) = h^*(n).

Informedness

For two A^* heuristics, h₁ and h₂, if h₁(n) <= h₂(n), for all states n in the search space, then heuristic h₂ is more informed than h₁.

Eight Puzzle Heuristics: Breadth-first search is an A^* heuristic with h₁(n) = 0, for all n. We have also noted that h₂, the number of tiles out of place is a lower bound for h^*. So it follows that:

h₁ <= h₂ < h^*(n)

Therefore, "number of tiles out of place" is more informed than breadth-first heuristic. This should be obvious.

The following figure illustrates the difference between the "number of out-of-place tiles" heuristic and breadth-first search. The heavy dark line shows the optimal solution path. The states that show the numbers on the tiles are the states that would be the portion of the space that is searched by the "tiles out-of-place" heuristic. The rest of the states shown are the ones that would also be examined by breadth-first search. You can see the significant pruning done by the more informed heuristic.

Eight Puzzle Heuristic