Chapter 4, Beyond Classical Search Video Solutions, Artificial Intelligence. A Modern Approach [Global Edition]

Problem 1

Give the name of the algorithm that results from each of the following special cases:
a. Local beam search with $=1$.
b. Local beam search with one initial state and no limit on the number of states retained.
c. Simulated annealing with $=0$ at all times (and omitting the termination test).
d. Simulated annealing with $=\infty$ at all times.
e. Genetic algorithm with population size $=1$.

Chris Trentman

Numerade Educator

Problem 2

Exercise 3.16 considers the problem of building railway tracks under the assumption that pieces fit exactly with no slack. Now consider the real problem, in which pieces don't fit exactly but allow for up to 10 degrees of rotation to either side of the "proper" alignment. Explain how to formulate the problem so it could be solved by simulated annealing.

Check back soon!

Problem 3

Generate a large number of 8-puzzle and 8-queens instances and solve them (where possible) by hill climbing (steepest-ascent and first-choice variants), hill climbing with random restart, and simulated annealing. Measure the search cost and percentage of solved problems and graph these against the optimal solution cost. Comment on your results.

Check back soon!

Problem 4

The AND-OR-GRAPH-SEARCH algorithm in Figure 4.11 checks for repeated states only on the path from the root to the current state. Suppose that, in addition, the algorithm were to store every visited state and check against that list. (See BREADTH-FIRST-SEARCH in Figure 3.11 for an example.) Determine the information that should be stored and how the algorithm should use that information when a repeated state is found. (Hint: You will need to distinguish at least between states for which a successful subplan was constructed previously and states for which no subplan could be found.) Explain how to use labels, as defined in Section 4.3.3, to avoid having multiple copies of subplans.

Check back soon!

Problem 5

Explain precisely how to modify the AND-OR-GRAPH-SEARCH algorithm to generate a cyclic plan if no acyclic plan exists. You will need to deal with three issues: labeling the plan steps so that a cyclic plan can point back to an earlier part of the plan, modifying OR-SEARCH so that it continues to look for acyclic plans after finding a cyclic plan, and augmenting the plan representation to indicate whether a plan is cyclic. Show how your algorithm works on (a) the slippery vacuum world, and (b) the slippery, erratic vacuum world. You might wish to use a computer implementation to check your results.

Check back soon!

Problem 6

In Section 4.4.1 we introduced belief states to solve sensorless search problems. A sequence of actions solves a sensorless problem if it maps every physical state in the initial belief state to a goal state. Suppose the agent knows $h^*()$, the true optimal cost of solving the physical state in the fully observable problem, for every state in . Find an admissible heuristic $h($ ) for the sensorless problem in terms of these costs, and prove its admissibilty. Comment on the accuracy of this heuristic on the sensorless vacuum problem of Figure 4.14. How well does $\mathrm{A}^*$ perform?

Check back soon!

01:44

Problem 7

This exercise explores subset-superset relations between belief states in sensorless or partially observable environments.
a. Prove that if an action sequence is a solution for a belief state, it is also a solution for any subset of . Can anything be said about supersets of ?
b. Explain in detail how to modify graph search for sensorless problems to take advantage of your answers in (a).
c. Explain in detail how to modify AND-OR search for partially observable problems, beyond the modifications you describe in (b).

Lucía Guerrero

Numerade Educator

Problem 8

On page 139 it was assumed that a given action would have the same cost when executed in any physical state within a given belief state. (This leads to a belief-state search problem with well-defined step costs.) Now consider what happens when the assumption does not hold. Does the notion of optimality still make sense in this context, or does it require modification? Consider also various possible definitions of the "cost" of executing an action in a belief state; for example, we could use the minimum of the physical costs; or the maximum; or a cost interval with the lower bound being the minimum cost and the upper bound being the maximum; or just keep the set of all possible costs for that action. For each of these, explore whether $\mathrm{A}^*$ (with modifications if necessary) can return optimal solutions.

Check back soon!

01:09

Problem 9

Consider the sensorless version of the erratic vacuum world. Draw the belief-state space reachable from the initial belief state $\{1,3,5.7\}$, and explain why the problem is unsolvable.

Tyler Moulton

Numerade Educator

Problem 10

We can turn the navigation problem in Exercise 3.7 into an environment as follows:
$\bullet$ The percept will be a list of the positions, relative to the agent, of the visible vertices. The percept does not include the position of the robot! The robot must learn its own position from the map; for now, you can assume that each location has a different "view."
$\bullet$ Each action will be a vector describing a straight-line path to follow. If the path is unobstructed, the action succeeds; otherwise, the robot stops at the point where its path first intersects an obstacle. If the agent returns a zero motion vector and is at the goal (which is fixed and known), then the environment teleports the agent to a random location (not inside an obstacle).
$\bullet$ The performance measure charges the agent 1 point for each unit of distance traversed and awards 1000 points each time the goal is reached.
a. Implement this environment and a problem-solving agent for it. After each teleportation, the agent will need to formulate a new problem, which will involve discovering its current location.
b. Document your agent's performance (by having the agent generate suitable commentary as it moves around) and report its performance over 100 episodes.
c. Modify the environment so that $30 \%$ of the time the agent ends up at an unintended destination (chosen randomly from the other visible vertices if any; otherwise, no move at all). This is a crude model of the motion errors of a real robot. Modify the agent so that when such an error is detected, it finds out where it is and then constructs a plan to get back to where it was and resume the old plan. Remember that sometimes getting back to where it was might also fail! Show an example of the agent successfully overcoming two successive motion errors and still reaching the goal.
d. Now try two different recovery schemes after an error: (1) head for the closest vertex on the original route; and (2) replan a route to the goal from the new location. Compare the performance of the three recovery schemes. Would the inclusion of search costs affect the comparison?
e. Now suppose that there are locations from which the view is identical. (For example, suppose the world is a grid with square obstacles.) What kind of problem does the agent now face? What do solutions look like?

Check back soon!

View

Problem 11

Suppose that an agent is in a $3 \times 3$ maze environment like the one shown in Figure 4.19. The agent knows that its initial location is $(3,3)$, that the goal is at $(1,1)$, and that the four actions Up, Down, Left, Right have their usual effects unless blocked by a wall. The agent does not know where the internal walls are. In any given state, the agent perceives the set of legal actions; it can also tell whether the state is one it has visited before or is a new state.
a. Explain how this online search problem can be viewed as an offline search in belief-state space, where the initial belief state includes all possible environment configurations. How large is the initial belief state? How large is the space of belief states?
b. How many distinct percepts are possible in the initial state?
c. Describe the first few branches of a contingency plan for this problem. How large (roughly) is the complete plan?
Notice that this contingency plan is a solution for every possible environment fitting the given description. Therefore, interleaving of search and execution is not strictly necessary even in unknown environments.

Shu Naito

Numerade Educator

Problem 12

In this exercise, we examine hill climbing in the context of robot navigation, using the environment in Figure 3.31 as an example.
a. Repeat Exercise 4.10 using hill climbing. Does your agent ever get stuck in a local minimum? Is it possible for it to get stuck with convex obstacles?
b. Construct a nonconvex polygonal environment in which the agent gets stuck.
c. Modify the hill-climbing algorithm so that, instead of doing a depth-1 search to decide where to go next, it does a depth- search. It should find the best -step path and do one step along it, and then repeat the process.
d. Is there some for which the new algorithm is guaranteed to escape from local minima?
e. Explain how LRTA* enables the agent to escape from local minima in this case.

Check back soon!

Problem 13

Relate the time complexity of LRTA* to its space complexity.

Check back soon!