Tom (Tomáš) Pecher

CS & AI Student | Aspiring Software Developer | Passion for ML and Wolfram

Section 1: Uninformed Search Algorithms

The Premise

As the name suggests, uninformed search problems are a class of problems in which we are searching for a specific solution but have no additional information about how to find it. Mazes are a good example: a person in a maze has to search for a path to the exit but is given no hints about which direction to go. In other words, the only way to find a solution to such a problem is to exhaustively search each path to the exit (the set of all possible paths in a search problem is referred to as the search space). There are a few different search strategies that can be used to find a solution to this kind of problem. As we shall see, how effective each strategy is depends on the shape of the search space.


Despite being terribly inefficient for large problems, uninformed search algorithms are still useful in practice for a number of reasons:

  • They require no domain-specific knowledge, so they can be applied to any problem that can be framed as a search.
  • They are simple to implement and to reason about.
  • They form the foundation for more advanced, informed algorithms such as A*.


In this lesson, we will look at four main types:

  • Breadth-First Search (BFS)
  • Depth-First Search (DFS)
  • Uniform Cost Search (UCS)
  • Bidirectional Search


Key Insight: The Frontier

All of these algorithms share the concept of a frontier. Let us imagine that we are presented with some graph (a set of nodes connected by a set of edges) and we are tasked with finding a path from a start node to a goal node. As we said, we do not have any additional information about the graph, so we have nothing better to do than to start searching outwards from the start node in an attempt to eventually build a path. However, we do not want to search randomly, as we may search some nodes twice whilst never visiting others: we need some way of keeping track of which nodes we have not yet visited. This is where the frontier comes in: at a given timestep, the frontier is the set of all nodes that we have not yet visited but that are adjacent to a node that has already been visited. In simpler terms, the frontier is the border between the set of nodes that have been searched and those that have not.

This is the key insight behind all of the following algorithms: if a node is in the frontier, then we know that there must be a path to it from the start node. Therefore, if we wish to find a path to the goal node, at any given timestep we only need to expand a node from the frontier, which removes it from the frontier but adds all of its unvisited neighbours. This process pushes the frontier further and further from the start node until we either find the goal node or run out of nodes to search (in which case there is no solution). A very elegant idea indeed: all of the following algorithms incorporate this principle, and as a result they are much better than randomly trying to guess a path. Where the algorithms differ is in how they choose which node in the frontier to search next (in computer science terms, which data structure they use to implement it).
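
To make the frontier idea concrete, here is a minimal sketch of this generic search procedure in Python. The adjacency-dict graph representation and the take_from_front flag are assumptions made purely for illustration; the single line that removes a node from the frontier is the only place where the algorithms in this lesson really differ.

from collections import deque

def frontier_search(graph, start, goal, take_from_front=True):
    """Generic frontier-based search. `graph` maps each node to a list of neighbours."""
    frontier = deque([start])   # nodes we can reach but have not yet expanded
    visited = {start}           # every node that has ever been added to the frontier
    while frontier:
        # The only real decision: which node of the frontier to expand next.
        node = frontier.popleft() if take_from_front else frontier.pop()
        if node == goal:
            return True                          # a path from start to goal exists
        for neighbour in graph.get(node, []):
            if neighbour not in visited:
                visited.add(neighbour)
                frontier.append(neighbour)       # the frontier moves outwards by one step
    return False                                 # frontier exhausted: the goal is unreachable

Taking nodes from the front of the frontier (a queue) gives Breadth-First Search; taking them from the back (a stack) gives Depth-First Search.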


Breadth-First Search

Breadth-First Search (BFS) explores the search space level by level. Think of it like looking for someone in a building by checking every room on the first floor, then every room on the second floor, and so on. BFS finds the shortest path in terms of number of steps (provided all steps have the same cost). Due to this property, BFS is excellent if each node in the search space has only a few connections (the number of connections a node has is called its branching factor). It follows that the opposite is also true: because the number of nodes at depth d grows as b^d for a branching factor b, BFS becomes computationally intractable for problems with large branching factors.

Implementation:

  1. Start with the starting node in the queue.
  2. Remove the first node from the queue; if it is the goal, stop.
  3. Add all of its unvisited neighbours to the end of the queue.
  4. Repeat from step 2 until the goal is found or the queue is empty.
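
A minimal sketch of these steps in Python (assuming, as in the sketch above, that the graph is a dict mapping each node to a list of neighbours); it also records each node's parent so that the shortest path can be reconstructed once the goal is found:

from collections import deque

def bfs_path(graph, start, goal):
    """Return a shortest path (fewest edges) from start to goal, or None if unreachable."""
    frontier = deque([start])
    parent = {start: None}          # also doubles as the visited set
    while frontier:
        node = frontier.popleft()   # FIFO queue: shallowest unexpanded node first
        if node == goal:
            path = []               # follow the parent pointers back to the start
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for neighbour in graph.get(node, []):
            if neighbour not in parent:
                parent[neighbour] = node
                frontier.append(neighbour)
    return None

graph = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}
print(bfs_path(graph, "A", "D"))    # ['A', 'B', 'D']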

Complexity:

  • Time: O(b^d), where b is the branching factor and d is the depth of the shallowest goal.
  • Space: O(b^d), since an entire level of the search tree may need to be held in the frontier at once.

Depth-First Search

Conversely, Depth-First Search (DFS) explores a maze by following one path as far as possible before backtracking. Whilst it does not necessarily find the shortest path, it is often faster (and more practical) than BFS. DFS works best in graphs with relatively few nodes but many connections, since it only needs to remember the current path rather than an entire level of the graph. Sometimes it gets stuck searching at the bottom of the search tree for long periods of time; however, in practice this is often preferable to being stuck near the top, as BFS can be.

Implementation:

  1. Start with the starting node on the stack.
  2. Remove the most recently added node from the stack; if it is the goal, stop.
  3. Add all of its unvisited neighbours to the top of the stack.
  4. Repeat from step 2 until the goal is found or the stack is empty.

Complexity:

  • Time: O(b^m), where m is the maximum depth of the search tree.
  • Space: O(bm), smaller than BFS.

Variants:

In practice, DFS can be modified to prevent it getting stuck in a rabbit hole at the bottom of the search tree. This can be achieved by limiting its maximum search depth, as the sketch below shows:

  • Depth-Limited Search: ordinary DFS, except that nodes beyond a fixed depth limit are never expanded.
  • Iterative Deepening Search: runs depth-limited search repeatedly with limits 0, 1, 2, ..., combining the low memory use of DFS with the completeness of BFS.
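
As an illustration, here is a sketch of a recursive depth-limited search and an iterative-deepening wrapper around it (the adjacency-dict graph representation and the function names are assumptions for this example only):

def depth_limited_search(graph, node, goal, limit, path=None):
    """DFS that never goes more than `limit` edges below `node`.
    Cycle checking is done along the current path only."""
    if path is None:
        path = [node]
    if node == goal:
        return path
    if limit == 0:
        return None                 # depth limit reached: abandon this branch
    for neighbour in graph.get(node, []):
        if neighbour not in path:   # avoid walking in circles
            result = depth_limited_search(graph, neighbour, goal, limit - 1, path + [neighbour])
            if result is not None:
                return result
    return None

def iterative_deepening_search(graph, start, goal, max_depth=50):
    """Run depth-limited search with limits 0, 1, 2, ... until one succeeds.
    The first limit that succeeds yields a shortest path, while memory use stays as low as plain DFS."""
    for limit in range(max_depth + 1):
        result = depth_limited_search(graph, start, goal, limit)
        if result is not None:
            return result
    return None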


Uniform Cost Search

Uniform Cost Search (UCS) is like BFS but takes cost into account (that is, when the graph we are searching has weighted edges). Whilst you might understandably argue that this makes UCS an informed search algorithm, this is not the case, since the weights do not guide the search towards the goal in the way that a heuristic evaluation function would. UCS works by always expanding the node with the lowest total path cost so far. If you are familiar with Dijkstra's algorithm, UCS is essentially the same thing, except that the goal of UCS is to find the shortest path to a single goal node, whereas Dijkstra's finds the shortest path to every node in the graph. Like Dijkstra's algorithm, UCS will always find the cheapest path so long as the step costs are non-negative.

Implementation:

  1. Start with the starting node in a priority queue, with a path cost of zero.
  2. Remove the node with the lowest total path cost from the queue; if it is the goal, stop.
  3. Add all of its unvisited neighbours to the queue, each with its total path cost from the start.
  4. Repeat from step 2 until the goal is found or the queue is empty.
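
A minimal sketch of these steps using Python's heapq module as the priority queue (the representation of the graph as a dict of (neighbour, step cost) lists is assumed for illustration):

import heapq

def uniform_cost_search(graph, start, goal):
    """graph maps each node to a list of (neighbour, step_cost) pairs."""
    frontier = [(0, start, [start])]    # priority queue ordered by total path cost
    explored = set()
    while frontier:
        cost, node, path = heapq.heappop(frontier)    # cheapest path found so far
        if node == goal:
            return cost, path
        if node in explored:
            continue                                  # a cheaper path to this node was already expanded
        explored.add(node)
        for neighbour, step_cost in graph.get(node, []):
            if neighbour not in explored:
                heapq.heappush(frontier, (cost + step_cost, neighbour, path + [neighbour]))
    return None                                       # no path exists

graph = {"A": [("B", 1), ("C", 4)], "B": [("C", 1), ("D", 5)], "C": [("D", 1)], "D": []}
print(uniform_cost_search(graph, "A", "D"))           # (3, ['A', 'B', 'C', 'D'])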

Complexity:

  • Time: varies with the edge costs; in the worst case roughly O(b^(1 + C*/ε)), where C* is the cost of the optimal solution and ε is the smallest step cost.
  • Space: high, for the same reason as BFS (the whole frontier must be stored).


Bidirectional Search

Unlike the other three algorithms, Bidirectional Search searches (typically using BFS) from both the start node and the goal node at the same time. In other words, it is equivalent to performing two searches simultaneously, except that the algorithm terminates when one search reaches a node that has already been visited by the other. This greatly reduces the number of nodes that must be explored, which is a significant advantage, although it naturally comes at the cost of increased memory usage, since both frontiers must be kept in memory. One significant disadvantage of Bidirectional Search is that we must know the goal state explicitly (and be able to generate a node's predecessors) in order to search backwards from it.

Implementation:

  1. Start two searches: one frontier containing the start node and one containing the goal node.
  2. Expand the two frontiers alternately, keeping a separate visited set for each.
  3. Stop as soon as a node appears in both visited sets; the two partial paths are then joined at that node to form the full path.
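
A minimal sketch of a bidirectional BFS in Python (assuming an undirected graph given as an adjacency dict, so that searching backwards from the goal is straightforward; the helper join_paths is just for this example):

from collections import deque

def bidirectional_search(graph, start, goal):
    """graph maps each node to a list of neighbours (assumed undirected)."""
    if start == goal:
        return [start]
    queues = {"fwd": deque([start]), "bwd": deque([goal])}       # one frontier per direction
    parents = {"fwd": {start: None}, "bwd": {goal: None}}        # one visited/parent map per direction
    while queues["fwd"] and queues["bwd"]:
        for side in ("fwd", "bwd"):                              # expand the two searches alternately
            node = queues[side].popleft()
            for neighbour in graph.get(node, []):
                if neighbour not in parents[side]:
                    parents[side][neighbour] = node
                    queues[side].append(neighbour)
                    other = "bwd" if side == "fwd" else "fwd"
                    if neighbour in parents[other]:              # the two searches have met
                        return join_paths(parents, neighbour)
    return None                                                  # one frontier emptied: no path exists

def join_paths(parents, meeting_node):
    """Stitch the forward and backward half-paths together at the meeting node."""
    path = []
    node = meeting_node
    while node is not None:                  # walk back to the start
        path.append(node)
        node = parents["fwd"][node]
    path.reverse()
    node = parents["bwd"][meeting_node]
    while node is not None:                  # walk forward to the goal
        path.append(node)
        node = parents["bwd"][node]
    return path

graph = {"A": ["B"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C"]}
print(bidirectional_search(graph, "A", "D"))                     # ['A', 'B', 'C', 'D']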

Complexity:

  • Time: O(b^(d/2)), since each search only needs to reach roughly half the depth of the solution.
  • Space: high, since both frontiers and both visited sets must be kept in memory.


Comparison:

Algorithm     | Completeness | Optimality       | Time Complexity | Space Complexity | Efficient Use Case
BFS           | Yes          | Yes (equal cost) | O(b^d)          | O(b^d)           | Shortest path in unweighted graph.
DFS           | No           | No               | O(b^m)          | O(bm)            | Deep search, low memory.
UCS           | Yes          | Yes              | Varies          | High             | Cheapest path in weighted graph.
Bidirectional | Yes          | Yes              | O(b^(d/2))      | High             | Known start and goal, large search space.

Conclusion

Uninformed search strategies might seem basic, but they form the backbone of AI search theory. They work even when you know nothing about the problem and form the foundation for more advanced algorithms like A*. You may ask why such primitive algorithms are even considered AI. This is because, despite their simple nature, they are excellent at certain types of problem solving. The example often associated with this is the Tower of Hanoi problem, where the goal is to move a stack of increasingly large disks from one peg to another, with the constraint that a larger disk can never be placed on a smaller one. While trivial for a small number of disks, the solution grows rapidly more complex as the number of disks increases. However, if we consider the possible states of the game as nodes in a graph, with an edge between two states whenever a single legal move transforms one into the other, then the puzzle becomes a graph search problem, and an uninformed search algorithm can find the sequence of moves that solves it (relatively quickly, too). This complex behaviour emerging from such simple algorithms is why they are considered AI.
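
As a closing illustration, here is a minimal sketch of that idea (assuming a representation where each peg is a tuple of disk sizes listed bottom to top; the function names are just for this example). The states of the puzzle form the graph, and plain BFS finds an optimal sequence of moves:

from collections import deque

def hanoi_moves(state):
    """Yield every state reachable from `state` by one legal move."""
    for src in range(3):
        if not state[src]:
            continue
        disk = state[src][-1]                    # top disk of the source peg
        for dst in range(3):
            if dst != src and (not state[dst] or state[dst][-1] > disk):
                pegs = [list(p) for p in state]
                pegs[src].pop()
                pegs[dst].append(disk)
                yield tuple(tuple(p) for p in pegs)

def solve_hanoi(n):
    """BFS over the state graph; returns the minimum number of moves."""
    start = (tuple(range(n, 0, -1)), (), ())
    goal = ((), (), tuple(range(n, 0, -1)))
    frontier = deque([(start, 0)])
    visited = {start}
    while frontier:
        state, depth = frontier.popleft()
        if state == goal:
            return depth
        for nxt in hanoi_moves(state):
            if nxt not in visited:
                visited.add(nxt)
                frontier.append((nxt, depth + 1))

print(solve_hanoi(4))    # 15, i.e. 2**4 - 1 moves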