12. Graphs

In this chapter, we study two representations of graphs and basic algorithms that use these representations.

Mathematically, a (directed) graph is a pair $ G=(V,E)$ where $ V$ is a set of vertices and $ E$ is a set of ordered pairs of vertices called edges. An edge $ \ensuremath{(\ensuremath{\mathit{i}},\ensuremath{\mathit{j}})}$ is directed from $ \ensuremath{\ensuremath{\mathit{i}}}$ to $ \ensuremath{\ensuremath{\mathit{j}}}$; $ \ensuremath{\ensuremath{\mathit{i}}}$ is called the source of the edge and $ \ensuremath{\ensuremath{\mathit{j}}}$ is called the target. A path in $ G$ is a sequence of vertices $ v_0,\ldots,v_k$ such that, for every $ i\in\{1,\ldots,k\}$, the edge $ (v_{i-1},v_{i})$ is in $ E$. A path $ v_0,\ldots,v_k$ is a cycle if, additionally, the edge $ (v_k,v_0)$ is in $ E$. A path (or cycle) is simple if all of its vertices are unique. If there is a path from some vertex $ v_i$ to some vertex $ v_j$ then we say that $ v_j$ is reachable from $ v_i$. An example of a graph is shown in Figure 12.1.

Figure 12.1: A graph with twelve vertices. Vertices are drawn as numbered circles and edges are drawn as pointed curves pointing from source to target.
\includegraphics[scale=0.90909]{figs-python/graph}

Due to their ability to model so many phenomena, graphs have an enormous number of applications. There are many obvious examples. Computer networks can be modelled as graphs, with vertices corresponding to computers and edges corresponding to (directed) communication links between those computers. City streets can be modelled as graphs, with vertices representing intersections and edges representing streets joining consecutive intersections.

Less obvious examples occur as soon as we realize that graphs can model any pairwise relationships within a set. For example, in a university setting we might have a timetable conflict graph whose vertices represent courses offered in the university and in which the edge $ \ensuremath{(\ensuremath{\mathit{i}},\ensuremath{\mathit{j}})}$ is present if and only if there is at least one student that is taking both class $ \ensuremath{\ensuremath{\mathit{i}}}$ and class $ \ensuremath{\ensuremath{\mathit{j}}}$. Thus, an edge indicates that the exam for class $ \ensuremath{\ensuremath{\mathit{i}}}$ should not be scheduled at the same time as the exam for class $ \ensuremath{\ensuremath{\mathit{j}}}$.

Throughout this section, we will use $ \ensuremath{\ensuremath{\mathit{n}}}$ to denote the number of vertices of $ G$ and $ \ensuremath{\ensuremath{\mathit{m}}}$ to denote the number of edges of $ G$. That is, $ \ensuremath{\ensuremath{\ensuremath{\mathit{n}}}}=\vert V\vert$ and $ \ensuremath{\ensuremath{\ensuremath{\mathit{m}}}}=\vert E\vert$. Furthermore, we will assume that $ V=\{0,\ldots,\ensuremath{\ensuremath{\ensuremath{\mathit{n}}}}-1\}$. Any other data that we would like to associate with the elements of $ V$ can be stored in an array of length $ \ensuremath{\ensuremath{\ensuremath{\mathit{n}}}}$.

Some typical operations performed on graphs are:

Note that these operations are not terribly difficult to implement efficiently. For example, the first three operations can be implemented directly by using a USet, so they can be implemented in constant expected time using the hash tables discussed in Chapter 5. The last two operations can be implemented in constant time by storing, for each vertex, a list of its adjacent vertices.

However, different applications of graphs have different performance requirements for these operations and, ideally, we can use the simplest implementation that satisfies all the application's requirements. For this reason, we discuss two broad categories of graph representations.



Subsections
opendatastructures.org