Buckets:

|
download
raw
21.4 kB

Understanding networks and their behaviors using sheaf theory

(Invited Paper)

Michael Robinson
Mathematics and Statistics
American University
Washington, DC, USA
michaelr@american.edu

Abstract—Many complicated network problems can be easily understood on small networks. Difficulties arise when small networks are combined into larger ones. Fortunately, the mathematical theory of sheaves was constructed to address just this kind of situation; it extends locally-defined structures to globally valid inferences by way of consistency relations. This paper exhibits examples in network monitoring and filter hardware where sheaves have useful descriptive power.

Index Terms—sheaf; FIR filter; network monitoring; topological filter

I. INTRODUCTION

This paper serves as an invitation to the signal processing community to think about how consistency between pieces of local information serves to drive global inferences. Because sheaves are the mathematical tool for manipulating local data, they could be useful for solving certain signal processing problems. For example, this article shows that a sampling theory for sheaves can lead to an inference methodology for monitoring water distribution networks. In a rather different context, we show that sheaves are a natural model of signal processing hardware.

Sheaves were introduced by Leray during World War II to study partial differential equations. They remained enshrined within “pure” mathematics, where they enjoyed a central role in Grothendieck’s algebraic geometry programme. (See the foreword by C. Houzel in [1] for a clear discussion of the early history of sheaves.) An early departure from this abstract setting to a more concrete, computational setting occurred in [2], which treated sheaves over posets. However, cellular sheaves introduced by Shepard [3], are usually more natural for engineering contexts. These ideas lay dormant until a recent flurry of activity [4], [5], [6], [7].

II. THE MAIN IDEAS OF SHEAF THEORY

A sheaf is a mathematical datastructure for storing local information over a topological space. This article addresses sheaves over abstract simplicial complexes, for which the theory is minimally complicated. (The interested reader is encouraged to explore [8] for a more extensive – but readable – introduction to sheaves over cellular spaces.)

Definition 1. An abstract simplicial complex over a set $A$ is a collection $X$ of (possibly ordered) subsets of $A$ , for which $x \in X$ implies that every subset of $x$ is also in $X$ . We call each $x \in X$ with $k+1$ elements a $k$ -face of $X$ , referring to the number $k$ as its dimension. Zero dimensional faces (singleton subsets of $A$ ) are called vertices, and one dimensional faces are called edges. We say that a face $a$ includes into a face $b$ (written $a \rightsquigarrow b$ ) whenever $a$ is a proper subset of $b$ .

Example 2. A graph $G = (V, E)$ with vertices $V = {v_1, \dots}$ and edges $E = {e_1, \dots}$ can be given the structure of a simplicial complex $X = {{v_1}, \dots, e_1, \dots}$ . Observe that if ${v_1, v_2}$ is an edge in $X$ , then ${v_1}, {v_2} \in X$ automatically by this construction.

Given a simplicial complex, a sheaf is merely the assignment of vector-valued data to each face that is compatible with the inclusions of faces.

Definition 3. A sheaf $\mathcal{F}$ on an abstract simplicial complex $X$ consists of the assignment of

    1. a vector space $\mathcal{F}(a)$ to each face $a$ of $X$ (called the stalk at $a$ ), and
    1. a linear map $\mathcal{F}(a \rightsquigarrow b) : \mathcal{F}(a) \rightarrow \mathcal{F}(b)$ (called the restriction along $a \rightsquigarrow b$ ) to each inclusion of faces $a \rightsquigarrow b$ , so that
    1. $\mathcal{F}(b \rightsquigarrow c) \circ \mathcal{F}(a \rightsquigarrow b) = \mathcal{F}(a \rightsquigarrow c)$ whenever $a \rightsquigarrow b \rightsquigarrow c$ .

Definition 4. Suppose $X$ is the abstract simplicial complex model of $\mathbb{R}$ whose vertices are given by the set of integers and whose edges are given by pairs ${n, n+1}$ . The $N$ -term grouping sheaf $\mathcal{V}^{(N)}$ is given by the diagram written over $X$

VN1σVNσ+VN1σVNσ+\longrightarrow V^{N-1} \xleftarrow{\sigma_-} V^N \xrightarrow{\sigma_+} V^{N-1} \xleftarrow{\sigma_-} V^N \xrightarrow{\sigma_+}

where $\sigma_{\pm}$ are the $(N-1) \times N$ matrices

σ=(0I(N1)×(N1)) and σ+=(I(N1)×(N1)0).\sigma_- = \begin{pmatrix} 0 & I_{(N-1) \times (N-1)} \end{pmatrix} \text{ and } \sigma_+ = \begin{pmatrix} I_{(N-1) \times (N-1)} & 0 \end{pmatrix}.

The $N$ -term grouping sheaf represents the time evolution of the contents of a $N$ -word shift register. The underlying simplicial complex structure for this sheaf represents a discrete timeline, with each vertex representing a timestep. The stalkof $\mathcal{V}^{(N)}$ at each vertex is $V^N$ , which represents the contents of the shift register at that timestep.

Example 5. To see how this works, consider a 3-word shift register that stores integers. Suppose that at time 0 it stores the vector $(1, 1, 9)$ , and at time 1 it loads a 2 into the last slot so that it contains $(1, 9, 2)$ . Observe that the $\sigma_{\pm}$ maps defined above compute what is preserved during the transition from time 0 to time 1: $\sigma_+(1, 9, 2) = (1, 9) = \sigma_-(1, 1, 9)$ .

The example shows that local consistency between values from the stalks of a sheaf leads to information that is consistent more globally. This motivates the definition of a section of a sheaf.

Definition 6. Suppose that $\mathcal{F}$ is a sheaf on an abstract simplicial complex $X$ . A global section $s$ assigns a value $s(a) \in \mathcal{F}(a)$ to each $a \in X$ so that for each inclusion $a \rightsquigarrow b$ of faces, $\mathcal{F}(a \rightsquigarrow b)s(a) = s(b)$ . The set of global sections forms a vector space $\Gamma\mathcal{F}$ .

Although sheaves are useful descriptors, it is more important to focus on sheaf morphisms, which are ways to manipulate sheaves.

Definition 7. A sheaf morphism $f : \mathcal{F} \rightarrow \mathcal{G}$ of sheaves on an abstract simplicial complex $X$ assigns a linear map $f_a : \mathcal{F}(a) \rightarrow \mathcal{G}(a)$ to each face $a$ so that for every inclusion $a \rightsquigarrow b$ of faces of $X$ , $f_b \circ \mathcal{F}(a \rightsquigarrow b) = \mathcal{G}(a \rightsquigarrow b) \circ f_a$ .

Sheaf morphisms play an important role in models of signal processing because they transform global sections of one sheaf into those of another.

Lemma 8. A sheaf morphism $f : \mathcal{F} \rightarrow \mathcal{G}$ induces a linear map $f_* : \Gamma\mathcal{F} \rightarrow \Gamma\mathcal{G}$ .

III. NETWORK MONITORING

The flow of contaminants through a water distribution network is usually assessed by measurements taken at specific locations. These measurements are fundamentally local procedures, since they only collect a sample in the immediate vicinity of a location and time. Because of this, we construct a sheaf that models the concentration of water-borne contaminants in a network of channels whose connection graph is represented by a directed graph $X$ . The edges of $X$ are labelled with a real-valued function $R$ which represents the volume flow rate of water along that edge. In order to be physically reasonable, $R$ must satisfy a conservation law: at each vertex, the total flow rate in must equal the total flow rate out. Additionally, we assume that any contaminant present in the inputs to a vertex is perfectly mixed at the vertex and flows out with the same concentration along all outgoing edges.

Definition 9. The concentration sheaf $\mathcal{C}$ on $X$ with flow rate $R$ is given by

    1. $\mathcal{C}(v) = \mathbb{R}^n$ over each vertex $v$ with in-degree $n$ , and
    1. $\mathcal{C}(e) = \mathbb{R}$ over each edge $e$ .

The restriction from a vertex $v$ to the $i$ -th incoming edge $e_i^+$ is given by the projection $\text{pr}_i$ onto the $i$ -th component of $\mathcal{C}(v)$ .

Because of the perfect mixing assumptions, the restriction from a vertex $v$ to any outgoing edge $e^-$ is given by the formula

(C(ve))(c1,,cn)=j=1ncjR(ej+)j=1nR(ej+).(1)(\mathcal{C}(v \rightsquigarrow e^-))(c_1, \dots, c_n) = \frac{\sum_{j=1}^n c_j R(e_j^+)}{\sum_{j=1}^n R(e_j^+)}. \quad (1)

Global sections of a concentration sheaf represent self-consistent values of contaminant concentration over the entire network.

Example 10. Consider a network that distributes water from a main to several delivery points. The concentration sheaf of this network is given by the diagram

where water flows from left-to-right.

By inspection, the space of sections is determined uniquely by measurement at any one of the delivery points. Therefore, the contaminant concentration at the source can be recovered from a measurement at any of the delivery points. Because of this reason, it is easy to ensure safe drinking water in a sealed distribution system (and track contaminations to their source) using endpoint checks.

If concentration measurements are taken at vertices in the network $X$ , they ought to be self-consistent. This suggests that the process of taking measurements in a network is represented by a sheaf morphism.

Definition 11. A sampling morphism of a concentration sheaf $\mathcal{C}$ on $X$ is a morphism $m : \mathcal{C} \rightarrow \mathcal{M}$ to a sheaf on $X$ which is the zero map on each edge and is surjective on the stalks over edges. The assignment $\mathcal{A}(a) = \ker m_a$ defines another sheaf, called the ambiguity sheaf over $X$ . The restrictions of the ambiguity sheaf are given by the restrictions in $\mathcal{C}$ , but restricted to the stalks of $\mathcal{A}$ .

The consistent specification of concentrations throughout the network is a global section of the concentration sheaf $\mathcal{C}$ , and a collection of measurements is a global section of the sheaf $\mathcal{M}$ . If the map induced on global sections by a sampling morphism (Lemma 8) is one-to-one, each global section of $\mathcal{M}$ corresponds to at most one global section of $\mathcal{C}$ . In this case, the samples contain sufficient information to reconstruct the concentrations over the entire network.

Although the induced map on global sections can be examined directly, it is helpful to have an alternate way to ensure that sufficiently many samples have been collected. This characterization is provided by the following sampling theorem, which is valid for all networks (see [7] for more examples).Theorem 12. If the ambiguity sheaf for a sampling morphism $m : \mathcal{C} \rightarrow \mathcal{M}$ has no nontrivial global sections, then the set of measurements completely specify all concentrations in the network.

Proof: (Sketch) A well-known algebraic construction called the Snake Lemma shows that the kernel of the induced map $m_* : \Gamma\mathcal{C} \rightarrow \Gamma\mathcal{M}$ is given by the global sections of the ambiguity sheaf. Therefore, if the ambiguity sheaf has no nontrivial global sections, then the induced map $m_*$ is one-to-one. ■

Example 13. Consider the concentration sheaf given by the diagram

RidRpr1prnRnMRidRprnpr1RidR\begin{array}{ccccc} \mathbb{R} & \xrightarrow{\text{id}} & \mathbb{R} & & \\ & \nearrow \text{pr}_1 & & \nwarrow \text{pr}_n & \\ & \vdots & \mathbb{R}^n & \xrightarrow{M} & \mathbb{R} \xleftarrow{\text{id}} \mathbb{R} \\ & \nwarrow \text{pr}_n & & \nearrow \text{pr}_1 & \\ \mathbb{R} & \xrightarrow{\text{id}} & \mathbb{R} & & \end{array}

in which the water moves left-to-right, and $M$ is given by the $1 \times n$ matrix representation of Equation (1). The sheaf associated to measuring the concentrations at the delivery point has the diagram

0id0idid0id00Ridid0id0\begin{array}{ccccc} 0 & \xrightarrow{\text{id}} & 0 & & \\ & \nearrow \text{id} & & \nwarrow \text{id} & \\ & \vdots & 0 & \xrightarrow{\text{id}} & 0 \xleftarrow{0} \mathbb{R} \\ & \nwarrow \text{id} & & \nearrow \text{id} & \\ 0 & \xrightarrow{\text{id}} & 0 & & \end{array}

The two sheaves are linked by a morphism, which is zero on all faces except the rightmost vertex, where it is an identity. On the other hand, the ambiguity sheaf has the diagram

RidRpr1prnRnMR00prnpr1RidR\begin{array}{ccccc} \mathbb{R} & \xrightarrow{\text{id}} & \mathbb{R} & & \\ & \nearrow \text{pr}_1 & & \nwarrow \text{pr}_n & \\ & \vdots & \mathbb{R}^n & \xrightarrow{M} & \mathbb{R} \xleftarrow{0} 0 \\ & \nwarrow \text{pr}_n & & \nearrow \text{pr}_1 & \\ \mathbb{R} & \xrightarrow{\text{id}} & \mathbb{R} & & \end{array}

The space of global sections for the ambiguity sheaf (also the kernel of the induced map of the sampling morphism) has dimension $n - 1$ , since global sections are parameterized by the subspace of $\mathbb{R}^n$ on which $M$ vanishes. This means that it is impossible to determine which input branch is responsible for the presence of a contaminant on the output.

This example indicates why environmental management is a difficult problem. Endpoint measurement cannot assign

Fig. 1. Schematic diagram of a FIR LTI filter

blame to polluters in water collection networks, but can be so used in water distribution networks. Since industrial areas are often along rivers with the topology of a water collection network, their nominal cooperation ensures compliance with environmental regulations.

IV. LINEAR TRANSLATION-INVARIANT FILTERS

Discrete linear translation-invariant filters are prevalent in modern signal processing systems, because they are particularly easy to implement. If the impulse response of a filter is $N$ timesteps long, then the filter can be constructed using an $N$ -word shift register as shown in Figure 1.

Definition 14. Let $l^\infty(V)$ be the vector space of bounded sequences of elements of a vector space $V$ . A discrete linear translation-invariant filter (LTI filter) is a linear operator $F : l^\infty(V) \rightarrow l^\infty(V)$ given by

(F(x))n=k=h(k)xnk,(F(x))_n = \sum_{k=-\infty}^{\infty} h(k)x_{n-k},

where $h : \mathbb{Z} \rightarrow \mathbb{R}$ is called the impulse response of the filter. If the support of $h$ is a finite set, we say that $F$ has finite impulse response (FIR).

Theorem 15. Every FIR LTI filter $F$ arises as the composition of linear maps $F = \lambda_ \circ p_^{-1} : \Gamma\mathcal{S}_1 \rightarrow \Gamma\mathcal{S}_3$ induced on global sections by a pair of sheaf morphisms

S1pS2λS3.\mathcal{S}_1 \xleftarrow{p} \mathcal{S}_2 \xrightarrow{\lambda} \mathcal{S}_3.

In this diagram, the invertible linear map $p_ : \Gamma\mathcal{S}_2 \rightarrow \Gamma\mathcal{S}1$ is induced by $p$ , and the map induced by $\lambda$ is $\lambda* : \Gamma\mathcal{S}_2 \rightarrow \Gamma\mathcal{S}_3$ .*

This theorem has a clear interpretation in terms of the typical hardware implementation shown in Figure 1. The global sections of $\mathcal{S}_1$ are precisely the possible input sequences, the global sections of $\mathcal{S}_3$ correspond to the output sequences, and the global sections of $\mathcal{S}_2$ correspond to the contents of the shift register. The proof of the theorem is a rather explicit construction, which outlines the evolution of these three timeseries.

Proof: Let $\mathcal{S}_1$ and $\mathcal{S}_3$ both be copies of $\mathcal{V}^{(1)}$ , the 1-term grouping sheaf. The global sections of this sheaf are simply timeseries of data present in the input and output registers of the filter. Suppose that the impulse response of $F$ is of length $N$ and let $\mathcal{S}_2 = \mathcal{V}^{(N)}$ be the sheaf that represents the contents of the filter's shift register.We construct the morphisms $p$ and $\lambda$ as the vertical arrows in the following diagram

0V0prNVN1VNσ+VN1L0V0\begin{array}{ccccccc} & \longrightarrow & 0 & \longleftarrow & V & \longrightarrow & 0 \longleftarrow \\ & & \uparrow & & \uparrow \text{pr}_N & & \uparrow \\ \longrightarrow & V^{N-1} & \longleftarrow & V^N & \xrightarrow{\sigma_+} & V^{N-1} & \longleftarrow \\ & & \downarrow & & \downarrow L & & \downarrow \\ & \longrightarrow & 0 & \longleftarrow & V & \longrightarrow & 0 \longleftarrow \end{array}

in which the first row is the diagram of $\mathcal{S}_1$ , the second row represents $\mathcal{S}_2$ , and the third row represents $\mathcal{S}_3$ . The maps $\text{pr}_N$ and $L$ are given by the formulas

prN(x1,,xN)=xN, and L(x1,,xN)=k=0N1h(k)xNk.\text{pr}_N(x_1, \dots, x_N) = x_N, \text{ and } L(x_1, \dots, x_N) = \sum_{k=0}^{N-1} h(k)x_{N-k}.

It remains to verify that the induced map $\lambda_* \circ p_^{-1}$ is the same as $F$ . Consider a sequence $x = (\dots, x_0, x_1, \dots) \in l^\infty(V)$ , which can be encoded as a global section $s_1$ of $\mathcal{S}_1$ by specifying that the value of the section over the vertex $n$ is $x_n$ , and the value over every edge is 0. This corresponds to a global section $s_2$ of $\mathcal{S}2$ , in which the value over each vertex is a sequence of $N$ consecutive terms of $x$ , and the value over each edge is a sequence of $N-1$ consecutive terms. Observe that the map $p$ induced by the morphism $p$ satisfies the equation $p_*(s_2) = s_1$ , which moreover is one-to-one. Therefore, with a slight abuse of notation, we write $p_^{-1}(x) = s_2$ . The map $\lambda_$ induced by the morphism $\lambda$ clearly computes weighted sums of adjacent blocks of $N$ consecutive terms of $x$ , whence

L(x1,,xN)=k=0N1h(k)xNk=k=h(k)xNk=(F(x))N.\begin{aligned} L(x_1, \dots, x_N) &= \sum_{k=0}^{N-1} h(k)x_{N-k} \\ &= \sum_{k=-\infty}^{\infty} h(k)x_{N-k} \\ &= (F(x))_N. \end{aligned}

Example 16. Consider the FIR LTI filter whose impulse response is zero except for three consecutive terms, all equal to $1/3$ . If this filter is presented with the input sequence $\dots, 1, 1, 9, 2, \dots$ , it will produce the output sequence $\dots, 2.7, 2.3, 3.7, 4, \dots$ . The encoding described by Theorem 15 can be organized into the diagram

02090(9,2)(1,9,2)(1,9)(1,1,9)(1,1)0403.70\begin{array}{ccccccc} 0 & \longleftarrow & 2 & \longrightarrow & 0 & \longleftarrow & 9 \longrightarrow 0 \\ & & \uparrow & & \uparrow & & \uparrow \\ (9, 2) & \longleftarrow & (1, 9, 2) & \longrightarrow & (1, 9) & \longleftarrow & (1, 1, 9) \longrightarrow (1, 1) \\ & & \downarrow & & \downarrow & & \downarrow \\ 0 & \longleftarrow & 4 & \longrightarrow & 0 & \longleftarrow & 3.7 \longrightarrow 0 \end{array}

Remark 17. The benefit of using sheaf morphisms to describe filters is that they can treat a number of additional cases. For

instance, each of the following are straightforward generalizations, requiring no additional theoretical work to construct:

    1. Infinite impulse response filters can be constructed simply by extending the definition of $\mathcal{V}^{(N)}$ to treat spaces of sequences instead of finite-dimensional vectors.
    1. Nonlinear, block processing filters can be constructed by modifying the component maps of the morphism $\lambda$ to be nonlinear functions. For instance, constant false alarm rate (CFAR) detectors can be encoded in this way.
    1. If $G$ is a finitely-generated group that acts on $X$ , then $G$ -equivariant simplicial maps can be used to generalize $\mathcal{V}^{(N)}$ to other simplicial complexes $X$ . This permits extensions of Theorem 15 to treat images, video, and more complex discrete datasets.

V. CONCLUSION AND DISCUSSION

Sheaf theory offers a more general framework for extending key ideas (filtering and sampling) used in signal processing. Sheaf theoretic examples of engineering problems often map directly to existing implementations. Because of this, the use of sheaf theoretic tools requires minimal change in engineering perspective.

In the near term, we can expect that sheaf morphisms will come to the forefront of their respective engineering applications. A more careful analysis of the morphisms that describe network measurements will probably result in tighter bounds on sampling requirements in general spaces with more general data than is possible with traditional methods. Similarly, it is easy to extend the construction in Theorem 15 to sheaves over different spaces. However, the capabilities of the resulting topological filters are essentially unexplored.

ACKNOWLEDGEMENT

This work was partly supported under Federal Contract No. FA9550-09-1-0643.

REFERENCES

  1. [1] M. Kashiwara and P. Schapira, Sheaves on manifolds. Springer, 1990.
  2. [2] K. Baclawski, "Whitney numbers of geometric lattices," Adv. in Math., vol. 16, pp. 125–138, 1975.
  3. [3] A. Shepard, "A cellular description of the derived category of a stratified space," Ph.D. dissertation, Brown University, 1980.
  4. [4] J. Lilius, "Sheaf semantics for Petri nets," Helsinki University of Technology, Digital Systems Laboratory, Tech. Rep., 1993.
  5. [5] R. Ghrist and Y. Hiraoka, "Applications of sheaf cohomology and exact sequences to network coding," preprint, 2011.
  6. [6] M. Robinson, "Asynchronous logic circuits and sheaf obstructions," Electronic Notes in Theoretical Computer Science, pp. 159–177, 2012.
  7. [7] —, "The Nyquist theorem for cellular sheaves," in Sampling Theory and Applications (SampTA), 2013.
  8. [8] J. Curry, "Sheaves, cosheaves and applications, arxiv:1303.3255," 2013.

Xet Storage Details

Size:
21.4 kB
·
Xet hash:
f55397a8d00f09938739b1480480d51c8a624265091aeafe233e66edf336890b

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.