Čech cohomology and illusions
Čech cohomology and illusions
This post builds off of Roger Penrose's article On the Cohomology of Impossible Figures, in which he shows a remarkable connection betweeen impossible drawings, such as the Penrose triangle, and nontrivial cohomology groups. In simple terms, the fact these perceptual illusions work is due to an obstruction preventing one from synthesizing the local information in the figure into a consistent global construct; this is precisely what cohomology measures. We give a brief introduction to Čech cohomology and show how a well-known auditory illusion, the Shepard tone, works by the same principles discussed by Penrose.
Čech cohomology
Let be a topological space and be an open cover of . We can build a simplicial complex out of by imagining the open sets as "thickened" vertices, as follows. Let be the set of all -tuples for , which we call vertices or -simplices. Intuitively, two vertices and bound a "line segment" or a -simplex if and have nonempty intersection. Continuing, we say that three -simplices , , bound a "triangle" or a -simplex if . In general, we may define the set of -simplices
We call the nerve complex of . Define partial boundary maps by
where signifies removal of an open set. As a direct consequence of our construction, every -simplex is "bounded" by three -simplices , , and , which are in turn bounded a total of six -simplices (counting duplicates), as demonstrated below.
Notice that each -simplex is counted twice, since the -simplices bounding join together at the vertices. This suggests that by appropriately giving each simplex in the boundary a "sign coefficient" and summing them together, we may make the double boundary of a simplex vanish. To formalize this idea, we consider the space of Čech -chains formed by all finite formal combinations of simplices with integer coefficients. Then we may define (oriented) boundary maps
One can then check that these choice of coefficients correctly eliminates the duplicate second boundaries. In other words, , so forms a chain complex.
Čech cochains
Recall that a presheaf of abelian groups on a topological space is a functor . In other words, assigns to every open set an abelian group of sections, and for every inclusion of open sets, there is a restriction morphism which takes a section over and restricts it to a section over . A simple example of a presheaf of abelian groups is given by the sheaf of continuous real-valued functions on a space . Here, addition of sections is just given by pointwise addition of functions, and restriction is given by the usual defintion of function restriction.
Given a presheaf of abelian groups , we may define a Čech -cochain with coefficients in to be a map which assigns each -simplex to a section , where is the intersection of all open sets in the simplex. This extends to a homomorphism
which we also refer to as a -cochain. These form an abelian group and between these we have similarly defined coboundary maps
where denotes the restriction of any of the above sections to the common support . To define this for all integers, for we set . As before, one can also check that , so these groups form a cochain complex. We say that cochains in the image of are -coboundaries and cochains in the kernel of are -cocycles. Then the quotient
is the th Čech cohomology group of with respect to the open cover and values in . It will be non-trivial when there exist -cocycles which are not equal to the coboundary of any -cocyle.
We can describe low-dimensional cocycles and coboundaries as follows. A -simplex is just determined by an open set , and so a -cochain just assigns a section in over each open set . The coboundary of a -cochain is given by the -cochain
In other words, compares the sections over and on their common intersection . By definition, the condition for to be a cocycle is that , which states that the sections given by must all agree on their common intersections. In principle, this means that they can be "glued" together to form a global section on (however, since is only a presheaf, we do not know that such an object is actually a section). On the other hand , so there are no nontrivial -coboundaries. It follows that when is a sheaf, we have .
A -simplex is given by a pair of intersecting open sets, so a -cochain assigns for each pair a section in . The coboundary of a -cochain is given by the -cochain
Intuitively, this evaluates on the "triangle" formed by taking the boundary of the -simplex . If is a -cocyle, then this vanishes, which implies that evaluating on the "path" formed by -simplices and is the same as evaluating on the path from to . In other words,
which is commonly called the cocycle condition.
Example. Let be a manifold and be a real line bundle on with a trivializing open cover . Then for each there exists a chart , and for any two indices such that , the transition map is of the form
where is a continuous function. One may check that these functions satisfy the cocycle condition where defined. Hence, the -cochain , with values in the sheaf of continuous -valued functions, defined by is a -cocycle.
If and is the Möbius strip, then we may cover the circle by two intervals and , such that their intersection becomes a disjoint union of two intervals and . Moreover, we may choose charts such that on and on , with the change in sign representing the twist throughout the strip.
Suppose, for the sake of contradiction, that for some -cochain . Then, without loss of generality, we can take , which implies is positive on and negative on . By continuity, this implies that must vanish at some point in , which is disallowed as . It follows that is nontrivial.
Shepard tones
A Shepard tone is an auditory illusion consisting of a periodic tone whose pitch appears to rise or fall indefinitely. The illusions works by exploiting the manner in which we perceive pitch color from a combination of raw frequencies.
A pure tone is simply a sinusoidal pressure wave of the form . Perceptually, pure tones with frequencies and are thought of as having the same "color". In common musical terms, we say that the two pure tones are separated by an octave. For example, in standard tuning the note is defined to be , and the note , which is at , sounds the "same", even though they have different frequencies. This partitions the set of pure frequencies, denoted by , into pitch classes , which we denote by the quotient .
In nature, a pure tone by itself is almost never encountered. Instead, they are often produced along with a set of overtones, which have frequencies at rational multiples of a lowest fundamental frequency. Our brains detect this fundamental frequency, which determines a single pitch color for the tone, despite being made up of a combination of different frequencies. A Shepard tone consists of an infinite superposition of tones separated by octaves (with decaying amplitudes), so that there is no one true fundamental frequency; such a tone can not be canonically modeled by any point in . Despite this, the brain still makes an abitrary choice, lifting a point to a frequency . A continuously changing Shepard tone is lifted to a continuous curve .
The ambiguity group
Write and let be the periodic Shepard tone given by . If is an open interval, then a partial lift is completely determined by its value on a single point . If and are two lifts of , then for all ,
In other words, the ambiguity in the lift of is determined by a single integer. Two observers listening to may perceive two different frequency curves, but they must "differ" by an integer. This is what Penrose refers to as the ambiguity group, which acts on the set possible observations, or lifts, of . Formally, since is a principal -bundle, each fiber is a homogeneous -space, for which subtraction is defined. Then the ambiguity group of an open set is defined to be
For example, if is the disjoint union of two intervals, then , since a lift is determined by a single point in each interval. If a lift exists over , then is the set of locally constant functions . Assigning the zero group for , this forms a presheaf of abelian groups on .
Cocycles and coboundaries
Let be an open covering of by subintervals such that there do not exist any triple intersections consisting of distinct open sets. For example, we may cover by two intervals, as done in the previous example. Now, imagine that a listener focuses on each interval separately, locally observing a lift (which does not necessarily have to agree with neighboring lifts). This defines a -cochain given by . Since every -simplex can be assumed to be of the form without loss of generality, must be a cocycle, since
Intuitively, this condition says that a listener can locally adjust parts of their observations on and by an element of the ambiguity group so that they agree. A natural question is if a listener can adjust each whole observation on (i.e, not just on the intersections) by elements of the ambiguity groups so that the lifts all agree. Such an adjustment corresponds to a -cochain with the property that
whenever and intersect. It follows that
Hence, the individual lifts can be made to agree if and only if is a coboundary. Perceptually, this would imply that one could listen to the entire periodic Shepard tone and observe a genuine periodic "melody" . Of course, this is not the case, which can be seen intuitively by observation, or by the fact that is a nontrivial principal bundle, and hence does not admit any global sections.
Alternatively, one may directly show is a nonzero element of the first Čech cohomology group . For example, if is a covering of the circle by two intervals and , then the intersection consists of two disjoint intervals, so . One can check that is of the form , while is of the form , implying that cannot be coboundary.
Other illusions
The fact that the Shepard tone works as an illusion can be seen as a consequence of the fact that is nontrivial. In essence this is the same obstruction described by Penrose for the tribar, whose paradoxical nature is connected to the fact that is nontrivial. As suggested in the conclusion of Penrose's article, it may be possible to construct more interesting visual and auditory illusions by considering more complicated ambiguity groups (for example, the Necker cube and ).