The topic of this post came up during a conversation with some physicists about the fractional quantum Hall effect (which is quite fascinating, but I don’t feel particularly qualified to discuss). I have decided to set it down here in the hope that, as long as I have an internet-capable device with me, I won’t have to rederive it in front of people again. Some of this material appears in Apostol’s Modular functions and Dirichlet series in number theory and Conway’s The sensual form. I’d be happy to hear about other good treatments.
For each positive integer , the Farey sequence
is the increasing sequence of rationals in
with denominator at most
. For example:
.
It is a standard exercise in basic problem-solving classes to prove that they have the following two remarkable properties:
- Two rationals
in the unit interval are neighbors in some Farey sequence if and only if they satisfy
.
- If
and
are neighbors in the Farey sequence
, then they will remain neighbors in successive Farey sequences until they are separated by the fraction
in the sequence
.
These properties are typically proved using direct algebraic methods, but I’d like to describe a way to look at them geometrically. The geometric context is provided by Ford circles. Given a pair of coprime integers , the Ford circle
is the circle of radius
centered at
in the complex plane (except when
, where I will decree that it is the line
together with an additional point called
). There is a minor ambiguity in identifying circles, since
and
are the same Ford circle. If we ignore the infinite case, the circle
is tangent to the real line at the rational point
, and each rational number is contained in a unique circle.
There is an immediate connection between Ford circles and Farey fractions: the Farey sequence is in bijection with the set of Ford circles that are tangent to the real line on the interval
and have radius at least
. A less immediate connection is that Ford circles only intersect at tangent points (whose locations can be explicitly computed). We end up with the following geometric interpretation of the two properties of Farey sequences:
- If we have rationals
, then
is nonempty (and indeed a singleton) if and only if
. That is, Farey neighbors correspond precisely to tangent pairs of Ford circles. Here, we adopt the convention that fractions are in lowest terms, and negative signs never appear in denominators.
- If the Ford circles
and
are tangent to each other, then the Ford circle
is the unique circle that is tangent to the real line and the other two Ford circles.
The purpose of this post is to point out that these properties (and many more) follow straightforwardly from a natural action of the group , which we call
, on the set of Ford circles. Even though the properties I described are proved with relatively short calculations, I think it doesn’t hurt to have a broader organizing principle in mind.
Recall that is made out of integer matrices
satisfying
. This is a group under matrix multiplication, and it has the notable property that its rows and columns are made out of coprime pairs of integers. It also acts on the complex upper half-plane by Möbius transformations:
yields the transformation
. One has the two distinguished generators
, and
. That is, any element of
can be made by composing a word made from these two elements and their inverses. We can say that
acts by Translation
, while
Spins the upper half-plane around
by a distorted half-rotation:
(this really is a half-rotation if you use the Cayley transformation to turn the half-plane to a disc).
Claim 1: The set of Ford circles has a transitive action of by Möbius transformations. In particular, given a matrix
, the corresponding Möbius transformation takes the infinite Ford circle
to the Ford circle
.
Proof: There are many ways to prove the second sentence, and I will say more general things about transforming circles and lines at the end of this post. Here, it is probably easiest to verify directly: Apply the Möbius transformation to points to get
, and check that the resulting points lie in
. To show that this map from the line (plus the point at infinity) to the circle is a bijection, you can check that the derivative is nonvanishing, and note that image points approach the real axis as
becomes large. To prove the first sentence, we note that by Euclid’s algorithm, any coprime pair of integers
admits a pair
such that
. This implies all Ford circles lie in the
-orbit of
.
QED
By applying the claim to the transformation , we find that
takes
to
. Note that the line
is (setwise) stabilized by the infinite group
, and the other circles are stabilized by conjugates.
In the proof of Claim 1, I gave a direct calculational basis for the fact that Möbius transformations take circles and lines to circles and lines. There are other explanations, for example using elementary inversive geometry, but I would be interested to see a solution that avoids calculation altogether. Another interesting question is: If instead of the direct definition we used, we were to define the Ford circles recursively by demanding that they are tangent to circles with smaller denominator, why should we expect their radii to depend only on the denominators? I only know how to motivate this using the group action.
Claim 2: The action of on the set of Ford circles induces an action on the set of ordered pairs
of Ford circles, preserving
.
Proof: The vectors and
generate a subgroup of
, and the corresponding quotient of
has area equal to the index of the subgroup. We need to show that this area is preserved by the
action and is equal to
when finite. For the first part, we use the previous claim, where we saw that the action of
on Ford circles induces an action on the corresponding integer row vectors by right multiplication, and the induced action on
preserves area. The second part follows from the standard theory of cross-products. QED
Now we can prove the Ford circle versions of the claims:
- We wish to show that for
, the set of coprime integer pairs
satisfying
and
is precisely the set of pairs satisfying
. By radius considerations, all Ford circles tangent to
have the form
for some integer
, and by Claim 1, there exists
that takes
to
. Therefore,
is tangent to
if and only if it is the image of some
under this transformation. By Claim 2, this holds if and only if
. The absolute value sign can be removed, since we have chosen a suitable orientation.
- We wish to show that if
and
are tangent to each other and to the real line, then
is tangent to both of them. Since the conclusion is symmetric with respect to switching the circles and changing signs, we may assume that the corresponding fractions have positive denominator and that
. Then
produces the following maps
. The claim then follows from the fact that
is tangent to
at
and to
at
.
There are several useful corollaries to the use of group symmetry. For example, from the fact that , we can immediately conclude that if
, then
. We can also see that if
, then this point of intersection lies on the semicircle whose diameter is the real interval
, since the semicircle in question is the image of the positive imaginary ray
under the transformation
. There is also a connection to continued fractions: Given a real number
, we can decree that a rational number
is a good approximation of
if
. The set of good rational approximations to
corresponds to the set of Ford circles
that intersect the line
nontrivially. The sequence of circles hit by the line as one approaches the
from above correspond precisely to the convergents of the signed continued fraction expansion of
. The signed continued fraction expansion of a convergent
yields its expansion in terms of the generators
.
See also the Stern-Brocot tree.
http://en.wikipedia.org/wiki/Stern%E2%80%93Brocot_tree
Thank you for pointing out the Stern-Brocot tree. I had ignored it mostly because I couldn’t think of a good way to fit it in to the discussion of symmetry. Although it has a lot of nice structure, it is missing the group action that you see with Ford circles.
If you take the orbit of
under the action of the free submonoid of
generated by
and
, you produce all of the positive rationals, but the structure you get from left-multiplication is the Calkin-Wilf tree. In order to get the Stern-Brocot tree, you may use the same monoid, but you have to describe the action by a kind of insertion: Given a word
in the monoid, the corresponding tree element is the rational corresponding to the vector
, and the descendents are
and
. The distinction is essentially the monoid version of right versus left actions of groups on Cayley graphs (or perhaps, Cayley graphs of opposite groups).
For the Stern-Brocot tree, the right-multiplication of a generator corresponds to adding 1 to the end of an unsigned continued fraction expansion of an entry. For Ford circles, one may also right-multiply words in
by generators, but this corresponds to operations on signed continued fractions.
The book “Motif in Mathematics”
http://www.amazon.com/Motif-Mathematics-History-Application-Sequence/dp/1453810579/ref=sr_1_1?ie=UTF8&qid=1319175746&sr=8-1
is full of interesting facts and stories about the Farey series.
The modern story goes back to the question posed by Mr. J. May jnr. of Amsterdam in the 1747 Ladies Diary
Click to access mayjnr.pdf
PS If you want to read Ford’s original book/paper
Click to access ford.pdf
http://www.maths.ed.ac.uk/~aar/papers/ford2.djvu
Thank you Andrew. Those are very interesting links.
Thanks. Incidentally, Ford was an American who worked at Edinburgh (my own university) 1914-1917
http://www-history.mcs.st-andrews.ac.uk/Biographies/Ford.html
I haven’t had time to read it, but might this recent article in the American Mathematical Monthly touch on similar ideas?
Ian Short’s paper is available on the Arxiv http://arxiv.org/abs/0912.1997
Nice post! One small comment, to make matrices in line look smaller you can try using the “smallmatrix” command. For example, \left(\begin{smallmatrix}1 & 0\\ 1 & 1\end{smallmatrix}\right) produces
.
I apologize if you were already aware of this.
There’s a $\Gamma$ that’s missing that accursed ‘latex’ needed to make it work.
Alex Youcis and John Baez: Thanks for the corrections! I think I have caught everything this time.
As to why we should expect the radii only to depend on the denominators, you might want to look at the kissing circles theorem.
The undergraduate project on the Farey sequence which I supervised in 2011-2012 is available from http://www.maths.ed.ac.uk/~aar/fareyproject.pdf