The topic of this post came up during a conversation with some physicists about the fractional quantum Hall effect (which is quite fascinating, but I don’t feel particularly qualified to discuss). I have decided to set it down here in the hope that, as long as I have an internet-capable device with me, I won’t have to rederive it in front of people again. Some of this material appears in Apostol’s Modular functions and Dirichlet series in number theory and Conway’s The sensual form. I’d be happy to hear about other good treatments.
For each positive integer , the Farey sequence is the increasing sequence of rationals in with denominator at most . For example:
- .
It is a standard exercise in basic problem-solving classes to prove that they have the following two remarkable properties:
- Two rationals in the unit interval are neighbors in some Farey sequence if and only if they satisfy .
- If and are neighbors in the Farey sequence , then they will remain neighbors in successive Farey sequences until they are separated by the fraction in the sequence .
These properties are typically proved using direct algebraic methods, but I’d like to describe a way to look at them geometrically. The geometric context is provided by Ford circles. Given a pair of coprime integers , the Ford circle is the circle of radius centered at in the complex plane (except when , where I will decree that it is the line together with an additional point called ). There is a minor ambiguity in identifying circles, since and are the same Ford circle. If we ignore the infinite case, the circle is tangent to the real line at the rational point , and each rational number is contained in a unique circle.
There is an immediate connection between Ford circles and Farey fractions: the Farey sequence is in bijection with the set of Ford circles that are tangent to the real line on the interval and have radius at least . A less immediate connection is that Ford circles only intersect at tangent points (whose locations can be explicitly computed). We end up with the following geometric interpretation of the two properties of Farey sequences:
- If we have rationals , then is nonempty (and indeed a singleton) if and only if . That is, Farey neighbors correspond precisely to tangent pairs of Ford circles. Here, we adopt the convention that fractions are in lowest terms, and negative signs never appear in denominators.
- If the Ford circles and are tangent to each other, then the Ford circle is the unique circle that is tangent to the real line and the other two Ford circles.
The purpose of this post is to point out that these properties (and many more) follow straightforwardly from a natural action of the group , which we call , on the set of Ford circles. Even though the properties I described are proved with relatively short calculations, I think it doesn’t hurt to have a broader organizing principle in mind.
Recall that is made out of integer matrices satisfying . This is a group under matrix multiplication, and it has the notable property that its rows and columns are made out of coprime pairs of integers. It also acts on the complex upper half-plane by Möbius transformations: yields the transformation . One has the two distinguished generators , and . That is, any element of can be made by composing a word made from these two elements and their inverses. We can say that acts by Translation , while Spins the upper half-plane around by a distorted half-rotation: (this really is a half-rotation if you use the Cayley transformation to turn the half-plane to a disc).
Claim 1: The set of Ford circles has a transitive action of by Möbius transformations. In particular, given a matrix , the corresponding Möbius transformation takes the infinite Ford circle to the Ford circle .
Proof: There are many ways to prove the second sentence, and I will say more general things about transforming circles and lines at the end of this post. Here, it is probably easiest to verify directly: Apply the Möbius transformation to points to get , and check that the resulting points lie in . To show that this map from the line (plus the point at infinity) to the circle is a bijection, you can check that the derivative is nonvanishing, and note that image points approach the real axis as becomes large. To prove the first sentence, we note that by Euclid’s algorithm, any coprime pair of integers admits a pair such that . This implies all Ford circles lie in the -orbit of .
QED
By applying the claim to the transformation , we find that takes to . Note that the line is (setwise) stabilized by the infinite group , and the other circles are stabilized by conjugates.
In the proof of Claim 1, I gave a direct calculational basis for the fact that Möbius transformations take circles and lines to circles and lines. There are other explanations, for example using elementary inversive geometry, but I would be interested to see a solution that avoids calculation altogether. Another interesting question is: If instead of the direct definition we used, we were to define the Ford circles recursively by demanding that they are tangent to circles with smaller denominator, why should we expect their radii to depend only on the denominators? I only know how to motivate this using the group action.
Claim 2: The action of on the set of Ford circles induces an action on the set of ordered pairs of Ford circles, preserving .
Proof: The vectors and generate a subgroup of , and the corresponding quotient of has area equal to the index of the subgroup. We need to show that this area is preserved by the action and is equal to when finite. For the first part, we use the previous claim, where we saw that the action of on Ford circles induces an action on the corresponding integer row vectors by right multiplication, and the induced action on preserves area. The second part follows from the standard theory of cross-products. QED
Now we can prove the Ford circle versions of the claims:
- We wish to show that for , the set of coprime integer pairs satisfying and is precisely the set of pairs satisfying . By radius considerations, all Ford circles tangent to have the form for some integer , and by Claim 1, there exists that takes to . Therefore, is tangent to if and only if it is the image of some under this transformation. By Claim 2, this holds if and only if . The absolute value sign can be removed, since we have chosen a suitable orientation.
- We wish to show that if and are tangent to each other and to the real line, then is tangent to both of them. Since the conclusion is symmetric with respect to switching the circles and changing signs, we may assume that the corresponding fractions have positive denominator and that . Then produces the following maps . The claim then follows from the fact that is tangent to at and to at .
There are several useful corollaries to the use of group symmetry. For example, from the fact that , we can immediately conclude that if , then . We can also see that if , then this point of intersection lies on the semicircle whose diameter is the real interval , since the semicircle in question is the image of the positive imaginary ray under the transformation . There is also a connection to continued fractions: Given a real number , we can decree that a rational number is a good approximation of if . The set of good rational approximations to corresponds to the set of Ford circles that intersect the line nontrivially. The sequence of circles hit by the line as one approaches the from above correspond precisely to the convergents of the signed continued fraction expansion of . The signed continued fraction expansion of a convergent yields its expansion in terms of the generators .
See also the Stern-Brocot tree.
http://en.wikipedia.org/wiki/Stern%E2%80%93Brocot_tree
Thank you for pointing out the Stern-Brocot tree. I had ignored it mostly because I couldn’t think of a good way to fit it in to the discussion of symmetry. Although it has a lot of nice structure, it is missing the group action that you see with Ford circles.
If you take the orbit of under the action of the free submonoid of generated by and , you produce all of the positive rationals, but the structure you get from left-multiplication is the Calkin-Wilf tree. In order to get the Stern-Brocot tree, you may use the same monoid, but you have to describe the action by a kind of insertion: Given a word in the monoid, the corresponding tree element is the rational corresponding to the vector , and the descendents are and . The distinction is essentially the monoid version of right versus left actions of groups on Cayley graphs (or perhaps, Cayley graphs of opposite groups).
For the Stern-Brocot tree, the right-multiplication of a generator corresponds to adding 1 to the end of an unsigned continued fraction expansion of an entry. For Ford circles, one may also right-multiply words in by generators, but this corresponds to operations on signed continued fractions.
The book “Motif in Mathematics”
http://www.amazon.com/Motif-Mathematics-History-Application-Sequence/dp/1453810579/ref=sr_1_1?ie=UTF8&qid=1319175746&sr=8-1
is full of interesting facts and stories about the Farey series.
The modern story goes back to the question posed by Mr. J. May jnr. of Amsterdam in the 1747 Ladies Diary
Click to access mayjnr.pdf
PS If you want to read Ford’s original book/paper
Click to access ford.pdf
http://www.maths.ed.ac.uk/~aar/papers/ford2.djvu
Thank you Andrew. Those are very interesting links.
Thanks. Incidentally, Ford was an American who worked at Edinburgh (my own university) 1914-1917
http://www-history.mcs.st-andrews.ac.uk/Biographies/Ford.html
I haven’t had time to read it, but might this recent article in the American Mathematical Monthly touch on similar ideas?
Ian Short’s paper is available on the Arxiv http://arxiv.org/abs/0912.1997
Nice post! One small comment, to make matrices in line look smaller you can try using the “smallmatrix” command. For example, \left(\begin{smallmatrix}1 & 0\\ 1 & 1\end{smallmatrix}\right) produces .
I apologize if you were already aware of this.
There’s a $\Gamma$ that’s missing that accursed ‘latex’ needed to make it work.
Alex Youcis and John Baez: Thanks for the corrections! I think I have caught everything this time.
As to why we should expect the radii only to depend on the denominators, you might want to look at the kissing circles theorem.
The undergraduate project on the Farey sequence which I supervised in 2011-2012 is available from http://www.maths.ed.ac.uk/~aar/fareyproject.pdf