A Panoramic Overview of Inter-universal Teichm¨uller Theorymotizuki/Panoramic...A Panoramic...
Transcript of A Panoramic Overview of Inter-universal Teichm¨uller Theorymotizuki/Panoramic...A Panoramic...
A Panoramic Overview of Inter-universal Teichmuller
Theory
By
Shinichi Mochizuki∗
Abstract
Inter-universal Teichmuller theory may be described as a sort of arithmetic version
of Teichmuller theory that concerns a certain type of canonical deformation associated to
an elliptic curve over a number field and a prime number l ≥ 5. We begin our survey of inter-
universal Teichmuller theory with a review of the technical difficulties that arise in applying
scheme-theoretic Hodge-Arakelov theory to diophantine geometry. It is precisely the
goal of overcoming these technical difficulties that motivated the author to construct the non-
scheme-theoretic deformations that form the content of inter-universal Teichmuller theory.
Next, we discuss generalities concerning “Teichmuller-theoretic deformations” of various
familiar geometric and arithmetic objects which at first glance appear one-dimensional, but in
fact have two underlying dimensions. We then proceed to discuss in some detail the various
components of the log-theta-lattice, which forms the central stage for the various construc-
tions of inter-universal Teichmuller theory. Many of these constructions may be understood to
a certain extent by considering the analogy of these constructions with such classical results
as Jacobi’s identity for the theta function and the integral of the Gaussian distribu-
tion over the real line. We then discuss the “inter-universal” aspects of the theory, which
lead naturally to the introduction of anabelian techniques. Finally, we summarize the main
abstract theoretic and diophantine consequences of inter-universal Teichmuller theory, which
include a verification of the ABC/Szpiro Conjecture.
Contents
§1. Hodge-Arakelov-theoretic Motivation
§2. Teichmuller-theoretic Deformations
§3. The Log-theta-lattice
§4. Inter-universality and Anabelian Geometry
2000 Mathematics Subject Classification(s): Primary: 14H25; Secondary: 14H30.Key Words: elliptic curve, number field, theta function, hyperbolic curve, anabelian geometry,ABC Conjecture, Szpiro Conjecture.
∗RIMS, Kyoto University, Kyoto 606-8502, Japan.e-mail: [email protected]
2 Shinichi Mochizuki
§ 1. Hodge-Arakelov-theoretic Motivation
The inter-universal Teichmuller theory developed in [IUTchI], [IUTchII], [IUTchIII],
[IUTchIV] arose from attempts by the author, which began around the summer of
2000, to overcome certain technical obstructions in the scheme-theoretic Hodge-
Arakelov theory of [HASurI], [HASurII] to applying this theory to diophantine geom-
etry [cf. the discussion of [HASurI], §1.5.1; [HASurII], Remark 3.7; [IUTchI], Remark
4.3.1]. Thus, we begin our overview of inter-universal Teichmuller theory with a brief
review of the essential content of those aspects of scheme-theoretic Hodge-Arakelov
theory that are relevant to the development of inter-universal Teichmuller theory.
Let l be a prime number. Then we recall that the module E[l] of l-torsion points
of a Tate curve Edef= Gm/qZ [over, say, a p-adic field or the complex field C], whose
q-parameter we denote by q, fits into a natural exact sequence
0 −→ μl −→ E[l] −→ Z/lZ −→ 0.
That is to say, one has canonical objects as follows:
a “multiplicative subspace” μl ⊆ E[l] and “generators” ±1 ∈ Z/lZ.
In the following discussion, we fix an elliptic curve E over a number field F
and a prime number l ≥ 5. For convenience, we shall use the notation
l�def= l−1
2 .
Also, we suppose that E has stable reduction at all nonarchimedean primes of F . Then,
in general, the module E[l] [i.e., more precisely, the finite etale group scheme over F ] of
l-torsion points of E does not admit
a “global multiplicative subspace” or “global canonical generators”
— i.e., a rank one submodule M ⊆ E[l] or a pair of generators of the quotient E[l]/M
that coincide with the above canonical multiplicative subspace and generators at all
nonarchimedean primes of F where E has bad multiplicative reduction. Nevertheless,
let us
suppose that such global objects do in fact exist!
Also, let us write
Kdef= F (E[l])
for the extension field of F generated by the fields of definition of the l-torsion points
of E, V(K) for the set of [archimedean and nonarchimedean] valuations of K, and
A Panoramic Overview of Inter-universal Teichmuller Theory 3
V(K)bad ⊆ V(K) for the set of nonarchimedean valuations where E has bad multiplica-
tive reduction. For v ∈ V(K), we shall write Kv for the completion of K at v, Ov ⊆ Kv
for the subset of elements f ∈ Kv such that |f |v ≤ 1, and mv ⊆ Ov for the subset of
elements f ∈ Kv such that |f |v < 1. Let N ⊆ E[l] be a rank one submodule such that
the natural morphism of finite group schemes over F
M × N∼→ E[l]
is an isomorphism. Then the Fundamental Theorem of Hodge-Arakelov Theory may be
formulated as follows [cf. [HASurI], §1; [HASurII], §1, §3; the explicit series representa-
tion of the theta function on a Tate curve given in [EtTh], Proposition 1.4]:
Theorem 1.1. (“Idealized” Version of Fundamental Theorem of Hodge-
Arakelov Theory) We maintain the notation of the above discussion. In particular,
we assume the existence of a global multiplicative subspace and global canonical
generators [as described above] for E[l]. Write ∗E def= EK/N for the elliptic curve over
K obtained by forming the quotient of EKdef= E×F K by N ; ∗E† → ∗E for the universal
vectorial extension of ∗E; L for the line bundle of degree one on ∗E determined by some
nontrivial F -rational 2-torsion point of ∗E; L|∗E† for the restriction of L to ∗E†. Then
restriction of sections of L|∗E† of relative degree [i.e., relative to ∗E† → ∗E] < l — a
condition which we shall denote by means of a superscript “< l” — to MKdef= M ×F K
via the composite inclusion MK ↪→ EK [l] → ∗E[l], followed by application of a suitable
theta trivialization of the restriction of L to M yields functions on M ; consideration
of the Fourier coefficients of such functions determines an isomorphism of K-vector
spaces of dimension l
Γ(∗E†,L|∗E†)<l ∼→l�⊕
j=−l�
(qj2 · OK)⊗OK K (∗HA)
— where we write
qdef= {qv}v∈V(K)bad
for the collection of q-parameters of EK at v ∈ V(K)bad [so qv ∈ mv] and
qdef= {q
v}v∈V(K)bad
for some collection of 2l-th roots “qv= q
1/2lv ” of the qv; the direct summand labeled j
of the codomain of the isomorphism (∗HA) is to be understood as a copy of K which
we regard as being equipped with the integral structure obtained from the integral
structure given by the ring of integers OK ⊆ K by replacing, for each v ∈ V(K)bad, the
integral structure given by Ov by the integral structure given by qj2
v· Ov. The domain
4 Shinichi Mochizuki
of this isomorphism admits a natural Hodge filtration F−i, for i ∈ N, given by the
sections of relative degree ≤ i; the subquotients of this Hodge filtration admit natural
isomorphisms
F−i/F−i+1 ∼→ ω⊗(−i)∗E ⊗K F 0
— where we write ω∗E for the cotangent space at the origin of ∗E. The codomain of the
isomorphism (∗HA) admits a natural Galois action, i.e., an action by Gal(K/F ). Fi-
nally, this isomorphism (∗HA) of K-vector spaces is in fact compatible, up to relatively
mild discrepancies, with the natural integral structures (respectively, metrics) at
nonarchimedean (respectively, archimedean) elements of V(K); here, the integral struc-
ture of the codomain at nonarchimedean v ∈ V(K) is as described above.
Over the complement in Spec(OK) of V(K)bad, as well as at the archimedean
primes of K, the content of Theorem 1.1 is essentially equivalent to the content of the
[“non-idealized”] Fundamental Theorem of Hodge-Arakelov Theory given in [HASurI],
Theorem A; [HASurII], Theorem 1.1 [cf. the discussion preceding [HASurII], Defini-
tion 3.1] — i.e., “non-idealized” in the sense that it holds even in the absence of the
assumption of the existence of the global multiplicative subspace and global canonical
generators. On the other hand, the portion of Theorem 1.1 that concerns the integral
structures at the valuations ∈ V(K)bad follows immediately [i.e., in light of the theory
reviewed in [HASurI], §1] from the explicit series representation of the theta function
on a Tate curve given in [EtTh], Proposition 1.4.
One way to understand [both the “idealized” and “non-idealized” versions of] the
Fundamental Theorem of Hodge-Arakelov Theory is as a sort of scheme-theoretic
version of the classical computation of the Gaussian integral∫ ∞
−∞e−x2
dx =√π
by applying a coordinate transformation from cartesian to polar coordinates — cf.
the discussion of [IUTchII], Remark 1.12.5, (i). Indeed, the function “j → qj2
” — which
may be thought of as a sort of discrete version of the Gaussian distribution “e−x2
” —
appears quite explicitly in the codomain of the isomorphism (∗HA) of Theorem 1.1. On
the other hand, the value “√π” may be thought of as corresponding to the [negative]
tensor powers of the sheaf “ω” that arise in the subquotients of the Hodge filtration that
appear in the domain of this isomorphism. Indeed, if, in the domain of this isomorphism,
one omits the restriction “< l”, then one obtains a natural crystal — cf. the theory
of the “crystalline theta object” discussed in [HASurII], §2. Unlike the crystals
that typically arise in the case of the de Rham cohomology associated to a family of
varieties, which satisfy the property of Griffiths transversality, this crystal exhibits a
property that we refer to as Griffiths semi-transversality [cf. [HASurII], Theorem
A Panoramic Overview of Inter-universal Teichmuller Theory 5
2.8], i.e., the crystal corresponds to a connection whose application has the effect of
shifting the Hodge filtration [not by one, but rather] by two steps! This property of
Griffiths semi-transversality gives rise to a Kodaira-Spencer isomorphism [cf. [HASurII],
Theorem 2.10] between ω⊗2 and the restriction via the classifying morphism associated
to the elliptic curve under consideration of the sheaf of logarithmic differentials on the
moduli stack of elliptic curves — i.e., in effect, between ω and the “square root” of
this sheaf of logarithmic differentials. Here, it is useful to recall that this sheaf of
logarithmic differentials admits a canonical generator in a neighborhood of the cusp at
infinity of the moduli stack of elliptic curves, namely, the logarithmic differential q−1 ·dqof the q-parameter, which, if one thinks in terms of the classical complex theory and
integrates this logarithmic differential once over a loop surrounding the cusp at infinity,
has an associated period equal to 2π. This provides the justification for thinking of
“ω” as corresponding to “√π”. Finally, we recall that this relationship between the
Fundamental Theorem of Hodge-Arakelov Theory and the classical Gaussian integral
may be seen more explicitly, via the classical theory of Hermite polynomials, when this
theory is restricted to the archimedean primes of a number field via the “Hermite model”
[cf. the discussion of [HASurI], §1.1].In this context, it is also of interest to note that unlike many aspects of the classical
theory of theta functions, the Hodge-Arakelov theory discussed in [HASurI], [HASurII]
does not admit a natural generalization to the case of higher-dimensional abelian
varieties [cf. the discussion of [HASurI], §1.5.2]. This is perhaps not so surprising
in light of the analogy discussed above with the classical computation of the Gaussian
integral reviewed above, which does not admit, at least in any immediate, naive sense,
a generalization to higher dimension. This phenomenon of a lack of any immediate
generalization to higher dimension may also be seen in the theory of the etale theta
function developed in [EtTh], which plays a central role in inter-universal Teichmuller
theory. In the case of the etale theta function, this phenomenon is essentially a reflection
of the fact that, unlike the case with the complement of an ample divisor in an abelian
variety of arbitrary dimension, the complement of the origin in an elliptic curve may be
regarded as a hyperbolic curve, i.e., as an object for which there exists an extensive
and well-developed theory of anabelian geometry, which may be [and indeed is, in
[EtTh]] applied to obtain various important rigidity properties involving the etale theta
function [cf. the discussion of the Introduction to [EtTh]].
As discussed in [HASurI], Theorem A, the isomorphism of Theorem 1.1 may be
considered over the moduli stack of elliptic curves Mell over Z. Indeed, as dis-
cussed in [HASurI], §1, a proof of the characteristic zero portion of the isomorphism of
Theorem 1.1 [or [HASurI], Theorem A] may be given by considering the corresponding
map between vector bundles of rank l over (Mell)Qdef= Mell ×Z Q: That is to say, an
6 Shinichi Mochizuki
easy explicit computation involving the explicit series representation of the theta func-
tion on a Tate curve [cf., e.g., [EtTh], Proposition 1.4] shows that this map between
vector bundles on (Mell)Q is an isomorphism at the generic point of (Mell)Q. Then one
concludes that this map between vector bundles is in fact an isomorphism on (Mell)Q
by computing the degrees of the domain and codomain of the map under considera-
tion and observing that these two degrees coincide. Although the precise computation
of these degrees is rather involved, from the point of view of the present discussion it
suffices to observe that the highest order portions of the average degrees [i.e., where
by “average”, we mean the result of dividing by the rank l] of the domain [i.e., “LHS”]
and codomain [i.e., “RHS”] vector bundles on (Mell)Q of the map under consideration
coincide:
1l · LHS ≈ − 1
l ·l−1∑i=0
i · [ωE ] ≈ − l2 · [ωE ]
1l · RHS ≈ − 1
l2 ·l�∑j=1
j2 · [log(q)] ≈ − l24 · [log(q)] = − l
2 · [ωE ]
— where we write [ωE ] for the degree on (Mell)Q of the line bundle ωE [i.e., the line
bundle determined by the relative cotangent bundle at the origin of the tautological
semi-abelian scheme over (Mell)Q]; we write [log(q)] for the degree on (Mell)Q of the di-
visor at infinity of (Mell)Q; we recall the elementary fact [a consequence of the existence
of the modular form typically referred to as the “discrimant”] that [log(q)] = 12 · [ωE ].
In this context, we remark that the line bundle F 0 [i.e., the line bundle on (Mell)Q that
corresponds to the line bundle denoted “F 0” in Theorem 1.1] is “sufficiently small” that
it may be ignored, i.e., from the point of view of computing portions of highest order.
Now let us return to considering arithmetic vector bundles on Spec(OK) in the
context of Theorem 1.1. The Hodge filtration F−i is not compatible with the direct
sum decomposition of the codomain of the isomorphism (∗HA) of Theorem 1.1. This
fact may be derived, for instance, from the explicit series representation of the theta
function on a Tate curve [cf., e.g., [EtTh], Proposition 1.4] and is closely related to the
theory of the arithmetic Kodaira-Spencer morphism arising from this isomorphism
[cf. the discussion of [HASurII], §3, especially, [HASurII], Corollary 3.6]. In particular,
it follows that, for most j, by projecting to the factor labeled j of this direct sum
decomposition, one may construct a [nonzero!] morphism of arithmetic line bundles
[i.e., of rank one K-vector spaces that is compatible “from below” with the integral
structures at nonarchimedean primes and the metrics at archimedean primes — cf.,
e.g., the discussion of [GenEll], §1](OK ⊗OK
K ≈) F 0 ↪→ (qj2 · OK)⊗OK
K
— where we remark that, from the point of view of computations to highest order,
one may think of the arithmetic line bundle corresponding to F 0 as being, essentially
A Panoramic Overview of Inter-universal Teichmuller Theory 7
the trivial arithmetic line bundle, and one may ignore the “relatively mild discrepan-
cies” referred to in Theorem 1.1. Such a nonzero morphism of arithmetic line bundles
implies an inequality between arithmetic degrees “degarith(−)” of arithmetic line
bundles, i.e., if we write ΩlogM |E for the arithmetic line bundle on Spec(OK) determined
by restricting the line bundle of logarithmic differentials on Mell [relative to the log
structure determined by the divisor at infinity] via the classifying morphism associated
to EK , then16 · degarith(log(q)) = htE < constant
— where we write htEdef= 2 ·degarith(ωE) = degarith(Ω
logM |E) for the canonical height
of the elliptic curve EK and degarith(log(q)) for the arithmetic degree of the arithmetic
divisor on Spec(OK) determined by restricting the divisor at infinity of Mell via the
classifying morphism associated to EK [cf., e.g., the discussion of [GenEll], §1, §3]. Here,
the arithmetic degrees are to be understood as being normalized [i.e., by dividing by the
degree over Q of the number field under consideration — cf. the discussion of [GenEll],
§1] so as to be invariant with respect to the operation of passing to a finite extension of
the number field under consideration.
Before proceeding, we pause to discuss the meaning of the “constant” that appears
in the inequality of the preceding display. First of all, to simplify the discussion, we
assume that the complex moduli of the elliptic curve EK at the archimedean primes
of K are subject to the restriction that they are only allowed to vary within some
fixed compact subset of (Mell)Cdef= (Mell)×Z C that does not contain the divisor at
infinity. It follows from [GenEll], Theorem 2.1, that, from the point of view of verifying
the ABC Conjecture, this does not result in any essential loss of generality. Then it
follows from the discussion of the “etale integral structure” in [HASurI], §1 [cf. also
[HASurI], Theorem A], together with elementary estimates via Stirling’s formula, that
this “constant” is roughly of the order of log(l). In particular, if [as is done, for instance,
in [GenEll], §4; [IUTchIV], §2] one assumes that l is roughly of the order of htE , then,
by possibly enlarging the “constant” of the inequality under consideration,
one may assume without loss of generality — i.e., so long as one respects the
restrictions just imposed on the complex moduli and the size of l relative
to the height! — that this constant is, in fact, independent of the elliptic
curve E, the number field F , and the prime number l.
For a more precise statement, we refer to [GenEll], Lemma 3.5, where an inequality is
derived by assuming, in effect, only the existence of a global multiplicative subspace [i.e.,
without assuming the existence of global canonical generators!] for E[l]; moreover, the
proof of [GenEll], Lemma 3.5, is entirely elementary and does not require the use of the
scheme-theoretic Hodge-Arakelov theory surveyed in [HASurI], [HASurII]. Thus, one
8 Shinichi Mochizuki
may conclude — i.e., either from the above discussion of the inequality arising from
Theorem 1.1 or from [the argument of] [GenEll], Lemma 3.5 — that
if one bounds the degree [F : Q] of the number field F over Q and the
complex moduli of the elliptic curve E at the archimedean primes of F ,
then the assumption that E[l] admits a global multiplicative subspace and
global canonical generators [i.e., where, in fact, the latter is unnecessary, if
one applies [the argument of] [GenEll], Lemma 3.5, as is done, for instance, in
the proof of [IUTchIV], Corollary 2.2, (ii)] implies that there are only finitely
many possibilities for the j-invariant of E [cf. [GenEll], Propositions 1.4,
(iv); 3.4].
[Here, we remark in passing that the various hypotheses on the prime l that appear
in the statement of [GenEll], Lemma 3.5, are applied only to conclude that the cyclic
subgroup under consideration is “global multiplicative”. That is to say, if one assumes
from the start that this cyclic subgroup is “global multiplicative”, then the argument
of the proof of [GenEll], Lemma 3.5, may be applied without imposing any hypotheses
on the prime l.]
In this context, it is of interest to consider the following complex analytic ana-
logue of the above discussion. Let ES → S be a family of one-dimensional semi-abelian
varieties over a connected smooth proper algebraic curve S over C that restricts to a
family of elliptic curves EU → U over some nonempty open subscheme U ⊆ S. Write
EU → U for the complex analytic family of complex tori determined by EU → U .
Let U → U be a universal covering space of the Riemann surface U . Then the classify-
ing morphism of the family EU → U determines a holomorphic map φ : U → H to the
upper half-plane H which is well-defined up to composition with an automorphism of H
induced by the well-known action of SL2(Z) on H. Now one verifies immediately that
this map φ : U → H is either constant or has open dense image in H. In particular, if
φ is nonconstant, then it follows that every point in the boundary ∂H of H [i.e., where
we regard H as being embedded, in the usual way, in the complex projective line P1C]
lies in the closure of the image Im(φ) of φ. On the other hand, the natural complex
analytic analogue of the condition that there exist a global multiplicative subspace
in the sense of the above discussion concerning elliptic curves over number fields may
be formulated as follows:
The local system on U in rank two free Z-modules determined by the [abelian!]
fundamental groups of the fibers of EU → U admits a rank one subspace that
coincides with the subspace of the “complex Tate curve Gm/qZ” determined by
the fundamental group of Gm in every sufficiently small neighborhood of a point
of the Riemann suface S associated to S where the family EU → U degenerates.
A Panoramic Overview of Inter-universal Teichmuller Theory 9
Moreover, one verifies immediately that this condition is equivalent to the condition
that the intersection with ∂H of the closure of Im(φ) be equal to a single cusp [i.e.,
a single point of the SL2(Z)-orbit of the point at infinity of P1C]. In particular, one
concludes that this condition can only hold if φ is constant, i.e., if the original family of
elliptic curves EU → U is isotrivial.
Now let us return to our discussion of elliptic curves over number fields. Then,
roughly speaking, the above discussion may be summarized as follows:
The assumption of the existence of a global multiplicative subspace and global
canonical generators may be thought of as a sort of arithmetic analogue of the
geometric notion of an isotrivial family of elliptic curves and, moreover, implies
bounds on the height of the elliptic curve under consideration.
This state of affairs motivates the following question:
Is it possible to obtain bounds — perhaps weaker in some suitable sense!
— on the height of arbitrary elliptic curves over number fields, i.e., without
assuming the existence of a global multiplicative subspace or global canonical
generators?
Put another way, one would like to somehow carry out the above derivation of bounds
on the height from the isomorphism (∗HA) of Theorem 1.1 without assuming the exis-
tence of a global multiplicative subspace or global canonical generators. As discussed
in [HASurI], §1.5.1, if one tries to carry out the above derivation by applying the “non-
idealized” version of Theorem 1.1 [i.e., which holds without assuming the existence of
a global multiplicative subspace or global canonical generators!], then one must con-
tend with “Gaussian poles” — i.e., with a situation in which the critical morphism of
arithmetic line bundles that appears in the above discussion has poles of relatively large
order! — which have the effect of rendering the resulting inequality essentially vacuous.
That is to say, from the point of view of generalizing the derivation of bounds on the
height given above, the above question may be reinterpreted as follows:
Is it possible to somehow reformulate the scheme-theoretic Hodge-Arakelov
theory of [HASurI], [HASurII] in such a way that one may circumvent the tech-
nical obstruction constituted by the Gaussian poles?
It was this state of affairs that motivated the author in the summer of 2000 to initiate
the development of the inter-universal Teichmuller theory established in [IUTchI],
[IUTchII], [IUTchIII], [IUTchIV].
From an extremely naive point of view, the approach that underlies the development
of inter-universal Teichmuller theory may be described as follows:
10 Shinichi Mochizuki
Suppose that the assignment{qj
2}j=1,...,l�
→ q (∗KEY)
— that is to say, for each valuation v ∈ V(K)bad, one considers the assignment
{qj2v}j=1,...,l� → q
v— somehow determines an automorphism of the
number field K!
Before proceeding, we remark that the assignment (∗KEY) may be thought of as a
sort of tautological solution to the problem discussed above of resolving the technical
obstruction constituted by the Gaussian poles— i.e., in short, the problem of contending
with the absence of the domain of the assignment (∗KEY) in a situation in which the
presence of the codomain of the assignment (∗KEY) may be taken for granted.
This approach may seem somewhat far-fetched at first glance but is in fact mo-
tivated by the classical analogy between number fields and function fields: that is to
say, if one thinks of a typical 2l-th root of a q-parameter qvas being, roughly speak-
ing, like some positive power of a prime number p, and one thinks of, say, the rational
number field Q as corresponding to the one-dimensional function field Fp(t), i.e., so p
corresponds to the indeterminate t, then the assignment (∗KEY) considered above cor-
responds, roughly speaking [i.e., if one reverses the direction of “ →”], to an assignment
of the form
t → ta
for some positive integer a, i.e., an assignment which does indeed determine a homo-
morphism of fields Fp(t) ↪→ Fp(t). Also, in this context, it is of interest to observe
that the proof of the result quoted earlier [i.e., [GenEll], Lemma 3.5] to the effect that
the existence of a global multiplicative subspace implies bounds on the height proceeds
precisely by observing that the existence of a global multiplicative subspace implies
that the elliptic curve under consideration is isogenous to — i.e., in effect, has roughly
[that is to say, up to terms of relatively negligible order] the same height as — an el-
liptic curve whose q-parameters are precisely the l-th powers of the q-parameters of the
original elliptic curve [cf. the application of [GenEll], Lemma 3.2, (ii), in the proof of
[GenEll], Lemma 3.5]: that is to say, in short, the existence of a global multiplicative
subspace implies that one is in a situation that is invariant, so to speak, with respect
to the transformation “q → ql”! As discussed in the introduction to [GenEll], §3,this technique is, in essence, a sort of miniature, or simplified, version of a technique
which dates back to Tate for proving “Tate conjecture-type results” and may be seen,
for instance, in Faltings’ work on the Tate conjecture.
At any rate, if one assumes that one has such an automorphism of the number
field K, then since such an automorphism necessarily preserves degrees of arithmetic
A Panoramic Overview of Inter-universal Teichmuller Theory 11
line bundles, it follows — since the absolute value of the degree of the RHS of the
assignment (∗KEY) under consideration is “small” by comparison to the absolute value
of the [average!] degree of the LHS of this assignment — in a similar fashion to the
discussion of the derivation of a bound on the height from Theorem 1.1 that a similar
inequality, i.e., a bound on the height of EK , holds:
16 · degarith(log(q)) = htE < constant.
Of course, needless to say, such automorphisms of number fields do not in fact
exist!! On the other hand, the starting point of inter-universal Teichmuller theory lies
in adopting the following point of view:
We regard the “{qj2}” on the LHS and the “q” on the RHS of the assignment
(∗KEY) {qj
2}j=1,...,l�
→ q
as belonging to distinct copies of “conventional ring/scheme theory”,
i.e., “distinct arithmetic holomorphic structures”, and we think of the
assignment (∗KEY) as a sort of arithmetic version of the notion of a quasicon-
formal map between Riemann surfaces equipped with distinct holomorphic
structures.
That is to say, this approach allows us to realize the assignment (∗KEY), albeit at the
cost of partially dismantling conventional ring/scheme theory. On the other hand,
this approach requires us
to compute just how much of a distortion occurs
as a result of deforming conventional ring/scheme theory. This vast computation is
the essential content of inter-universal Teichmuller theory. In the remainder of the
present paper, we intend to survey the ideas surrounding this dismantling/deformation
of conventional ring/scheme theory.
We begin by observing that one way to approach the issue of understanding such
dismantling/deformation operations is to focus on that portion of the objects under
consideration which is invariant with respect to — i.e., “immune” to — the dis-
mantling/deformation operations under consideration. For instance, in the case of the
classical theory of quasiconformal maps between Riemann surfaces equipped with dis-
tinct holomorphic structures, the underlying real analytic surface of the Riemann
surfaces under consideration constitutes just such an invariant. In inter-universal Te-
ichmuller theory, such invariant mathematical structures are referred to as cores, and
the corresponding property of invariance is referred to as coricity. In the case of
the deformations of the arithmetic holomorphic structure — i.e., the conventional
mono-analytic container
... = ... ?
... = ... ?
... = ... ?
... = ... ?
log-volume estimate
12 Shinichi Mochizuki
ring/scheme-theoretic structure — of a number field equipped with an elliptic curve
that are studied in inter-universal Teichmuller theory, the structures that correspond,
relative to the analogy with quasiconformal maps between Riemann surfaces, to the
“underlying real analytic surface” are referred to as mono-analytic. That is to say,
the term “mono-analytic” may be understood as a shorthand for the expression “the
arithmetic analogue of the term real analytic”. Thus, to summarize, in the terminology
of [IUTchI], [IUTchII], [IUTchIII], [IUTchIV], the coricity of mono-analytic struc-
tures plays a central role in the theory.
The approach taken in inter-universal Teichmuller theory to estimating the distor-
tion discussed above consists of constructing “mono-analytic containers” in which
the alien arithmetic holomorphic structures that appear — e.g., the arithmetic
holomorphic structure surrounding the domain of the assignment (∗KEY) discussed
above, when considered from the point of view of the arithmetic holomorphic struc-
ture surrounding the codomain of this assignment — may be embedded, up to certain
relatively mild indeterminacies — cf. Theorem 4.1 below; [IUTchIII], Theorem
A. Once such an embedding is obtained, one may then proceed to estimate the log-
volume of a region which is sufficiently large as to cover all the possibilities that
arise from such indeterminacies — cf. Fig. 1.1 below; [IUTchIII], Theorem B.
Fig. 1.1: Log-volume estimate region inside a mono-analytic container
In this context, it is of interest to note that the idea of constructing appropriate
“mono-analytic containers” has numerous classical antecedents. Perhaps the most fun-
damental example is the idea of studying the variation of the complex holomorphic
moduli of an elliptic curve by studying the way in which the [complex] Hodge fil-
tration is embedded within the topological invariant constituted by the first singular
cohomology module with complex coefficients of the underlying torus. A slightly less
classical example may be seen in conventional Arakelov theory, in which one studies
the metric aspects of — i.e., the analytic torsion associated to — the holomorphic
variation of a complex variety by embedding the space of holomorphic sections of an
ample line bundle into the space of real analytic [or, more generally, L2-] sections of the
line bundle. That is to say, the theory of analytic torsion may be thought of as a sort
A Panoramic Overview of Inter-universal Teichmuller Theory 13
of approach to measuring the metric aspects of this embedding [cf. the discussion of
[IUTchIV], Remark 1.10.4, (a)]. Indeed, scheme-theoretic Hodge-Arakelov theory
was originally conceived precisely as a sort of arithmetic analogue of these two relatively
classical examples [cf. the discussion of [HASurI], §1.3, §1.4; [IUTchIV], Remark 1.10.4,
(b)].
At a concrete level, the log-volume estimates discussed above may be summarized
as asserting that the “distortion” that occurs at the portion labeled by the index j of
the LHS of the assignment (∗KEY) is [roughly]
≤ j · (log-diffF + log-condE)
— i.e., where we write log-diffF for the log-different of the number field F and log-condEfor the log-conductor of the elliptic curve E [cf. [GenEll], Definition 1.5, (iii), (iv), for
more details]. In particular, by the exact same computation [cf. the discussion of
[IUTchIV], Remark 1.10.1] — i.e., of the term of highest order of the average over j
— as the computation discussed above in the case of degrees of vector bundles on the
moduli stack of elliptic curves over Q, we obtain the following inequality:
16 · degarith(log(q)) = htE ≤ (1 + ε) · (log-diffF + log-condE) + constant
This inequality is the content of the so-called Szpiro Conjecture, or, equivalently
[cf., e.g., [GenEll], Theorem 2.1], the ABC Conjecture — cf. Corollary 4.2 below;
[IUTchIV], Theorem A.
§ 2. Teichmuller-theoretic Deformations
In §1, we discussed in some detail the Hodge-Arakelov-theoretic motivation that
underlies the deformations of conventional ring/scheme theory — i.e., of arithmetic
holomorphic structure — that form the principal content of inter-universal Te-
ichmuller theory. In the present §2, we begin to take a closer look at certain qualitative
aspects of these deformations of arithmetic holomorphic structure.
The ultimate motivating example that lies behind these deformations considered
in inter-universal Teichmuller theory is the theory of deformations of holomorphic
structure of a Riemann surface that are studied in classical complex Teichmuller
theory. Here, we recall that such classical Teichmuller deformations are associated
to a nonzero square differential on a Riemann surface. Relative to the canonical
holomorphic coordinates obtained locally on the Riemann surface by integrating various
square roots of the given square differential, these classical Teichmuller deformations
may be written in the form [cf. Fig. 2.1 below]
z → ζ = ξ + iη = λx+ iy
14 Shinichi Mochizuki
— where 1 < λ < ∞ is the dilation factor [cf., e.g., [Lehto], Chapter V, §8]. The key
qualitative feature of such deformations that is shared by the deformations of arithmetic
holomorphic structure that occur in inter-universal Teichmuller theory may be described
as follows:
The two underlying real dimensions of the single holomorphic dimension un-
der consideration are “decoupled” from one another; then one of these two
underlying real dimension is dilated/deformed, while the other underlying
real dimension is left fixed/undeformed.
Fig. 2.1: One dimension is dilated, while the other is left fixed
Before proceeding, we introduce some terminology which will be useful in our dis-
cussion of deformations of various types of “holomorphic” structure. To get a sense of
the qualitative aspects of the terminology introduced, it is useful to keep in mind the
fundamental example of the situation in which one considers various complex linear
structures [i.e., “copies of C”] on a two-dimensional real vector space [i.e., “R2”]
— cf. the discussion surrounding [IUTchII], Introduction, Fig. I.3; [Quasicon], Ap-
pendix. This fundamental example is illustrated in Fig. 2.2 below. As discussed in §1,structures which are common to the various distinct “holomorphic” structures under
consideration — i.e., such as the underlying real analytic structure in the context of
deformations of the holomorphic structure of a Riemann surface or the underlying real
vector space structure in the context of various one-dimensional complex linear struc-
tures on a two-dimensional real vector space — will be referred to as coric. On the
other hand, structures which depend on a specific choice of “holomorphic” structure —
i.e., a specific choice of a “spoke” in the diagram of Fig. 2.2 — will be referred to as
uniradial.
In this context, perhaps the most subtle notion is the notion of amultiradial struc-
ture. This term refers to structures that depend on a choice of “holomorphic” structure,
but which are described in terms of underlying coric structures in such a way as to be
unaffected by alterations in the “holomorphic” structure. In the case of the example
illustrated in Fig. 2.2, a typical [albeit somewhat tautological!] example of a multiradial
A Panoramic Overview of Inter-universal Teichmuller Theory 15
structure is given by the GL2(R)-orbit of an R-linear isomorphism C∼→ R2. This
terminology is summarized in Fig. 2.3 below. If one thinks of “holomorphic” structures
as “fibers” and of the underlying coric structure as a “base space”, then multiradial
structures may be thought of as fiber spaces equipped with a “connection” that may be
applied to execute “parallel transport” operations of fibers corresponding to arbitrary
motions in the base space [cf. the discussion of [IUTchII], Remark 1.7.1].
C∼→ R2
. . . | . . .
C∼→ R2
. . .
—
GL2(R)
�
R2
|
— C∼→ R2
. . .
C∼→ R2
Fig. 2.2: Numerous one-dimensional complex linear structures
“C” on a single two-dimensional real vector space “R2”
coric structure underlying common structure R2
multiradial “holomorphic” structure described in � GL2(R)
structure terms of underlying coric structure C∼→ R2
uniradial structure “holomorphic” structure C
Fig. 2.3: Coric, multiradial, and uniradial structures
Another important motivating example may be seen in the p-adic Teichmuller
theory of [pOrd], [pTeich], [CanLift]. This theory concerns p-adic canonical lift-
ings of a hyperbolic curve over a perfect field of positive characteristic equipped with
a nilpotent ordinary indigenous bundle. Such canonical liftings of hyperbolic curves are
equipped with canonical Frobenius liftings — i.e., canonical liftings of the Frobe-
nius morphism in positive characteristic — which are compatible with certain canonical
. . . . . . .
pppp
pp
pp
pp
pp
pp
pp
pp
pp
p
16 Shinichi Mochizuki
Frobenius liftings on certain p-adic stacks that may be thought of as p-adic etale local-
izations of the moduli stack of hyperbolic curves of a given type. These canonical
Frobenius liftings on hyperbolic curves and [certain p-adic etale localizations of] mod-
uli stacks of hyperbolic curves may be regarded as p-adic analogues of the well-known
metric on the Poincare upper half-plane and the Weil-Petersson metric on com-
plex Teichmuller space — cf. Fig. 2.4 below. We refer to the Introductions of [pOrd],
[pTeich] for more detailed descriptions of p-adic Teichmuller theory.
Fig. 2.4: Frobenius liftings on hyperbolic curves and their moduli
thought of as “p-adic metric flows”
The analogy between inter-universal Teichmuller theory and p-adic Teichmuller
theory is as summarized in Fig. 2.5 below. In this analogy, the number field equipped
with a(n) [once-punctured] elliptic curve that appears in inter-universal Teichmuller the-
ory [cf. the discussion of §1!] corresponds to the positive characteristic hyperbolic curve
equipped with a nilpotent ordinary indigenous bundle that appears in p-adic Teichmuller
theory. That is to say, the number field, equipped with the finite set of valuations which
are either archimedean valuations or nonarchimedean valuations at which the given el-
liptic curve has bad multiplicative reduction, corresponds to the positive characteristic
hyperbolic curve [which may be thought of as a one-dimensional function field, equipped
with finitely many valuations, i.e., the “cusps at infinity”]. The [once-punctured] ellip-
tic curve over the number field then corresponds to the nilpotent ordinary indigenous
bundle over the positive characteristic hyperbolic curve. Then just as inter-universal
Teichmuller theory concerns the issue of deforming conventional ring/scheme theory
[i.e., “over Z”], p-adic Teichmuller theory concerns the issue of p-adically deforming
certain given objects in positive characteristic scheme theory. Finally, the canoni-
cal Frobenius liftings that play a central role in p-adic Teichmuller theory may be
thought of as corresponding to the log-theta-lattice in inter-universal Teichmuller
A Panoramic Overview of Inter-universal Teichmuller Theory 17
theory, which we shall discuss in more detail in §3 below. From this point of view, it is
of interest to recall the transformations
“t → ta” and “q → ql”
— which appeared in the discussion of §1, and which are somewhat reminiscent in form
of the usual Frobenius morphism in positive characteristic. Indeed, the key assignment
(∗KEY) {qj
2}j=1,...,l�
→ q
— i.e., which played a central role in the discussion of §1 and will play a central role in the
discussion of the log-theta-lattice in §3 — may be thought of as a sort of “deformation”
from the identity assignment
q → q
to the assignment
q(l�)2 → q
— i.e., which, if one reverses the direction of the arrow, is reminiscent of the Frobenius
morphism in positive characteristic. That is to say, the assignment (∗KEY) may be
thought of as a sort of “abstract formal analogue” of the notion of [the mixed character-
istic aspect of] a Frobenius lifting in the p-adic theory [cf. the discussion of [IUTchII],
Remark 3.6.2, (iii)].
Inter-universal Teichmuller theory p-adic Teichmuller theory
number field, equipped with hyperbolic curve over a perfect
a finite set of valuations field of positive characteristic
[once-punctured] elliptic curve nilpotent ordinary indigenous bundle
conventional ring/scheme theory over Z positive characteristic scheme theory
the log-theta-lattice canonical Frobenius liftings
Fig. 2.5: The analogy between inter-universal Teichmuller theory
and p-adic Teichmuller theory
At a very rough, qualitative level, p-adic Teichmuller theory may be thought of as
a canonical analogue for hyperbolic curves over a perfect field of positive characteristic
equipped with a nilpotent ordinary indigenous bundle of the well-known classical theory
of the ring of Witt vectors W (F) associated to a perfect field F of positive character-
istic. That is to say, the ring of Witt vectors may be thought of as a “p-adic canonical
lifting” of the given perfect field of positive characteristic. Moreover, this “p-adic canon-
ical lifting” is equipped with a canonical Frobenius lifting ΦW (F) : W (F) → W (F),
18 Shinichi Mochizuki
i.e., a lifting of the usual Frobenius morphism on the given perfect field of positive char-
acteristic. One central object in the theory of Witt vectors is the multiplicative group
of Teichmuller representatives
[F×] ⊆ W (F)
of the nonzero elements of the field W (F)/(p)∼→ F, which may be characterized by the
property ΦW (F)(λ) = λp, for λ ∈ [F×]. From this point of view, one central aspect of the
theory of canonical liftings in p-adic Teichmuller theory that sets this theory apart from
the classical theory of Witt vectors is the existence of canonical coordinates, i.e.,
q-parameters “qx”, in the completion of each closed point x of the ordinary locus of
a canonically lifted curve X over W (F) which may be characterized by the property
ΦX(qx) = qpx
— where we write ΦX : Xord → Xord for the canonical Frobenius lifting on the ordinary
locus Xord of [the p-adic formal scheme associated to] X. These q-parameters may be
regarded as generalizations of the q-parameters on the moduli stack of elliptic curves
over Zp that appear in Serre-Tate theory. From the point of view of the analogy
between p-adic Teichmuller theory and classical complex Teichmuller theory [i.e., as
reviewed above!],
these q-parameters “qx” may be thought of as corresponding to the underlying
real dimension that is subject to dilation.
From the point of view of the theory of p-adic Galois representations of the arith-
metic fundamental group of X ×Spec(Zp) Spec(Qp), the q-parameters “qx” — i.e., which
have the effect of “diagonalizing” the Frobenius lifting ΦX and exhibiting it as a “mixed
characteristic p-adic flow” [cf. Fig. 2.4] — may be thought of as corresponding to
positive slope Galois representations, i.e., Galois representations that “straddle the
gap” between “mod pn” and “mod pm” for n = m [cf. the discussion of the “positive
slope version of Hensel’s lemma” in [AbsTopII], Lemma 2.1; [AbsTopII], Remarks 2.1.1,
2.1.2]. By contrast, the group of Teichmuller representatives “[F×]” may be thought
of as corresponding to [a certain portion of] the structure sheaf of the given positive
characteristic hyperbolic curve that is held fixed — i.e., is coric — with respect to
the deformation to mixed characteristic. From the point of view of the theory of p-
adic Galois representations, the group of Teichmuller representatives “[F×]” may
be thought of as corresponding to [a certain portion of the] slope zero Galois repre-
sentations, i.e., at a more concrete level, to invariants of the Frobenius morphism in
positive characteristic. Relative to the analogy with classical complex Teichmuller
theory [i.e., as reviewed above!],
A Panoramic Overview of Inter-universal Teichmuller Theory 19
such slope zero Galois representations may be thought of as corresponding to
the underlying real dimension that is held fixed.
The above discussion may be related to the theory of absolute Galois groups of
p-adic local fields, as follows. Let Gk be the absolute Galois group of a p-adic local field
k [i.e., a finite extension of Qp]. Write Ok ⊆ k for the ring of integers of k, O×k ⊆ Ok for
the subgroup of units, and Ik ⊆ Gk for the inertia subgroup of Gk. Then at the level of
absolute Galois groups, slope zero and positive slope Galois representations correspond,
respectively, to the maximal unramified quotient Gk/Ik of Gk and the inertia subgroup
Ik ⊆ Gk. From the point of view of cohomological dimension, the two cohomological
dimensions of Gk may be thought of as consisting precisely of the one cohomological
dimension of Gk/Ik (∼= Z) and the one cohomological dimension of Ik. From the
point of view of local class field theory, the one cohomological dimension of Gk/Ik
corresponds to the value group k×/O×k , while the one cohomological dimension of Ik
corresponds to the group of units O×k .
The above discussion of p-adic local fields may be related naturally to the discussion
of complex Teichmuller theory at the beginning of the present §2. That is to say,
by applying the exponential function on C, the one real dimension consituted by the
real axis may be thought of as corresponding to the first factor “R>0” of the natural
product decomposition
C× = R>0 × S1,
while the one real dimension constituted by the complex axis may be thought of as
corresponding to the second factor “S1” of this product decomposition. If one thinks
in terms of the singular cohomology with compact supports of C or C×, then the two
underlying real dimensions of C or C× may be thought of consisting precisely of the
“cycles with compact supports” determined by these two real dimensions “R>0” and
“S1”, i.e., put another way, of the value group and group of units of the complex
archimedean local field C. That is to say, one obtains an entirely analogous description
to the description discussed above in the case of p-adic local fields.
One important point of view in the context of the above discussion of the “two
underlying arithmetic dimensions” of p-adic and complex archimedean local fields is the
following:
p-adic and complex Tate curves “Gm/qZ” allow one to relate, in a natural
fashion, the “two underlying arithmetic dimensions” of the local field
under consideration to the “two underlying geometric dimensions” of the
[elliptic curve constituted by such a] Tate curve.
This point of view plays an important role throughout inter-universal Teichmuller the-
ory. At a more concrete level, the Tate curve “Gm/qZ” admits a natural covering, in a
20 Shinichi Mochizuki
suitable sense, by “Gm”. Then by considering the points of this copy of “Gm” valued in
the local field under consideration, this natural covering serves to map the two under-
lying arithmetic dimensions of the above discussion onto the two underlying geometric
dimensions of the elliptic curve given by such a Tate curve. These two underlying di-
mensions may be seen concretely in the usual topological [i.e., in the complex case] or
etale [i.e., in the p-adic case] fundamental group of this elliptic curve. Moreover, if one
thinks of this elliptic curve as the compactification of the once-punctured elliptic curve
obtained by removing the origin, then the highly nonabelian structure of the resulting
nonabelian fundamental group may be thought of as representing the “intertwining”
of these two underlying dimensions — cf. Fig. 2.6 below. In this context, it is of interest
to recall that
a suitable quotient of this nonabelian fundamental group of a once-punc-
tured elliptic curve is naturally isomorphic to the theta group associated
to the ample line bundle on the elliptic curve determined by the origin
Fig. 2.6: Intertwining cycles on a once-punctured elliptic curve
— cf. the discussion at the beginning of [EtTh], §1, §2. Indeed, the above chain of
observations may be thought of as the starting point for the introduction of the theory
of theta functions as developed in [EtTh] in inter-universal Teichmuller theory. In
fact, the review of complex Teichmuller theory given at the beginning of the present §2was included precisely to motivate the following fundamental aspect of inter-universal
Teichmuller theory:
The local portions of the deformations of a number field equipped with an
elliptic curve that are constructed in inter-universal Teichmuller theory are
obtained precisely by dilating the “one underlying arithmetic dimension” con-
stituted by the value groups by means of a theta function, while the “other
underlying arithmetic dimension” constituted by the groups of units is left
fixed.
We refer to §3 below for a more detailed discussion.
In the context of the above discussion, it is of interest to recall that unlike theta
groups, the arithmetic fundamental group of a once-punctured elliptic curve over an
“arithmetic” field such as a number field or a p-adic local field satisfies highly nontrivial
A Panoramic Overview of Inter-universal Teichmuller Theory 21
rigidity properties, which are an important theme in [EtTh], and which play a central
role in inter-universal Teichmuller theory [cf. the discussion of §3, §4 below]. For
instance, one verifies immediately that a pro-l theta group — i.e., the pro-l group
generated by three generators α, β, γ satisfying the relations
α · γ = γ · α, β · γ = γ · β, α · β = β · α · γ
— admits automorphisms of the form
α → αλ, β → βλ, γ → γλ2
,
for λ ∈ Z×l . Such automorphisms cease to exist if one regards this theta group as a
subquotient of the arithmetic fundamental group of a once-punctured elliptic curve over
a number field or a p-adic local field.
So far in our discussion of “two underlying arithmetic dimensions”, we have concen-
trated on local fields, i.e., on the various archimedean and nonarchimedean localizations
“Fv” of a [say, for simplicity, totally complex] number field F . Note, however, that by
considering the second Galois cohomology module of the absolute Galois group GF of F
— i.e., in essence, the Brauer group of F — one may relate, via the various restriction
maps in Galois cohomology, the two cohomological dimensions of GF to the two
cohomological dimensions of the various nonarchimedean Fv.
On the other hand, in the various constructions of inter-universal Teichmuller the-
ory, the phenomenon of “two underlying arithmetic dimensions” in the context of global
number fields will also appear in a somewhat different incarnation, which we describe
as follows. The two underlying cohomological dimensions of the absolute Galois group
Gk of a p-adic local field k are easiest to understand explicitly if one restricts oneself to
the maximal tamely ramified quotient of Gk, i.e., which may be described explicitly as
a [“representatively large”] closed subgroup of the product over prime numbers l = p of
profinite groups of the form
Zl � Z×l
— where the semi-direct product arises from the natural action of the multiplicative
group Z×l on the additive group Zl. This motivates the point of view that the semi-
direct product of monoids
(Z,+) � (Z,×)
— where the semi-direct product arises from the natural action of the multiplicative
monoid (Z,×) [i.e., obtained by considering the multiplicative portion of the structure
of the ring of integers Z] on the additive group (Z,+) [i.e., obtained by considering the
additive portion of the structure of the ring of integers Z] — may be thought of as a
sort of “fundamental underlying combinatorial prototype” of the two underlying
arithmetic dimensions of a local field, as discussed above.
22 Shinichi Mochizuki
Moreover, from the point of view of Hodge-Arakelov theory [cf. the discussion
of §1], it is natural to attempt to “approximate Z” by means of the finite field Fl,
for some prime number l, which one thinks of being “large”, i.e., roughly of the order
of the height of the elliptic curve over a number field under consideration — cf. the
discussion of [HASurI], §1.3.4; [IUTchI], Remark 6.12.3, (i). In particular, we shall think
of the semi-direct product of monoids (Z,+) � (Z,×) as being approximated by the
semi-direct product of groups
Fl � F×l
— where the semi-direct product arises from the natural action of the multiplicative
group F×l on the additive group Fl. In fact, in the context of inter-universal Teichmuller
theory, this copy of Fl = Z/l · Z will appear as a quotient of the module of l-torsion
points of the elliptic curve over a number field under consideration. We refer to the
portion of §3 below concerning the structure of Hodge theaters for a more detailed
discussion.
§ 3. The Log-theta-lattice
The starting point of the various constructions of inter-universal Teichmuller theory
is a collection of initial Θ-data [cf. [IUTchI], §I1; [IUTchI], Definition 3.1]. Roughly
speaking, this data consists, essentially, of
• an elliptic curve EF over a number field F ,
• an algebraic closure F of F ,
• a prime number l ≥ 5,
• a collection of valuations V of a certain subfield K ⊆ F , and
• a collection of valuations Vbadmod of a certain subfield Fmod ⊆ F
that satisfy certain technical conditions — we refer to [IUTchI], Definition 3.1, for
more details. Here, we write Fmod ⊆ F for the field of moduli of EF , K ⊆ F for
the extension field of F generated by the fields of definition of the l-torsion points of
EF , XF ⊆ EF for the once-punctured elliptic curve obtained by removing the origin
from EF , and XF → CF for the hyperbolic orbicurve obtained by forming the stack-
theoretic quotient of XF by the natural action of {±1}. Then F is assumed to be
Galois over Fmod, Gal(K/F ) is assumed to be isomorphic to a subgroup of GL2(Fl) that
contains SL2(Fl), EF is assumed to have stable reduction at all of the nonarchimedean
valuations of F , CKdef= CF ×F K is assumed to be a K-core [cf. [CanLift], Remark
2.1.1], V is assumed to be a collection of valuations of K such that the natural inclusion
Fmod ⊆ F ⊆ K induces a bijection V∼→ Vmod between V and the set Vmod of all
valuations of the number field Fmod, and Vbadmod ⊆ Vmod is assumed to be some nonempty
set of nonarchimedean valuations of odd residue characteristic over which EF has bad
A Panoramic Overview of Inter-universal Teichmuller Theory 23
[i.e., multiplicative] reduction— i.e., roughly speaking, the subset of the set of valuations
where EF has bad multiplicative reduction that will be “of interest” to us in the context
of the constructions of inter-universal Teichmuller theory. Then we shall write Vbad def=
Vbadmod ×Vmod
V ⊆ V, Vgoodmod
def= Vmod \ Vbad
mod, Vgood def
= V \ Vbad. Also, we shall apply
the superscripts “non” and “arc” to V, Vmod to denote the subsets of nonarchimedean
and archimedean valuations, respectively. The data listed above determines, up to K-
isomorphism [cf. [IUTchI], Remark 3.1.3], a finite etale covering CK → CK of degree l
such that the base-changed covering
XKdef= CK ×CF
XF → XKdef= XF ×F K
arises from a rank one quotient EK [l] � Q (∼= Z/lZ) of the module EK [l] of l-torsion
points of EK(K) [where we write EKdef= EF ×F K] which, at v ∈ Vbad, restricts to the
quotient arising from coverings of the dual graph of the special fiber.
The constructions of inter-universal Teichmuller theory may, in some sense, be sum-
marized as a step-by-step dismantling of the conventional scheme-theoretic arithmetic
geometry surrounding the given initial Θ-data which is
sufficiently drastic as to allow us to realize the key assignment (∗KEY) dis-
cussed in §1, but nevertheless sufficiently controlled so as to allow us to
compute precisely the resulting distortion of various objects and construc-
tions from conventional arithmetic geometry.
The central stage on which these constructions are performed is the log-theta-lattice
[cf. Fig. 3.1 below], various versions of which are defined in [IUTchIII]. Each “•” in
Fig. 3.1 represents a Θ±ellNF-Hodge theater, which may be thought of as a sort of
miniature model of the conventional arithmetic geometry surrounding the given initial
Θ-data. Each vertical arrow “↑” in Fig. 3.1 represents a log-link, i.e., a certain type
of gluing between various portions of the Θ±ellNF-Hodge theaters that constitute the
domain and codomain of the arrow. Each horizontal arrow “→” in Fig. 3.1 represents
a Θ-link [various versions of which are defined in [IUTchI], [IUTchII], [IUTchIII]], i.e.,
another type of gluing between various portions of the Θ±ellNF-Hodge theaters that
constitute the domain and codomain of the arrow.
Relative to the analogy between inter-universal Teichmuller theory and p-adic Te-
ichmuller theory [cf. the discussion of §2], each Θ±ellNF-Hodge theater corresponds
to a model of positive characteristic scheme theory surrounding the given hyper-
bolic curve over a perfect field equipped with a nilpotent ordinary indigenous bundle;
each log-link corresponds to the Frobenius morphism in positive characteristic;
each Θ-link corresponds to a “mixed characteristic transition” from pnZ/pn+1Z
to pn−1Z/pnZ, for some positive integer n. In particular, the two dimensions of the
diagram constituted by the log-theta-lattice may be thought of as corresponding to the
24 Shinichi Mochizuki
two underlying arithmetic dimensions — i.e., “slope zero” and “positive slope” — of a
p-adic local field discussed in §2.Next, we proceed to take a closer look at the various components “•”, “↑”, “→”
of the log-theta-lattice. Each Θ±ellNF-Hodge theater — i.e., each “•” — consists of a
complicated system of Frobenioids equipped with various auxiliary data. A Frobenioid
is a category satisfying certain abstract properties, which imply, via the theory of [FrdI],
that it looks, roughly speaking, like the sort of category that arises from considering
the geometry of divisors and line bundles on the various connected coverings of an
irreducible normal noetherian scheme. Put another way, a Frobenioid may be thought
of, roughly speaking, as the result of “deleting the underlying scheme portion of a log
scheme” — i.e., so to speak
(Frobenioid) ≈ (log scheme) − (underlying scheme).
For instance, If k is an algebraic closure of a local p-adic field k, and we write Gkdef=
Gal(k/k) for the absolute Galois group of k and O�k
for the multiplicative monoid de-
termined by the nonzero integers of k, then the local data
Gk � O�k
— i.e., which we think of as consisting of some abstract topological group acting on an
abstract topological monoid — determines [and, indeed, is equivalent to the datum of]
a certain type of Frobenioid. On the other hand, if we write GFdef= Gal(F/F ) for the
absolute Galois group of F and F× ∪ {0} for the multiplicative monoid determined by
the elements of F , then the global data
GF � F× ∪ {0}
— i.e., which we think of as consisting of some abstract topological group acting on an
abstract monoid equipped with the various topologies determined by the valuations of F
— determines [and, indeed, is equivalent to the datum of] a certain type of Frobenioid.
Variants of the above examples of Frobenioids may be obtained if GF (respectively, Gk)
is replaced by one of the arithmetic fundamental groups ΠXK, ΠCK
of the orbicurves
XK , CK discussed above (respectively, by the arithmetic fundamental group “Πv” of
a certain finite etale covering “Xv” [when v ∈ Vbad], “X−→v
” [when v ∈ Vgood] of the
localization of XK at v ∈ Vnon). Moreover, at v ∈ Vbad, it will turn out to be more
natural to take this “Πv” to be the corresponding tempered arithmetic fundamental
group.
The local and global Frobenioids that appear in a Θ±ellNF-Hodge theater
correspond essentially to this sort of variant of the examples of Frobenioids just
discussed.
A Panoramic Overview of Inter-universal Teichmuller Theory 25
That is to say, at v ∈ Vnon, the field k is taken to be the completion Kv of K at
v; by abuse of notation, we shall write “F v” for any of the algebraic closures of Kv
determined by F and Gvdef= Gal(F v/Kv); we observe that Gv may be thought of as a
quotient Πv � Gv of Πv. An analogous theory exists at archimedean v ∈ V.
Throughout the theory of [IUTchI], [IUTchII], and [IUTchIII], we shall often work
with collections of data of a certain type indexed by the elements of V. Such collections
of data will be referred to as prime-strips. Roughly speaking, such “prime-strips” may
be thought of as a sort of monoid- or Galois-theoretic version of the classical notions
of adeles/ideles. For instance, by considering collections of data constituted by the
various “local data” — i.e., Gv � [a certain submonoid of] O�F v
, for v ∈ Vnon — or
“variant local data” — i.e., Πv � O�Fv
, for v ∈ Vnon — discussed above, we obtain
F�-prime-strips [in the case of the “local data”] or F-prime-strips [in the case of the
“variant local data”]. Alternatively, the topological group portion of such collections
of data — i.e., Gv or Πv, for v ∈ Vnon – gives rise to D�-prime-strips [in the case
of “Gv”] or D-prime-strips [in the case of “Πv”]. Here, we remark that the notation
�is used as an abbreviation for the term mono-analytic, i.e., “the arithmetic analogue
of the term real analytic” [cf. the discussion of §1]. We refer to [IUTchI], Fig. I1.2,
and the surrounding discussion for a more detailed discussion of the various types of
prime-strips that appear in the theory.
......⏐⏐ ⏐⏐
. . .−→ • −→ • −→ . . .
⏐⏐ ⏐⏐. . .−→ • −→ • −→ . . .
⏐⏐ ⏐⏐...
...
Fig. 3.1: The log-theta-lattice consists of “•” [i.e., Θ±ellNF-Hodge theaters],
“↑” [i.e., log-links], and “→” [i.e., Θ-links]
The significance of working with various types of prime-strips lies in the fact that
prime-strips allow one to
26 Shinichi Mochizuki
simulate a situation in which the natural map Spec(K) → Spec(F ) admits
“sections” which make it possible to consider global multiplicative sub-
spaces and global canonical generators of the l-torsion points of EK
— cf. the discussion of §1; Fig. 3.2 below. Indeed, the essential content of the various
technical conditions imposed on the collection of valuations V of K may be summarized
as the requirement that V constitutes just such a section [cf. the bijection V∼→ Vmod
discussed above]! Put another way, the geometry of a Θ±ellNF-Hodge theater is ar-
ranged precisely so as to enable one to construct tautological global multiplicative
subspaces and global canonical generators — cf. the discussion of [IUTchI], §I1, sur-rounding [IUTchI], Fig. I1.4. On the other hand, since such “sections” do not in fact
respect the ring structure of K, F — i.e., put another way, they amount to a partial
dismantling of this ring structure — it is necessary to keep track explicitly of the var-
ious constructions and objects arising from this ring structure that are of interest in
the development of the theory. One important example of such a construction is the
theory of global arithmetic line bundles on the various number fields under considera-
tion, which may be kept track of by considering suitable global Frobenioids, as discussed
above. In the case of mono-analytic prime-strips [i.e., prime-strips whose type includes
the symbol “�”], we shall work with the global realified Frobenioids associated to
the global Frobenioids discussed above — i.e., roughly speaking, Frobenioids that corre-
spond to considering arithmetic divisors with coefficients ∈ R. The resulting collections
of data [i.e., consisting of mono-analytic prime-strips and global realified Frobenioids]
are denoted by means of the symbol “�” [cf. [IUTchI], Fig. I1.2, and the surrounding
discussion].
· − · − · − . . .− · − · − ·· − · − · − . . .− · − · − ·
. . . K
· − · − · − . . .− · − · − ·· − · − · − . . .− · − · − ·
� Gal(K/F )
⊆ GL2(Fl)
⏐⏐�
· − · − · − . . .− · − · − · F
Fig. 3.2: Prime-strips as “sections” of Spec(K) → Spec(F )
A Panoramic Overview of Inter-universal Teichmuller Theory 27
In the discussion to follow, we shall use the following notation:
l�def= (l − 1)/2; l± def
= (l + 1)/2; F�l
def= F×
l /{±1}; F�±l
def= Fl � {±1}
— cf. the discussion of [IUTchI], §I1. The natural action of the stabilizer in Gal(K/F )
of the quotient EK [l] � Q [cf. the discussion at the beginning of the present §3] on Q
determines a natural “action up to certain indeterminacies” of F�l on CK , i.e., a natural
isomorphism of F�l with some subquotient [that is to say, where the “quotient” corre-
sponds to the “indeterminacies” just mentioned] of the automorphism group Aut(CK)
of the algebraic stack CK . The F�l -symmetry constituted by this action up to indeter-
minacies of F�l may be thought of as being essentially arithmetic in nature, in the sense
that the subquotient of Aut(CK) that gives rise to this poly-action of F�l is induced, via
the natural map Aut(CK) → Aut(K), by a subquotient of Gal(K/F ) ⊆ Aut(K). In a
similar vein, the natural action of the automorphisms of the scheme XK on the cusps of
XK determines a natural “action up to indeterminacies” of F�±l on XK , i.e., a natural
isomorphism of F�±l with some subquotient of Aut(XK). The F�±
l -symmetry consti-
tuted by this action up to indeterminacies of F�±l may be thought of as being essentially
geometric in nature, in the sense that the subgroup AutK(XK) ⊆ Aut(XK) [i.e., of
K-linear automorphisms] maps isomorphically onto the subquotient of Aut(XK) that
gives rise to this F�±l -symmetry. Finally, the global F�
l -symmetry of CK only extends
to a “{1}-symmetry” [i.e., in essence, fails to extend!] of the local coverings Xv[for
v ∈ Vbad] and X−→v[for v ∈ Vgood], while the global F�±
l -symmetry of XK only extends
to a “{±1}-symmetry” [i.e., in essence, fails to extend!] of the local coverings Xv[for
v ∈ Vbad] and X−→v[for v ∈ Vgood] — cf. Fig. 3.3 below.
Ultimately, the LHS {qj
2}j=1,...,l�
of the key assignment (∗KEY) discussed in §1 is obtained, in inter-universal Te-
ichmuller theory, by evaluating the [reciprocal of the l-th root of the] theta function
on the l-torsion points corresponding to the quotient EK [l]� Q. One important aspect
of this crucial operation of “Hodge-Arakelov-theoretic evaluation” is the issue of
keeping track of the labels j = 1, . . . , l� of these theta values. At a purely technical
level, one of the main functions of the apparatus constituted by a Θ±ellNF-Hodge the-
ater is precisely the task of serving as a bookkeeping device for these labels. In fact,
a Θ±ellNF-Hodge theater consists of two portions, namely, a Θ±ell-Hodge theater and
a ΘNF-Hodge theater. The scheme-theoretic F�l -symmetry discussed above induces
an F�l -symmetry on certain of the structures [essentially systems of Frobenioids] of a
ΘNF-Hodge theater and, in particular, on the labels j = 1, . . . , l�, which we think of
[i.e., by reducing modulo l] as elements of F�l , via the natural action of the group F�
l on
28 Shinichi Mochizuki
itself. In a similar vein, the scheme-theoretic F�±l -symmetry discussed above induces
an F�±l -symmetry on certain of the structures [essentially systems of Frobenioids] of
a Θ±ell-Hodge theater and, in particular, on the labels t = −l�, . . . ,−1, 0, 1, . . . , l�,
which we think of [i.e., by reducing modulo l] as elements of Fl, via the natural action
of the group F�±l on Fl. Finally, a Θ±ellNF-Hodge theater is obtained by gluing a
ΘNF-Hodge theater to a Θ±ell-Hodge theater along prime-strips equipped with labels
as discussed above by means of the correspondence of labels given by
(F�±l �) Fl ⊇ F×
l � F×l /{±1} = F�
l (� F�l )
— cf. [IUTchI], §I1; [IUTchI], §4, §5, §6, for more details.
{±1}� {X
vor X−→v
}v∈V
� �
F�±l� XK CK
F�
l�
Fig. 3.3: Symmetries of coverings of XF
The F�±l -symmetry has the advantange that, being geometric in nature [cf. the
above discussion], it allows one to permute various copies of “Gv” [where v ∈ Vnon]
associated to distinct labels ∈ Fl without inducing conjugacy indeterminacies. This
phenomenon, which we shall refer to as conjugate synchronization, plays a key role
in the Kummer theory surrounding the Hodge-Arakelov-theoretic evaluation of the
theta function at l-torsion points that is developed in [IUTchII]— cf. the discus-
sion of [IUTchI], Remark 6.12.6; [IUTchII], Remark 3.5.2, (ii), (iii); [IUTchII], Remark
4.5.3, (i). By contrast, the F�l -symmetry is more suited to situations in which one must
descend from K to Fmod. In inter-universal Teichmuller theory, the most important
such situation involves the Kummer theory surrounding the reconstruction of the
number field Fmod from ΠCK— cf. [IUTchI], Example 5.1; the discussion of [IUTchI],
Remark 6.12.6; [IUTchII], Remark 4.7.6. Here, we note that such situations necessarily
induce global Galois permutations of the various copies of “Gv” [where v ∈ Vnon] associ-
ated to distinct labels ∈ F�l that are only well-defined up to conjugacy indeterminacies.
In particular, the F�l -symmetry is ill-suited to situations, such as those that appear in
the theory of Hodge-Arakelov-theoretic evaluation that is developed in [IUTchII], that
require one to establish conjugate synchronization.
From the point of view of the discussion of §2 concerning the two underlying
arithmetic dimensions of the ring structure of a ring, the structure of a Θ±ellNF-
Hodge theater may be regarded as a decomposition of the ring Fl — which, as discussed
in §2, is often regarded, in the context of inter-universal Teichmuller theory, as a finite
approximation of the ring of integers Z — into its two underlying arithmetic dimensions,
A Panoramic Overview of Inter-universal Teichmuller Theory 29
i.e., addition [cf. the F�±l -symmetry] andmultiplication [cf. the F�
l -symmetry]. Mor-
ever, this decomposition into additive and multiplicative symmetries in the theory of
Θ±ellNF-Hodge theaters may be compared to groups of additive and multiplicative
symmetries of the upper half-plane [cf. Fig. 3.4 below]. Here, the “cuspidal” ge-
ometry expressed by the additive symmetries of the upper half-plane admits a natural
“associated coordinate”, namely, the classical q-parameter, which is reminiscent of the
way in which the F�±l -symmetry is well-adapted to the Kummer theory surrounding
the Hodge-Arakelov-theoretic evaluation of the theta function at l-torsion points [cf.
the above discussion]. By contrast, the “toral”, or “nodal” [cf. the classical theory
of the structure of Hecke correspondences modulo p], geometry expressed by the multi-
plicative symmetries of the upper half-plane admits a natural “associated coordinate”,
namely, the classical biholomorphic isomorphism of the upper half-plane with the unit
disc, which is reminiscent of the way in which the F�l -symmetry is well-adapted to the
Kummer theory surrounding the number field Fmod [cf. the above discussion]. For
more details, we refer to the discussion of [IUTchI], Remark 6.12.3, (iii).
Upper half-plane Θ±ellNF-Hodge theaters
(“Cuspidal”) Additive z → z + a, F�±l -
symmetry z → −z + a (a ∈ R) symmetry
“Functions” assoc’d qdef= e2πiz theta fn. evaluated at
to add. symm. l-tors.
(“Toral/nodal”) z → z·cos(t)−sin(t)z·sin(t)+cos(t) ,
Multiplicative z → z·cos(t)+sin(t)z·sin(t)−cos(t) F�
l -symmetry
symmetry (t ∈ R)
“Functions” assoc’d wdef= z−i
z+i elements of the
to mult. symm. number field Fmod
Fig. 3.4: Comparison of F�±l -, F�
l -symmetries with the geometry of the upper
half-plane
Each log-link — i.e., each “↑” — of the log-theta-lattice may be described as
follows. At each v ∈ Vnon, if we write pv for the prime number determined by v, then
the ring structures in the domain and codomain of the log-link are related by means of
the non-ring-homomorphism given by the [classical] pv-adic logarithm
logv : F×v → F v
— where the field F v is reconstructed from the topological monoid equipped with the
action of a topological group Πv � O�F v
[cf. the above discussion], in a fashion discussed
30 Shinichi Mochizuki
in detail in [AbsTopIII], §3; the “F v” in the domain (respectively, codomain) of the map
logv arises from the Θ±ellNF-Hodge theater in the domain (respectively, codomain) of
the log-link [i.e., the arrow “↑”]. The absolute anabelian theory surrounding such pv-
adic logarithms is one of the central themes of [AbsTopIII]. The starting point of this
theory of [AbsTopIII] consists of the key observation that the log-link is compatible
with “some” isomorphism
Πv∼→ Πv
between the arithmetic fundamental groups Πv on either side of the log-link, relative
to the natural actions via Πv � Gv of Πv that appear in the domain and codomain of
the map logv. At first glance, it may appear as though this unspecified isomorphism may
be regarded as the identity map. In fact, however, the gap between the ring/scheme-
theoretic basepoints in the domain and codomain necessitates the point of view that
corresponding arithmetic fundamental groups in the domain and codomain may only
be related by means of some indeterminate isomorphism of abstract topological
groups. This topic will be discussed in more detail in §4 below.
At v ∈ Varc, the log-link admits an analogous description. Moreover, if one allows
v to vary, then the log-link is also compatible with corresponding global arithmetic
fundamental groups — i.e., ΠXK, ΠCK
[cf. the above discussion] — in the domain
and codomain of the log-link. Indeed, this sort of global compatibility with the log-link
is one of the central themes of [AbsTopIII]. Thus, in summary,
corresponding arithmetic fundamental groups of the Θ±ellNF-Hodge theaters
in the domain and codomain of the log-link are related by means of indeter-
minate isomorphisms of abstract topological groups
Πv∼→ Πv, for v ∈ Vnon; ΠXK
∼→ ΠXK; ΠCK
∼→ ΠCK.
An analogous statement holds at v ∈ Varc. This invariance of the various arithmetic
fundamental groups that appear in a Θ±ellNF-Hodge theater — i.e., when regarded as
abstract topological groups — is referred to, in inter-universal Teichmuller theory,
as the vertical coricity [i.e., coricity with respect to the vertical arrows “↑”] of thesearithmetic fundamental groups, up to indeterminate isomorphism. In this context, we
recall that it follows from the absolute anabelian geometry developed in [AbsTopIII]
[cf. also [AbsTopI], [AbsTopII]] that the ring structure — i.e., “arithmetic holomorphic
structure” — of the various local and global base fields involved may be reconstructed
group-theoretically from these arithmetic fundamental groups, considered up to isomor-
phism. That is to say, this arithmetic holomorphic structure is an invariant of
each vertical line of the log-theta-lattice.
Each Θ-link — i.e., each “→” — of the log-theta-lattice may be described as
follows. At each v ∈ Vbad, the ring structures in the domain and codomain of the
A Panoramic Overview of Inter-universal Teichmuller Theory 31
Θ-link are related by means of the non-ring-homomorphism given as follows:
(Gv � O×μ
Fv)
∼→ (Gv � O×μ
Fv);
{qj
2}j=1,...,l�
→ q
— we write “O×μ def= O×/Oμ” for the quotient of O× by its torsion subgroup “Oμ ⊆
O×”; we regard O×μ
Fvas an abstract topological module equipped with the action of an
abstract topological group “Gv”, as well as with a certain “integral structure”, which
we shall not describe here in detail [cf. [IUTchII], Definition 4.9, for a more detailed
technical discussion]; the collection{qj
2}j=1,...,l�
may be regarded as a collection of
certain values of the [reciprocal of the l-th root of the] theta function [cf. the discussion
surrounding Fig. 3.3]; the latter portion of the assignment of the above display is to be
regarded as an isomorphism of abstract monoids [i.e., both of which are isomorphic
to the monoid N] generated, respectively, by the LHS and RHS of this assignment. Thus,
the latter portion of the Θ-link consists precisely of the key assignment (∗KEY) that
played a central role in the discussion of §1. One key observation concerning the Θ-
link is that since “O×μ
Fv” is regarded as an abstract topological module [equipped with a
certain “integral structure”], it is subject to the Z×-indeterminacies
O×μ
Fv� λ ∈ Z×
— i.e., given by “raising to the λ-th power”. Another key observation concerning the
Θ-link at v ∈ Vbad is that it is compatible with “some” isomorphism
Gv∼→ Gv
between the Galois groups Gv on either side of the Θ-link, relative to the natural
actions of Gv that appear in the domain and codomain of the Θ-link. At first glance,
it may appear as though this unspecified isomorphism may be regarded as the identity
map. In fact, however, the gap between the ring/scheme-theoretic basepoints in the
domain and codomain necessitates the point of view that corresponding arithmetic
fundamental groups in the domain and codomain may only be related by means of
some indeterminate isomorphism of abstract topological groups. This topic will be
discussed in more detail in §4 below.
One central aspect of the theory of Frobenioids developed in [FrdI] is the funda-
mental dichotomy [cf. the discussion of [FrdI], §I4; [IUTchI], §I2, §I3; the beginning
of [IUTchII], Introduction] between “etale-like” and “Frobenius-like” structures. In
the discussion above of various examples of Frobenioids, the etale-like portion of the cor-
responding Frobenioids consists of the various arithmetic fundamental groups Gk, GF ,
Gv, Πv, ΠXK, ΠCK
. Thus, the etale-like portion of a Θ±ellNF-Hodge theater, when
regarded up to indeterminate isomorphism, is vertically coric. In a similar vein, at
32 Shinichi Mochizuki
v ∈ Vbad, the pair [i.e., regarded as an abstract topological module equipped with the
action of an abstract topological group] (Gv � O×μ
F v), when regarded up to indetermi-
nate isomorphism, is horizontally coric, i.e., invariant with respect to the [horizontal
arrow “→” constituted by the] Θ-link. Unlike the case with the vertically coric arith-
metic fundamental groups Πv, ΠXK, ΠCK
, it is not possible to reconstruct the ring
structure of the local base fields involved from the absolute Galois group Gv. That is
to say, the vertically coric arithmetic holomorphic structures in each vertical line of
the log-theta-lattice [cf. the above discussion] are not compatible with the horizon-
tal arrows of the log-theta-lattice. On the other hand, the Frobenius-like portion of
the various examples of Frobenioids given above consists of the abstract monoids, i.e.,
O�k, O�
F v, F
× ∪ {0}. Here, we recall that such monoids are often reconstructed by
means of anabelian algorithms from various arithmetic fundamental groups, i.e., from
various etale-like structures. In such a situation, if one considers the reconstructed
monoid together with the arithmetic fundamental group and reconstruction algorithm
from which it was reconstructed, then the resulting collection of data constitutes an
etale-like structure. On the other hand, if one forgets the portion of such a collection of
data that concerns the way in which the monoid was reconstructed from an arithmetic
fundamental group, i.e., if one regards the monoid just as some abstract monoid, then
the resulting collection of data constitutes a Frobenius-like structure.
One important observation in the context of the log- and Θ-links is the following:
The “wall” — i.e., relative to the conventional ring/scheme structures in the
domain and codomain — constituted by the assignment that gives rise to the
Θ-link at v ∈ Vbad can only be defined by working with abstract topological
monoids, i.e., with Frobenius-like structures. An analogous statement holds
for the log-link.
Put another way, it is only by stripping the abstract topological monoids that occur in
the LHS and RHS of the assignment that gives rise to the Θ-link of any information
concerning how those abstract topological monoids were, for instance, reconstructed
from various arithmetic fundamental groups that appear [cf. the above discussion!] that
one may obtain isomorphic mathematical objects that allow one to define the gluing
of Θ±ellNF-Hodge theaters constituted by the Θ-link [cf. the discussion of [IUTchII],
Remark 1.11.3, (i)]. An analogous statement holds for the log-link. By contrast,
the importance of etale-like structures arises from the fact that they have the
power to penetrate these “walls”
— cf. the key vertical and horizontal coricity properties discussed above.
The Θ-link is defined for [both nonarchimedean and archimedean] v ∈ Vgood in such
a way as to be compatible with the definition at v ∈ Vbad, relative to the “product
A Panoramic Overview of Inter-universal Teichmuller Theory 33
formula” [i.e., of elementary number theory, that relates the values that arise as a result
of applying various valuations of a number field]. This compatibility with the product
formula allows one to define the Θ-link also for certain global realified Frobenioids.
One fundamental consequence of the definition of the vertical and horizontal arrows
of the log-theta-lattice by means of various Frobenius-like structures — i.e., such as
abstract monoids— is the following observation: the log-theta-lattice is not symmetric
with respect to permutations that switch the domain and codomain of the various
arrows that appear in the log-theta-lattice. By contrast, if, for some v ∈ Vnon, one
considers the “vertically coric copy of Πv considered up to isomorphism” associated to
each vertical line of the log-theta-lattice, and one regards such copies †Πv,‡Πv arising
from distinct vertical lines — i.e., distinct arithmetic holomorphic structures
— as being related by means of some indeterminate isomorphism [i.e., of topological
groups] of the associated quotients †Πv � †Gv,‡Πv � ‡Gv, then one obtains a diagram
[cf. Fig. 3.5 below] in which the spokes — i.e., each of which arises from a vertical
line of the log-theta-lattice — satisfy the property that the diagram is invariant with
respect to arbitrary permutations of the spokes. This diagram of etale-like structures
[i.e., arithmetic fundamental groups] is referred to as the associated etale-picture. In
this context, we remark that the diagram of Fig. 2.2 may be thought of as a sort of
“elementary prototype” of the diagram of Fig. 3.5.
arith. hol.
str. Πv
. . . | . . .
arith.
hol. str.†Πv
. . .
—
mono-
analytic
core ??Gv
|
—
arith.
hol. str.‡Πv
. . .
arith. hol.
str. ∗Πv
Fig. 3.5: The portion of the etale-picture at v ∈ Vnon
The transition from the Frobenius-like structures that play a central role in the def-
inition of the log-theta-lattice — which is reminiscent of the cartesian coordinates
in the elementary geometry of the plane — to the etale-like structures that give rise to
the etale-picture — whose symmetries are reminiscent of the polar coordinates in
34 Shinichi Mochizuki
the elementary geometry of the plane — is referred to, in inter-universal Teichmuller
theory, as the operation of Kummer-detachment and is effected by applying various
versions of Kummer theory. That is to say, this process of Kummer-detachment is
reminiscent of the transition from cartesian to polar coordinates in the classical compu-
tation of the Gaussian integral∫ ∞
−∞e−x2
dx =√π.
In particular, the log-volume computations that constitute the “final output” of inter-
universal Teichmuller theory [cf. Corollary 4.2 below] may be thought of as a sort
of global analogue over number fields of this classical computation of the Gaussian
integral.
The theory of Kummer-detachment developed in inter-universal Teichmuller theory
centers around the Kummer theory that concerns each of the three Frobenius-like
components of the collection of data that appears in the domain of the Θ-link, namely:
(a) the units modulo torsion O×μ
F vat v ∈ Vbad [and analogous data at v ∈ Vgood];
(b) the theta values{qj
2}j=1,...,l�
;
(c) the corresponding global realified Frobenioids.
In general, Kummer theory amounts to essentially a formality [cf. the discussion at
the beginning of [FrdII], §2] — i.e., by considering the Galois action on the various
roots of some element of a multiplicative monoid — once one establishes an appropriate
cyclotomic rigidity isomorphism between the cyclotome [i.e., copy of some quotient
of “Z(1)”] that arises from the multiplicative monoid under consideration and some
cyclotome that arises from the Galois group or arithmetic fundamental group under
consideration. For instance, in the case of the natural Galois action of Gv, for v ∈ Vnon,
on the topological monoid O�F v
, the corresponding Kummer theory is established by
applying the conjugate synchronization arising from the F�±l -symmetry [cf. the
above discussion], together with a certain natural cyclotomic rigidity isomorphism that
arises from a suitable interpretation of the well-known content of local class field
theory [cf. the discussion of [IUTchI], §I2]. On the other hand, the Z×-indeterminacy
[cf. the above discussion] acting on the topological module O×μ
F vof (a) has the effect
of invalidating this cyclotomic rigidity isomorphism in the sense that it renders the
isomorphism “valid” only up to a Z×-indeterminacy. A substantial portion of inter-
universal Teichmuller theory is devoted to overcoming this obstruction to cyclotomic
rigidity by establishing “non-classical cyclotomic rigidity isomorphisms” for the
data of (b), (c). A closely related issue is the issue of decoupling the data of (b),
(c) from the data of (a), i.e., so as to render the data of (b), (c) immune to the
Z×-indeterminacies that act on the data of (a).
A Panoramic Overview of Inter-universal Teichmuller Theory 35
In the case of the global number fields that give rise to the data of (c), this
cyclotomic ridigity/decoupling is achieved, in essence, by suitably interpreting various
well-known facts from elementary algebraic number theory, together with the ob-
servation that Q>0
⋂Z× = {1} — cf. [IUTchI], Example 5.1, (v); [IUTchIII],
Proposition 3.10. The related Kummer theory is performed in the ΘNF-Hodge theater
portion of a Θ±ellNF-Hodge theater, i.e., the portion where the F�l -symmetry appears
[cf. the above discussion]. In the case of the theta functions that give rise to the theta
values that appear in (b), this cyclotomic ridigity/decoupling is achieved by applying
the various ridigity properties of the etale theta function — cf. [EtTh], Introduc-
tion; [IUTchII], §1, §2, §3; [IUTchIII], §2. The related Kummer theory is performed in
the portion of a Θ±ellNF-Hodge theater constituted by a Θ±ell-Hodge theater, i.e., the
portion where the F�±l -symmetry appears [cf. the above discussion]. In this situation,
we remark that when one performs the Hodge-Arakelov theoretic evaluation of the theta
functions involved at l-torsion points whose labels are subject to the bookkeeping appa-
ratus provided by some Θ±ell-Hodge theater [cf. the above discussion], it is necessary
to verify that profinite conjugates [arising from the profinite geometric fundamental
groups that occur in the global portion of such a Θ±ell-Hodge theater] of the various
tempered arithmetic fundamental groups at v ∈ Vbad do not intersect one another in
an “unexpected fashion”. This non-existence of “unexpected intersections” is obtained
by applying a certain technical result [cf. [IUTchI], Theorem B; [IUTchI], §2; [IUTchII],
§2] which follows from the theory of [Semi]. Here, we remark in passing that, relative to
the analogy with the upper half-plane [cf. Fig. 3.4], this non-existence of unexpected in-
tersections is reminiscent of the discreteness in SL2(R) of the various SL2(Z)-conjugates
of the subgroup of upper triangular unipotent matrices(1 ∗0 1
)in SL2(Z), i.e., despite the
fact that the natural induced topology [i.e., induced by the topology of the ambient
complex projective line] on the set of cusps of the upper half-plane [i.e., Q ∪ {∞}] isfar from discrete.
One important aspect of this Kummer theory involving the data of (b), (c) is the
interpretation of the various rigidity properties in terms of the notion of multiradi-
ality, i.e., roughly speaking, as a sort of rigidity with respect to “parallel trans-
port” between spokes of the etale-picture — cf. the discussion of multiradiality in §2;[IUTchIII], Fig. 3.5, and the surrounding discussion. Relative to the analogy with p-adic
Teichmuller theory, this multiradial, or “parallel transport”, interpretation is reminis-
cent of the Frobenius crystal structure on the canonical indigenous bundles of p-adic
Teichmuller theory. Alternatively, from the point of view of the scheme-theoretic Hodge-
Arakelov theory reviewed in §1, this multiradial, or “parallel transport”, interpretation
is reminiscent of the theory of the crystalline theta object discussed in [HASurII],
§2.
36 Shinichi Mochizuki
In this context, we remark that one important consequence of the theory surround-
ing the multiradial interpretation of the rigidity properties of the etale theta function
in the case of the data of (b), together with the closely related F�±l -symmetry, is the
highly distinguished nature of the theta values{qj
2}j=1,...,l�
that occur in (b).
That is to say, the intricate technical apparatus that constitutes the technical backbone
of inter-universal Teichmuller theory would collapse if one attempted instead to work
with, say, arbitrary powers “qN” of q.
Finally, we observe that there is an interesting analogy [cf. Fig. 4.2 below] between
this multiradial intepretation and the classical theory of the functional equation of
the theta function, i.e., “Jacobi’s identity” — cf., e.g., [DmMn], §1.7.5, as well as
[HASurI], §2.1; [EtTh], Remarks 1.6.2, 1.6.3. First of all, we recall that this functional
equation is in essence a formal consequence of the invariance of the Gaussian dis-
tribution with respect to the Fourier transform on the real line [cf., e.g., [DmMn],
§1.7.5]. Moreover, this invariance property is closely related to and, indeed, may be
thought of as a sort of “function-theoretic version” of the computation of the Gaus-
sian integral discussed above [cf., e.g., [DmMn], §1.7.5]. From this point of view, we
observe that just as the computation of the Gaussian integral may be thought of as
being analogous to the log-volume computation that yields Corollary 4.2 below [cf. the
above discussion], the invariance of the Gaussian distribution with respect to the Fourier
transform may be thought of as being analogous to the multiradial description given in
Theorem 4.1 below.
At a more technical level, we observe that the multiradiality of the etale theta
function may be thought of as a sort of “parallel transport” — i.e., a sort of “inte-
gration” — that exhibits the rigidity of the theta functions involved, relative to a
situation in which the units modulo torsion “O×μ” are subject to a mono-analytic
Z×-indeterminacy — i.e., an indeterminacy with respect to an exponentiation oper-
ation on the units with respect to some λ ∈ Z×. This description is reminiscent of the
Fourier transform, i.e., an integral of the Gaussian distribution subject to the effect
of a [non-holomorphic!] real analytic factor constituted by a unit “eit ∈ S1 ⊆ C×”which is subject to an exponentiation operation with respect to some t ∈ R. In the
case of the theory of [EtTh], the desired rigidity properties are proven, in effect, by con-
sidering the quadratic structure of the commutator relation in the theta group [cf. the
discussion of [IUTchIII], Remark 2.1.1]. In the case of the computation of the Fourier
transform of the Gaussian distribution, the essence of the computation consists of an
analysis, by “completing the square”, of the quadratic expression that appears in the
exponent.
From the point of view of the discussion of §1, the theta values{qj
2}j=1,...,l�
that occur in (b) may be thought of as a sort of function-theoretic expression of the
A Panoramic Overview of Inter-universal Teichmuller Theory 37
canonical multiplicative subspace and generators that occur in a neighborhood
of a fixed cusp of the moduli stack of elliptic curves. Then, relative to the scheme-
theoretic Hodge-Arakelov theory of [HASurI], [HASurII], inter-universal Teichmuller
theory provides a means of globally extending [cf. the ΘNF-Hodge theater portion of a
Θ±ellNF-Hodge theater] this function-theoretic expression of the canonical multiplica-
tive subspace and generators [i.e., which is most naturally defined in the Θ±ell-Hodge
theater portion of a Θ±ellNF-Hodge theater]. This state of affairs is strongly reminiscent
of the essential content of Jacobi’s identity, i.e., which asserts that the global extension
to the upper half-plane of the Gaussian expansion of the theta function in a neigh-
borhood of the cusp at infinity of the upper half-plane admits a similar Gaussian
expansion in a neighborhood of the cusp at zero. Here, we recall that often classical
expositions [i.e., such as, for instance, the exposition given in [DmMn], §1.7.5] of thistopic only treat the restriction to the imaginary axis of the theta function on the up-
per half-plane, since this restriction is quite sufficient for the Fourier analysis involved.
[That is to say, if one writes z for the standard coordinate on the upper half-plane,
then the variable “t” of [DmMn], §1.7.5, may be thought of as being related to z via the
equation z = it.] From the point of view of the analogy with inter-universal Teichmuller
theory, the theory of analytic continuation to a neighborhood of the cusp at zero of
the function a priori only defined in a neighborhood of the cusp at infinity may be
thought of as corresponding to the absolute anabelian representation of local and
global arithmetic holomorphic structures afforded by the theory of [AbsTopIII] [cf. the
discussion of [IUTchI], Remarks 4.3.1, 4.3.2; the theory of Aut-holomorphic structures
given in [AbsTopIII], §2; the theory of Belyi cuspidalizations given in [AbsTopII], §3,which underlies the theory of [AbsTopIII], §1]. That is to say, just as the Fourier
transform of the Gaussian distribution induces — i.e., is “intertwined” with — the
“rotation” of the upper half-plane corresponding to the matrix(
0 1−1 0
)[i.e., in the no-
tation of [DmMn], §1.7.5, the transformation t → 1t ], the multiradial description of
Theorem 4.1 below is “intertwined” [cf. the discussion following Theorem 4.1] with
the “rotation” of the additive and multiplicative structures [i.e., the two underlying
arithmetic dimensions] of the arithmetic holomorphic structure associated to a vertical
line of the log-theta-lattice [cf. the discussion of [AbsTopIII], §I4; [IUTchIII], Remarks
1.4.1, 1.4.2].
In this context, it is perhaps of interest to recall that the approach taken in [IUTchI]
to constructing global multiplicative subspaces and canonical generators via prime-strips
[cf. the above discussion!] was originally motivated by an analysis of the [geometri-
cally!] global multiplicative subspaces that appear in p-adic Hodge theory, for instance,
in the case of the local system of p-adic Tate modules associated to the tautological
family of elliptic curves over the moduli stack of elliptic curves over Zp [cf. [HASurII],
38 Shinichi Mochizuki
Remark 3.7]. That is to say, the absolute anabelian approach taken in the context of
prime-strips was originally motivated by contemplation of the essential content of the
operation of tensoring, over Zp, such p-adic Tate modules with p-adically completed
rings that arise as normalizations of local affine rings on the moduli stack in maximal
profinite etale extensions in characteristic zero — a somewhat curious situation in the
sense that the same rings occur both as localizations of the base scheme and as rings
of extended coefficients of etale local systems of Tate modules [cf. [IUTchI], Remarks
4.3.1, 4.3.2; [HASurII], Remark 3.7; [IUTchIV], Remark 3.3.2]. Indeed, the natural
isomorphisms — which constitute the starting point of p-adic Hodge theory! — be-
tween Galois cohomology modules of such rings [which, a priori, are not compatible
with Zariski localization!] and modules of differentials [which are compatible with
Zariski localization!] are somewhat reminiscent of the local/global functoriality of
the absolute anabelian algorithms of [AbsTopIII], §1, as well as of the theory of
Belyi cuspidalizations [i.e., which may also be thought of as a sort of compatibility
of arithmetic fundamental groups with Zariski localization on a hyperbolic curve], which
underlies these absolute anabelian algorithms.
§ 4. Inter-universality and Anabelian Geometry
One important aspect of the log- and Θ-links — i.e., the vertical and horizontal
arrows of the log-theta-lattice — is the property that the assignments that constitute
the log- and Θ-links [cf. the discussion of §3] are not compatible with the ring
structures in the domain and codomain of these assignments. On the other hand,
from the point of view of the classical theory of Galois groups — or, more generally,
etale fundamental groups of schemes — a Galois group is typically regarded not as
some “abstract topological group”, but rather as a group of field automorphisms
Autfield(k)
of some algebraic closure k of a given [say, perfect] field k. In the classical theory of etale
fundamental groups of schemes, a morphism between the schemes under consideration —
i.e., in essence, up to Zariski localization, a ring homomorphism— gives rise, in a natural
fashion, to outer homomorphisms between the corresponding etale fundamental groups
by considering the homomorphism between algebraically closed fields associated [up to
suitable indeterminacies], relative to the geometric basepoints under consideration, to
such a ring homomorphism. That is to say, at least from an a priori point of view, there
is no natural functorial way to define an “induced homomorphism of Galois groups”
arising from an assignment which is not a ring homomorphism. In particular, when one
considers the relationship between the Galois groups in the domain and codomain of
such a non-ring/scheme-theoretic assignment, one must sacrifice the datum consisting
A Panoramic Overview of Inter-universal Teichmuller Theory 39
of the embedding of such a Galois group into some “Autfield(k)”. That is to say, in
this sort of a situation, one must work with the Galois groups involved as abstract
topological groups, which are not equipped with the “labeling apparatus” [i.e.,
on elements of such a Galois group] arising from the action via field automorphisms
[i.e., the embedding into “Autfield(k)”]; it is precisely this state of affairs that requires
one to apply various results in absolute anabelian geometry in order to describe
ring/scheme-theoretic structures on the opposite side of the “wall” constituted by the
log-, Θ-links [cf. the discussion of §3; Theorem 4.1 below]. Such a labeling apparatus
may also be thought of as the labeling apparatus determined by a specific choice of
ring/scheme-theoretic basepoint, or, alternatively, by the universe that gives rise to
the model of set theory that underlies the codomain of the fiber functor determined by
such a basepoint. It is for this reason that we refer to this aspect of the theory by the
term “inter-universal” [cf. the discussion of [IUTchI], §I3].Thus, if one considers the portion of the log-theta lattice, say, at some fixed v ∈
Vnon, then one must contend with, roughly speaking, a distinct universe — i.e., a
distinct labeling apparatus for the sets that occur in the model of set theory with
respect to which one considers some local Galois category at v — at each “•” of the
log-theta-lattice. In a similar vein — cf. the discussion of prime-strips as “sections”
in §3 — one must contend with “changes of universe” when one passes back and forth
between local [i.e., at some v ∈ Vnon] and global Galois categories [i.e., Galois categories
associated to global arithmetic fundamental groups such as ΠXK, ΠCK
].
In this context, it is perhaps useful to observe that in order to regard the correspon-
dence between the “Πv’s” in the domain and codomain of the portion of the log-link at
v ∈ Vnon as the “identity”, it is necessary to identify the copies of F v in the domain
and codomain of the log-link. On the other hand, such an identification may only be
defined if one makes use of the ring structure of F v, which is incompatible with the
Θ-link. Put another way, the domain of the Θ-link can only be considered in a meaning-
ful way if one disables such an identification between the domain and codomain of the
log-link. Thus, in summary, the highly non-commutative nature of the log-theta-
lattice has the effect of obligating one to work with distinct labeling apparatuses
in the domain and codomain of the vertical arrows of the log-theta-lattice.
From the point of view of the analogy between inter-universal Teichmuller theory
and Jacobi’s identity for the theta function discussed at the end of §3 [cf. also Fig.
4.2 below], it is perhaps of interest to note that the labeling apparatus for sets arising
from a universe may be thought as corresponding to the labeling apparatus for the points
of a space [i.e., such as the upper-half plane] provided by a choice of coordinates. In
particular, “changes of universe” may be regarded as corresponding to changes of co-
ordinates. In the context of Jacobi’s identity, the basepoint given by the cusp at infinity
40 Shinichi Mochizuki
gives rise to a choice of coordinates, as well as a canonical multiplicative subspace and
generator of the topological fundamental group [i.e., ∼= Z2] of the tautological family of
elliptic curves over the upper half-plane. The Gaussian expansion of the theta func-
tion may then be thought of as a function-theoretic representation of this canonical
multiplicative subspace and generator, i.e., of the given choice of basepoint. Thus, the
equality of Gaussian expansions consituted by Jacobi’s identity may be thought of as a
sort of “inter-basepoint/inter-coordinate system” comparison between the base-
points/coordinate systems arising from the cusp at infinity and the cusp at zero. From
an elementary computational point of view, this comparison gives rise to a somewhat
startling improvement in computational accuracy [cf. the discussion at the be-
ginning of [DmMn], §1.7.5]! Perhaps it is natural to regard this startling improvement
in computational accuracy as a sort of classical analogue of the somewhat “startling”
numerical consequence constituted by Corollary 4.2 of the following abstract result [cf.
[IUTchIII], Theorem A, for more details]:
Theorem 4.1. (Multiradial Description of the Domain of the Θ-link)
The three Frobenius-like components (a), (b), (c) of the collection of data that appears
in the domain of the Θ-link admit a multiradial description in terms of the [a priori
“alien” ] arithmetic holomorphic structure in the codomain of the Θ-link.
Approach to Compatibility
Component Related cyclotomic Multi-/uni- with log-link/
of the Θ-link symmetry rigidity radiality profinite
topology
(additive) uniradial; or,
(a) units F�±l -symm. local class alternatively, upper semi-
modulo � conj. synch., field theory tautologically compatible/
torsion temp. vs. prof. multiradial up compatible
conjugates to Z×-indet.(additive)
(b) theta F�±l -symm. mono-theta compatible/
values � conj. synch., environments multiradial compatible
temp. vs. prof.
conjugates
(c) global (multiplicative) compatible/
realified F�l -symm. Q>0
⋂Z× multiradial incompatible
Frobenioids = {1}
Fig. 4.1: Properties satisfied by the three components (a), (b), (c) of the Θ-link
A Panoramic Overview of Inter-universal Teichmuller Theory 41
This result may be thought of as the culmination of the abstract theory developed
in [IUTchI], [IUTchII], [IUTchIII], which, in turn, may be thought of, in essence, as
a consequence of the absolute anabelian and Frobenioid-theoretic theory developed in
[Semi], [AbsTopIII], [FrdI], [FrdII], [EtTh].
Inter-universal Teichmuller theory Classical theory
Multiradial interpretation Invariance of the
of rigidity properties Gaussian distribution with respect
of etale theta function to Fourier transform
multiradiality as a sort integration in the
of parallel transport Fourier transform
mono-analytic Z×-indet. real analytic factor given by unit
acting on units modulo tors. “O×μ” subject to exponentiation w.r.t. t ∈ R
Proof of rigidity properties Fourier transform invariance via
via quadratic structure completing the square, i.e., quadratic
of theta group commutator structure in exponent of the Gaussian
{qj2}j=1,...,l� Gaussian expansion
Absolute anabelian geometry Analytic continuation from cusp at ∞applied to “rotation” of �, � to cusp at 0 via “rotation”
(0 1
−1 0
), i.e.,
via log-link t → 1t
Absolute anabelian approach curious “⊗” in
to global mult. s/sp., can. gen. p-adic Hodge theory
local/global functoriality of compatible local isomorphisms between
absolute anabelian algorithms, Galois cohomology, differentials
Belyi cuspidalization in p-adic Hodge theory
Change of universe, i.e., Change of coordinates, i.e.,
labeling apparatus for sets labeling apparatus for points in a space
computation of log-volume of computation via polar coordinates of
mono-analytic containers Gaussian integral∫∞−∞ e−x2
dx =√π
Startling application to Startling improvement in
diophantine geometry computational accuracy
Fig. 4.2: The analogy between inter-universal Teichmuller theory and
Jacobi’s identity for the theta function
At first glance, it may appear as though the log-link is entirely irrelevant to the
content of Theorem 4.1. In fact, however, the theory surrounding the log-link plays a
central role in the construction of the multiradial description of Theorem 4.1. That is
to say, from a naive, a priori point of view, there is no natural way to obtain an action
42 Shinichi Mochizuki
of the “qj2
’s” that appear in component (b) on the multiplicative monoids “O×μ”
that appear in component (a). If, on the other hand, one considers the images via the
maps “logv” [cf. the discussion of §3] of these multiplicative monoids “O×μ”, then one
obtains additive monoids on which the “qj2
’s” that appear in component (b) — as well
as the global number fields that give rise to the global realified Frobenioids that appear
in component (c) — do indeed act naturally. Since, however, the log-theta-lattice is
highly non-commutative, it is a highly nontrivial task to formulate a “multiradial
description” that makes use of the mono-analytic containers [cf. the discussion of
§1] for the data of (b), (c) that arise, in effect, by taking the logarithm of the data of
(a). This highly nontrivial task is achieved by applying the interpretation of the theory
of [AbsTopIII] that is developed in [IUTchIII], §1, §3 — cf. the discussion surrounding
Figs. I.4, I.5, I.6 in [IUTchIII], Introduction. One important aspect of this portion
of the theory is the compatibility of the log-link with the Kummer theory of (a),
(b), (c) that was discussed in §3 [cf. [IUTchIII], Theorem A, (ii)]. In the case of (a),
this compatibility is achieved only in the form of an “upper semi-compatibility”,
that is to say, only at the cost of admitting a certain indeterminacy which, from the
point of view of Corollary 4.2 below, is ultimately reflected in the fact that the log-
volume computation that gives rise to Corollary 4.2 only yields an inequality, i.e.,
as opposed to a precise equality; classical analogues of this inequality may be seen in
the examples that appear in the discussion of hyperbolicity following Corollary 4.2. In
the case of (b), this compatibility follows from the rigidity properties of the etale theta
function, while, in the case of (c), it follows from a suitable interpretation of various
well-known facts from elementary algebraic number theory. In particular, we observe
in passing that, relative to the analogy with Jacobi’s identity for the theta function [cf.
Fig. 4.2], this compatibility may be thought of as corresponding, in the case of (b),
to the “intertwining” of the Fourier transform of the Gaussian distribution with the
rotation(
0 1−1 0
), and, in the case of (c), to the compatibility of this rotation with the
toral symmetries of the upper half-plane that appear in Fig. 3.4. In this context, it
is also important to avail oneself of the compatibility with the profinite topology
of the algorithms that give rise to the various local cyclotomic rigidity isomorphisms
[cf. the discussion of §3] that are applied to the data of (a), (b) [cf. the discussion
surrounding [IUTchIII], Fig. 3.5]. This compatibility is a property that is not shared
by the algorithms that give rise to the global cyclotomic rigidity isomorphisms that
are applied to the global number fields that give rise to the data of (c). The various
properties satisfied by the three Frobenius-like components (a), (b), (c) of the collection
of data that appears in the domain of the Θ-link are summarized in Fig. 4.1 above [cf.
also the discussion of §3]. These properties play a key role in the proofs of the main
results of [IUTchIII].
A Panoramic Overview of Inter-universal Teichmuller Theory 43
From a numerical point of view, the significance of the multiradial description of
Theorem 4.1 is that it yields
two tautologically equivalent ways to compute the log-volume — i.e., the
global arithmetic degree — of the arithmetic line bundle in the codomain of
the Θ-link determined by the [2l-th roots of the] q-parameter qvat the v ∈ Vbad.
We refer to the discussion surrounding Fig. I.8 of [IUTchIII], Introduction, for more
details. The statement of this numerical consequence of Theorem 4.1 [i.e., [IUTchIII],
Theorem A] is given in [IUTchIII], Theorem B. In [IUTchIV], §1, §2, various elemen-
tary computations of the log-volume estimates obtained in [IUTchIII], Theorem B, are
performed. These elementary computations have the following consequence [cf. the
discussion of §1; [IUTchIV], Theorem A]:
Corollary 4.2. (Diophantine Consequences) The ABC/Szpiro Conjecture
holds.
Relative to the analogy between inter-universal Teichmuller theory and p-adic/com-
plex Teichmuller theory [cf. the discussion of §2], these numerical consequences of the
multiradial description of Theorem 4.1 may be regarded as analogous to the following
inequalities, which, in essence, reflect the hyperbolicity of the curve/hyperbolic Rie-
mann surface under consideration [cf. the discussion of “negative curvature” towards
the end of [IUTchIII], Introduction]:
(i) the negativity of the degree of the Hasse invariant associated to a canonically
lifted curve of genus g in p-adic Teichmuller theory
0 > (2g − 2)(1− p)
— i.e., of 1p times the derivative of the canonical Frobenius lifting, a quantity
which may be thought of as a sort of numerical comparison of the lifting modulo p2
[cf. the domain of the Θ-link!] of the curve with the original positive characteristic
curve [cf. the codomain of the Θ-link!];
(ii) the negativity of the integral that appears in the Gauss-Bonnet theorem for a
hyperbolic Riemann surface S of genus g
0 >
∫S
(curvature of the Poincare metric) = 4π(1− g)
— a sort of numerical estimate in terms of real analytic structures such as the
Poincare metric and its curvature concerning the structure of the holomorphic Rie-
mann surface S.
44 Shinichi Mochizuki
Acknowledgements
The author wishes to thank Akio Tamagawa, Fumiharu Kato, Go Yamashita, and
Mohamed Saıdi for countless hours of useful discussions concerning the material dis-
cussed in the present paper. The author feels himself to be particularly indebted to
Go Yamashita and Mohamed Saıdi for their meticulous reading of and numerous help-
ful comments concerning the present paper as well as the following papers: [HASurI],
[HASurII], [Semi], [FrdI], [FrdII], [EtTh], [AbsTopIII], [GenEll], [IUTchI], [IUTchII],
[IUTchIII], [IUTchIV]. The author was also stimulated, in the course of preparing the
present paper, by comments and questions from numerous other mathematicians.
References
[DmMn] Dym, H. and McKean, H. P., Fourier Series and Integrals, Academic Press (1972).
[Lehto] Lehto, O., Univalent Functions and Teichmuller Spaces, Graduate Texts in Mathemat-
ics 109, Springer (1987).
[pOrd] Mochizuki, S., A Theory of Ordinary p-adic Curves, Publ. Res. Inst. Math. Sci. 32
(1996), 957–1151.
[pTeich] Mochizuki, S., Foundations of p-adic Teichmuller Theory, AMS/IP Studies in Ad-
vanced Mathematics 11, American Mathematical Society/International Press (1999).
[HASurI] Mochizuki, S., A Survey of the Hodge-Arakelov Theory of Elliptic Curves I, Arith-
metic Fundamental Groups and Noncommutative Algebra, Proceedings of Symposia in
Pure Mathematics 70, American Mathematical Society (2002), 533–569.
[HASurII] Mochizuki, S., A Survey of the Hodge-Arakelov Theory of Elliptic Curves II, Al-
gebraic Geometry 2000, Azumino, Adv. Stud. Pure Math. 36, Math. Soc. Japan (2002),
81–114.
[CanLift] Mochizuki, S., The Absolute Anabelian Geometry of Canonical Curves, Kazuya
Kato’s fiftieth birthday, Doc. Math. 2003, Extra Vol., 609–640.
[Quasicon] Mochizuki, S., Conformal and Quasiconformal Categorical Representation of Hy-
perbolic Riemann Surfaces, Hiroshima Math. J. 36 (2006), 405–441.
[Semi] Mochizuki, S., Semi-graphs of Anabelioids, Publ. Res. Inst. Math. Sci. 42 (2006), 221–
322.
[FrdI] Mochizuki, S., The Geometry of Frobenioids I: The General Theory, Kyushu J. Math.
62 (2008), 293–400.
[FrdII] Mochizuki, S., The Geometry of Frobenioids II: Poly-Frobenioids, Kyushu J. Math. 62
(2008), 401–460.
[EtTh] Mochizuki, S., The Etale Theta Function and its Frobenioid-theoretic Manifestations,
Publ. Res. Inst. Math. Sci. 45 (2009), 227–349.
[AbsTopI] Mochizuki, S., Topics in Absolute Anabelian Geometry I: Generalities, J. Math.
Sci. Univ. Tokyo 19 (2012), 139–242.
[AbsTopII] Mochizuki, S., Topics in Absolute Anabelian Geometry II: Decomposition Groups,
RIMS Preprint 1625 (March 2008).
[AbsTopIII] Mochizuki, S., Topics in Absolute Anabelian Geometry III: Global Reconstruction
Algorithms, RIMS Preprint 1626 (March 2008).
[GenEll] Mochizuki, S., Arithmetic Elliptic Curves in General Position, Math. J. Okayama
Univ. 52 (2010), 1–28.
A Panoramic Overview of Inter-universal Teichmuller Theory 45
[IUTchI] Mochizuki, S., Inter-universal Teichmuller Theory I: Construction of Hodge Theaters,
RIMS Preprint 1756 (August 2012).
[IUTchII] Mochizuki, S., Inter-universal Teichmuller Theory II: Hodge-Arakelov-theoretic
Evaluation, RIMS Preprint 1757 (August 2012).
[IUTchIII] Mochizuki, S., Inter-universal Teichmuller Theory III: Canonical Splittings of the
Log-theta-lattice, RIMS Preprint 1758 (August 2012).
[IUTchIV] Mochizuki, S., Inter-universal Teichmuller Theory IV: Log-volume Computations
and Set-theoretic Foundations, RIMS Preprint 1759 (August 2012).
See
http://www.kurims.kyoto-u.ac.jp/~motizuki/papers-english.html
for revised versions of the various RIMS Preprints listed above.