Gelfond-Schneider Theorem
Contents |
Theorem
Let $\alpha$ and $\beta$ be algebraic numbers (possibly complex) such that $\alpha \notin \left\{{0, 1}\right\}$.
If $\beta$ is irrational then any value of $\alpha^\beta$ is transcendental.
Proof
Let $\alpha$ be an algebraic number such that $\alpha \ne 0$ and $\alpha \ne 1$.
Let $\beta$ be an algebraic number such that $\alpha^\beta$ is algebraic.
The result will follow if we can show that $\beta \in \Q$.
First we consider now the special case that $\alpha, \beta \in R$ and $\alpha > 0$.
It is enough to have $\ln \alpha \in \R$.
Observe that $\alpha^{s_1 + s_2 \beta}$ is an algebraic number for all integers $s_1$ and $s_2$.
To establish the result, it is enough to show that there are two distinct pairs of integers $(s_1, s_2)$ and $(s'_1, s'_2)$ for which:
- $s_1 + s_2 \beta = s'_1 + s'_2 \beta$
We will choose $S$ sufficiently large and show such pairs exist with $0 \le s1, s2, s'_1, s'_2 < S$.
Lemma 1
Let $a_1 \left({t}\right), \ldots, a_n \left({t}\right)$ be non-zero polynomials in $R \left[{t}\right]$ of degrees $d_1, \ldots, d_n$ respectively.
Let $w_1, \ldots, w_n$ be pairwise distinct real numbers.
Then:
- $\displaystyle F \left({t}\right) = \sum_{j=1}^n a_j \left({t}\right) e^{w_j t}$
has at most $d_1 + \cdots + d_n + n − 1$ real zeroes (counting multiplicities).
Lemma 2
Let $f \left({z}\right)$ be an analytic function in the disk $D \subseteq \C: D = \left\{{z : \left|{z}\right| < R}\right\}$ for some $R \in \R$.
Let $f$ also be continuous on the closure of $D$, that is, on $D^- = \left\{{z : \left|{z}\right| \le R}\right\}$.
Then:
- $\forall z \in D^-: \left|{f \left({z}\right)}\right| \le \left|{f}\right|_R$
Lemma 3
Let $r$ and $R$ be two real numbers such that $1 \le r \le R$.
Let $f_1 \left({z}\right), f_2 \left({z}\right), \ldots, f_L \left({z}\right)$ be:
- analytic in $D \subseteq \C: D = \left\{{z : \left|{z}\right| < R}\right\}$
- continuous on the closure $D$, that is, $D^- = \left\{{z : \left \vert{z}\right \vert \le R}\right\}$.
Let $\zeta_1, \ldots, \zeta_L$ be complex numbers such that:
- $\forall j \in \left\{{1, 2, \ldots, L}\right\}: \left|{\zeta_j}\right| \le r$
Then the determinant:
- $\Delta = \det \begin{bmatrix} f_1 \left({\zeta_1}\right) & \cdots & f_L \left({\zeta_1}\right) \\ \vdots & \ddots & \vdots \\ f_1 \left({\zeta_L}\right) & \cdots & f_L \left({\zeta_L}\right) \end{bmatrix}$
satisfies:
- $\displaystyle \left|{\Delta}\right| \le \left({\frac R r}\right)^{−L \left({L−1}\right) / 2} L! \prod_{\lambda=1}^L \left|{f_λ}\right|_R$
Lemma 4
Let:
- $\Delta = \det \left[{\alpha_{i,j}}\right]_{L \times L}$
where the $\alpha_{i,j}$ are algebraic numbers.
Suppose that $T$ is a positive rational integer for which $T \alpha_{i,j}$ is an algebraic integer for every $i, j \in \left\{{1, 2, \ldots, L}\right\}$.
Also, suppose that $\Delta \ne 0$.
Then there is a conjugate of $\Delta$ with absolute value $\ge T^{−L}$.
Let $c$ be a sufficiently large real number (which will be specified in due course).
Consider integers $L_0, L_1, S$ each $\ge 2$.
Let $L = \left({L_0 + 1}\right) \left({L_1 + 1}\right)$.
Observe that we can find such $L_0, L_1, S$ (and we do so) with:
- $c L_0 \ln S \le L, c L_1 S \le L, L \le \left({2 S − 1}\right)^2$
For example, take $S$ large, and:
- $L_0 = \left \lfloor {S \ln S}\right \rfloor, L_1 = \left \lfloor {S / \ln S}\right \rfloor$
Note that we could take $c = \ln \ln S$.
Consider the matrix $M$ described as follows.
Consider some arrangement $\left({s_1 \left({i}\right), s_2 \left({i}\right)}\right)$ of the $\left({2S − 1}\right)^2$ integral pairs $\left({s_1, s_2}\right)$ with $\left|{s_1}\right| < S$ and $\left|{s_2}\right| < S$.
Also, consider some arrangement $\left({u \left({j}\right), v \left({j}\right)}\right)$, with $1 \le j \le L$, of the integral pairs $\left({u, v}\right)$ where $0 \le u \le L_0$ and $0 \le v \le L_1$.
Then we define:
- $M = \left[{ \left({s_1 \left({i}\right) + s_2 \left({i}\right) \beta}\right)^{u \left({j}\right)} \left({\alpha^{s_1 \left({i}\right) + s_2 \left({i}\right) \beta} }\right)^{v \left({j}\right)} }\right]$
so that $M$ is a $\left({2S − 1}\right)^2 \times L$ matrix.
The idea is to:
- $(1): \quad$ Consider the determinant $\Delta$ of an arbitrary $L \times L$ submatrix of $M$ (any one will do).
- $(2): \quad$ Use Lemma 3 to obtain an upper bound $B_1$ for the absolute value of $\Delta$ (or, more specifically, an upper bound for $\ln \left|{\Delta}\right|$).
- $(3): \quad$ Use Lemma 4 to motivate that if $\Delta \ne 0$, then $\left|{\Delta}\right| \ge B_2$ for some $B_2 > B_1$ (and assume this to be the case).
- $(4): \quad$ Conclude that $\Delta$ must be $0$ and, hence, the rank of $M$ is $< L$.
- $(5): \quad$ Take a linear combination of the columns of $M$ to obtain an $F \left({t}\right)$ as in Lemma 1 with $< L$ roots but with $F \left({s_1 \left({i}\right) + s_2 \left({i}\right) \beta}\right) = 0$ for $1 \le i \le L$.
- $(6): \quad$ Conclude that $\beta$ is rational as required.
We have not specified the arrangements defining $\left({u \left({j}\right), v \left({j}\right)}\right)$ and $\left({s_1 \left({i}\right), s_2 \left({i}\right)}\right)$.
Therefore it suffices to consider $\Delta = \det \left[{f_j \left({\zeta_i}\right)}\right]$ where:
- $f_j \left({z}\right) = z^{u \left({j}\right)} \alpha^{v \left({j}\right) z} \quad (1 \le j \le L)$
and:
- $\zeta_i = s_1 \left({i}\right) + s_2 \left({i}\right) \quad \beta (1 \le i \le L)$
Observe that $u \left({j}\right)$ is a non-negative integer for each $j$.
Also, $\alpha^{v \left({j}\right) z} = \exp \left({v \left({j}\right) z \ln \alpha}\right)$, and we fix $\ln \alpha$ so that it is real.
Hence, $f_j \left({z}\right)$ is uniquely defined.
Then $f_j \left({z}\right)$ represents an entire function for each $j$.
Observe that:
- $\left|{e^{z_1 z_2}}\right| = e^{\Re \left({z_1 z_2}\right)} \le e^{\left|{z_1 z_2}\right|} = e^{\left|{z1}\right| \left|{z2}\right|}$
for all complex numbers $z_1$ and $z_2$.
Hence, for any $R > 0$:
- $\left|{f_j}\right|_R \le R^{u \left({j}\right)} e^{v \left({j}\right) R \left|{\ln \alpha}\right|}$
We use Lemma 3 with $r = S \left({1 + \left|{\beta}\right|}\right)$ and $R = e^2 r$.
Then for some constant $c_1 > 0$, we obtain:
| \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \ln \left\vert{\Delta}\right\vert\) | \(\le\) | \(\displaystyle −L \left({L − 1}\right) + \ln L! + L \max_{1 \le j \le L} \left\{ {\ln \left\vert{f_j}\right\vert_R}\right\}\) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | |||
| \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\le\) | \(\displaystyle −L \left({L − 1}\right) + L \ln L + L L_0 log R + L L_1 R \left\vert{\ln \alpha}\right\vert\) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | |||
| \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\le\) | \(\displaystyle −L^2 + c_1 \left({L L_0 \ln S + L L_1 S}\right)\) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) |
The constant $c_1$ above is independent of $c$.
Therefore, if $c$ is sufficiently large (that is,$ c \ge 4c_1$), then:
- $\ln \left|{\Delta}\right| \le −\dfrac{L^2} 2$
Suppose now that $T'$ is a positive rational integer for which $T' \alpha, T' \beta$ and $T'\alpha^\beta$ are all algebraic integers.
Then $T = \left({T'}\right)^{L_0 + 2 S L_1}$ has the property that $T$ times any element of $M$ (and, hence, $T$ times any element of the matrix describing $\Delta$) is an algebraic integer.
Therefore, by Lemma 4, if $\Delta \ne 0$, then there is a conjugate of $\Delta$ with absolute value:
- $\ge T^{−L} = \left({T'}\right)^{−L L_0 − 2 S L L_1}$
It may be reasonable to expect that a similar inequality might hold for $\left|{\Delta}\right|$ itself (rather than for
the absolute value of a conjugate of $\Delta$).
It will be shown later that if $\Delta \ne 0$, then there is indeed a constant $c_2$ (independent of $c$) for which:
- $(A) \qquad \ln \left|{\Delta}\right| \ge −c_2 \left({L L_0 \ln S + S L L_1}\right)$
By using our upper bound for $\ln \left|{\Delta}\right|$ above, we see that for $c$ sufficiently large ($c \ge 8 c_2$ will do), we obtain that $\Delta = 0$.
Since $\Delta = \det \left[{f_j \left({\zeta_i}\right)}\right]$ as defined above, we get that the columns of $\left[{f_j \left({\zeta_i}\right)}\right]$ must be linearly dependent over the real numbers.
In other words, there exist real numbers $b_j$, not all $0$, such that:
- $\displaystyle \sum_{j=1}^L b_j f_j \left({\zeta_i}\right) = 0$ for $1 \le i \le L$
By considering a particular ordering of the $\left({u \left({j}\right), v \left({j}\right)}\right)$, we deduce that
- $\displaystyle \sum_{v=0}^{L_1} \sum_{u=0}^{L_0} b_{\left({L_0 + 1}\right) v + u + 1} \zeta_i^u \alpha^{v \zeta_i} = 0$ for $1 \le i \le L$
But
- $\displaystyle \sum_{v=0}^{L_1} \sum_{u=0}^{L_0} b_{\left({L_0 + 1}\right) v + u + 1} \zeta_i^u \alpha^{v \zeta_i} = \sum_{v=0}^{L_1} a_v \left({t}\right) e^{w_v t}$
where:
- $\displaystyle a_v \left({t}\right) = \sum_{u=0}^{L_0} b_{\left({L_0 + 1}\right) v + u + 1} t^u, w_v = v \ln \alpha$
and:
- $t = \zeta_i = s_1 \left({i}\right) + s_2 \left({i}\right) \beta$
Each of the $L$ values of $\zeta_i$ is a root of $\displaystyle \sum_{v=0}^{L_1} a_v \left({t}\right) e^{w_v t} = 0$.
Since some $b_j \ne 0$, we deduce from Lemma 1 that there are at most $L_0 \left({L_1 + 1}\right) + \left({L_1 + 1}\right) − 1 < L$ distinct real roots.
Therefore, two roots $\zeta_i$ must be the same, and we can conclude that:
- $s_1 \left({i}\right) + s_2 \left({i}\right) \beta = s_1\left({i'}\right) + s_2 \left({i'}\right) \beta$
for some $i, i'$ with $1 \le i < i' \le L$.
On the other hand, the pairs $\left({s_1 \left({i}\right) + s_2 \left({i}\right)}\right)$ and $\left({s_1 \left({i'}\right) + s_2 \left({i'}\right)}\right)$ are necessarily distinct.
So we can conclude that $\beta$ is rational, completing the proof of Lemma 1.
To complete the proof of when $\alpha > 0$ and $\beta$ are real, all we need to do is to show that if
$\Delta \ne 0$, then $(A)$ holds.
We still assume that $\alpha \ne 1$ and $\alpha, \beta, \alpha^\beta$ are algebraic.
Let $T'$ be a positive rational integer for which $T' \alpha, T' \beta, T' \alpha^\beta$ are all algebraic integers.
Then $T = (T')^L_0 + 2 S L_1$ has the property that $T$ times any element of $M$ (and, hence, $T$ times any element of the matrix describing $\Delta$) is an algebraic integer.
It follows that $T^L \Delta$ is an algebraic integer.
Let $\Delta \ne 0$.
For an algebraic number $w$, we denote by $\left\Vert{w}\right\Vert$ the maximum of the absolute value of a conjugate of $w$.
Then we obtain that:
- $\left\Vert{T^L \Delta}\right\Vert = T^L \left\Vert{\Delta}\right\Vert \le T^L L!S^{L_0 L} \left({1 + \left \Vert{ \beta }\right \Vert}\right)^{L_0 L} \left\Vert{\alpha}\right\Vert^{S L_1 L} \left\Vert{\alpha^\beta}\right\Vert^{S L_1 L}$
where in the last inequality it should be noted that possibly $\left\Vert{\alpha}\right\Vert$ and $\left\Vert{\alpha^\beta}\right\Vert$ are $< 1$.
Alternatively, replace these with $\left\Vert{\alpha}\right\Vert + 1$ and $\left\Vert{\alpha^\beta}\right\Vert + 1$ respectively, and continue as below.
We have that $T^L \Delta$ is an algebraic integer in $\Q \left({\alpha, \beta, \alpha^\beta}\right)$.
So we can deduce that $T^L \Delta$ is a root of a monic polynomial $g \left({x}\right)$ of degree $N$ where $N$ is the product of the degrees of the minimal polynomials for $\alpha, \beta, \alpha^\beta$.
Note that each root of $g \left({x}\right)$ can be made to be a conjugate of $T^L \Delta$.
Since:
- the product of all the roots of $g \left({x}\right)$ has absolute value $\left|{g \left({0}\right)}\right| \ge 1$
and:
- each root of $g \left({x}\right)$ has absolute value $\le \left\Vert{T^L \Delta}\right\Vert$
it follows that:
| \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \left\vert{T^L \Delta}\right\vert\) | \(\ge\) | \(\displaystyle \dfrac {\left\vert{g \left({0}\right)}\right\vert} {\left\Vert{T^L \Delta}\right\Vert^{N−1} }\) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | |||
| \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) | \(\ge\) | \(\displaystyle T^{− \left({N − 1}\right) L} \left({L!}\right)^{−N} S^{−N L_0 L} \left({1 + \left\Vert{\beta}\right\Vert}\right)^{−N L_0 L} \left({\left\Vert{\alpha}\right\Vert \left\Vert{\alpha^\beta}\right\Vert + 1}\right)^{−N S L_1 L}\) | \(\displaystyle \) | \(\displaystyle \) | \(\displaystyle \) |
Hence:
- $\ln \left|{\Delta}\right| \ge −N L \ln T − N L \ln L − N L_0 L \ln S − N L_0 L \ln \left({1 + \left\Vert{\beta}\right\Vert}\right) − N S L_1 L \ln \left({\left\Vert{\alpha}\right\Vert \left\Vert{\alpha^\beta}\right\Vert + 1}\right)$
Recall that $T = (T')^{L_0 + 2 S L_1}$.
Here, $T'$ and $N$ are constants depending only on $\alpha$ and $\beta$.
It follows that $(A)$ holds, and the proof for $\alpha > 0$ and $\beta$ real follows.
$\blacksquare$
Source of Name
This entry was named for Alexander Osipovich Gelfond and Theodor Schneider.
This was Problem 7 in the Hilbert 23.