(Courant Lecture Notes) Emil Artin - Algebra With Galois Theory-American Mathematical Society (2007) PDF
(Courant Lecture Notes) Emil Artin - Algebra With Galois Theory-American Mathematical Society (2007) PDF
(Courant Lecture Notes) Emil Artin - Algebra With Galois Theory-American Mathematical Society (2007) PDF
Emil Artin
Notes by Albert A. Blank
Library o f C o n g r e s s Cataloging-in-Publieatio n D a t a
Artin, Emil , 1 898-1 962 .
Algebra wit h Galoi s theor y / E . Artin , note s b y Alber t A . Blank .
p. cm . — (Couran t lectur e note s ; 1 5 )
ISBN 978-0-821 8-41 29- 7 (alk . paper )
1. Galoi s theory . 2 . Algebra . I . Blank , Alber t A . I L Title .
QA214.A76 200 7
512—dc22 200706079 9
Editors' Note
Chapter 1 . Group s
1.1. Th e Concept of a Group
1.2. Subgroup s
Chapter 2. Ring s and Fields
2.1. Linea r Equations in a Field
2.2. Vecto r Spaces
Chapter 3. Polynomials . Factorization into Primes. Ideals.
3.1. Polynomial s over a Field
3.2. Factorizatio n into Primes
3.3. Ideal s
3.4. Greates t Common Divisor
Chapter 4. Solutio n of the General Equation of nth Degre e
Extension Fields. Isomorphisms.
4.1. Congruenc e
4.2. Extensio n Fields
4.3. Isomorphis m
Chapter 5. Galoi s Theory
5.1. Splittin g Fields
5.2. Automorphism s of the Splitting Field
5.3. Th e Characteristic of a Field
5.4. Derivativ e of a Polynomial: Multiple Roots
5.5. Th e Degree of an Extension Field
5.6. Grou p Characters
5.7. Automorphi c Groups of a Field
5.8. Fundamenta l Theorem of Galois Theory
5.9. Finit e Fields
Chapter 6. Polynomial s with Integral Coefficient s
6.1. Irreducibilit y
6.2. Primitiv e Roots of Unity
Chapter 7. Th e Theory of Equations
7.1. Rule r and Compass Construction s
VI CONTENTS
Beeause what was in 1947 "modern" has now become Standard, and what was
then "higher" has now become foundational, w e have retitled this volume Algebra
with Galois Theory from the original Modern Higher Algebra. Galois Theory.
Jalal Shatah, Executive Editor
Paul Monsour, Managing Editor
August 2007
This page intentionally left blank
http://dx.doi.org/10.1090/cln/015/01
CHAPTER 1
Groups
EXERCISE 2. Dedue e the associative law for fou r element s from (2) , that is,
show that the five possible products o f four element s written i n a given sequenc e
are all equal. Furthermore , attemp t to determine the number of possible product s
of n element s give n i n a linear order . Fo r example, the elements a\, a 2, a^, a^ in
that the order yield the products {aia^ia^), a\{a 2{a^ü6S), etc. Hint: Le t a n b e
the number of products of a\, a 2,..., a n. Fin d a recursion formula fo r a n an d use
the Lagrange generating functio n
f(x) = ot\x + a 2x2 H h a nxn H .
EXERCISE 3. Th e associative law for n elements states that all possible prod-
ucts of n element s written in a prescribed order, e.g., a\, a 2, . . . , a n, yield the same
result. Prov e the associative la w fo r an y number o f element s usin g onl y (2 ) (th e
associative law for three elements).
PROOF FOR EXERCISE 3: W e assume the validity o f the associative la w fo r
all products of m factors, m <n, an d show that this implies the validity of the law
for n + 1 . Consider the particular product (n + 1 ) Yll={ ak whic h is obtained fro m
the n + 1 element s a\, a 2,..., a n+\ b y successively multiplying on the right, i.e.,
n<
k=l
ük = a\,
n+l / n
a a
W k=\Y\ k ]<*n+\-
k=l V= l
Let P n+\ b e any product o f the n + 1 elements a\, a 2,..., a n+\ take n i n that
order. Since P n+\ i s the result of at least one multiplication, we may write
Pn+l = P»P£\, \<m<n,
where P™ i s some product o f the elements a\, a 2i..., a m i n that order an d P^\
of the remaining elements a m + i, a m+2,..., a n+\. B y the induction hypothesi s w e
have
k=ß
for any ß, v such that v — ß + 1 <n. Specifically , w e have
n
/m ^ ^ n+l ^ m V / 1
\
/ a
"+i=(n^)( n *)=n^ \( n ^j-^+ i
\j=l ' ^k=m+l ' j = \ '-\ j k =1
:w+ / J
=
( n a J n a k ) *an+i
\ - = l k=m+l '
nx n+l
( Y\ k) ' a a
n+l = Yl ük>
k=\ / k=\
each step being a simple application of (2) . D
1.1. THE CONCEP T O F A GROUP 3
1.2. Subgroup s
If G i s a group an d 5 i s a subse t o f G tha t i s itsel f a group unde r th e sam e
Operation as G, then S is called a subgroup of G .
EXAMPLE. Tak e G t o b e th e se t o f rationa l number s othe r tha n zer o unde r
ordinary multiplication. G has, e.g., the subgroups
(a) th e positive rational numbers
(b) th e powers of any element
(c) th e set consisting of +1 an d —1
Trivially, (d) the set G itself or (e) the set consisting of the element 1 .
1.2. SUBGROUP S 5
PROOF: Th e cosets aS hav e the same number of elements as S. For let S con-
sist of the distinct elements s\,S2,... ,s n. aS consis t of as\, a$2, ..., as n, wher e
as\ ^ ask, i ^ k.
For otherwise we would have asi = asu and hence s t = Sk, i # k, contrary to the
definition o f the s (.
Consequently, aS consist s o f exactl y n elements . Le t j b e th e numbe r o f
cosets. B y Lemma s 1 . 2 an d 1 . 3 th e cosets cove r G withou t overlapping . I t fol -
lows that
N = jn. •
Take a e G. W e denote aa b y a 2 or , i n general , w e dehn e al l th e integra l
powers a ß o f a by
aß = aa- • • a (/ x times) for /x > 0 ,
a° = e,
aß = a~ la~l • • • a~~l (—/ x times) for \x < 0.
The set of all powers of a is a group and clearly the smallest group containing a.
The problem of determining the smallest group containing as few as two elements
is already of an entirely different nature . For example, what can be said about
(ab)n = ab • ab • • -ab (n times)?
If multiplicatio n i s commutative suc h products ca n b e handled, bu t this does no t
apply in general.
EXERCISE 8 . Sho w that the powers of elements obey the usual properties of
exponents
aßav =a ß+
\
(avY = a Vß
.
The first property implie s th e commutativ e la w fo r multiplicatio n o f power s
of a.
The set S of all powers of a forms a subgroup since S is closed under multipli-
cation and inverses exist (cf. Theore m 1 .1 ) .
We have shown that there are essentially two groups of order 4 and both are com-
mutative.
Groups of orde r 6 are essentially o f two kinds, the cyclic group and the non-
commutative grou p give n i n Exercis e 1 . Thi s las t i s th e simples t exampl e o f a
noncommutative group. On e of the unsolved problems of algebra is that of classi-
fying all the groups of order n. There is, of course, always the cyclic group of order
n and for n prime, only the cyclic group. For nonprimes there is no general theory
although a Classification has been achieved for special cases. The table below gives
a summary for the first few cases:
N 4 6 8 9 10 12 14 15
ß 2 2 5 2 2 5 2 1
V 0 1 2 0 1 3 1 0
where /n is the total numbe r an d v the number o f noncommutativ e group s o f Or-
der N.
EXERCISE 9 . Th e two noncommutative groups of order 8 are essentially:
(a) Th e symmetries of the Square, i.e., the rotations in space which take the
Square into itself.
(b) Th e group formed by the quaternion units ± 1, ±/, ± j, ±k.
Construct the multiplication tabl e for those two groups an d show that they do not
have the same structure.
(a) The symmetries of the Square.
3I f a rotation replaces the vertices (1234) by the
vertices (a\02^3^4), then denote the rotation simply
by ((21 020304) . Th e identity is clearly e = (1 234) .
Denote b y a = (2341 ) th e counterclockwis e ro -
tation throug h 90° . Le t a 2 = b = (341 2 ) an d
c = a 1 = (41 23) . We have a4 = e. Th e powers of
a form a group S of order 4. If s denotes a rotation
of 1 80 ° abou t th e axi s 1 - 3 w e have s = (1 432) .
4 Th e coset sS i s simply
(1432), sa = (21 43 ) = t,
saL = (321 4 ) = 1 1 , sa" = (4321 ) = v;
these together with the powers of a exhaust the symmetries of the Square:
'a b c s t uv
e e a b c s t u V
a a b c e V s t u
b b c e a u V s t
c c e a b t u V s
s s t u V e a b c
t t u V s c e a b
u u V s t b c e a
V V s t u a b c e
1.2. SUBGROUP S 9
CHAPTER 2
+ Even Od d X Even Od d
Even Even Od d Even Even Eve n
Odd Odd Eve n Odd Even Od d
If "Even" is replaced by the number 0 and "Odd" by the number 1 , these tables are
the same as for ordinary addition and multiplication, together with the special rule
1 + 1 =0 .
I
aO = 0
ab + a(—b)
or ab + a(—b) — 0 . Therefore a{~b) = — ab. B y a similar proof
{—a)b = —ab.
From the combination of these results we have
{-a){-b) = -((-a)b) = -(-(ab)) = ab.
In som e o f th e literatur e a set T satisfyin g (I* ) an d (II ) i s calle d a ring and
mention is made of "associative rings," i.e., rings which satisfy th e postulate
(III) a,b,c £T = * a(bc) = (ab)c.
We adopt a more customar y usag e an d dehn e a ring to be a set whic h i s a n
"associative ring" in the sense above. A ring, then, is a set, closed with respect to
addition an d multiplication , tha t is a commutative grou p wit h respect to additio n
and obey s th e distributive la w of multiplication ove r additio n an d th e associativ e
law of multiplication.
2.1. LINEA R EQUATIONS IN A FIELD 13
ümlLl = 0
obtained by "eliminating" the variable x\ fro m the last m — 1 equation s in [2]. Any
Solution o f [2 ] is obviously a Solution o f [3] . Conversel y an y Solutio n o f [3 ] is a
Solution of [2 ] since the Solution must satisfy L\ = 0 . I t suffices t o show that [3 ]
has a nontrivial Solution.
The system of equations
L 2 - Ü2\L\ = 0
[31
Lm Qm\'- J
\ 0
is essentially a system of m — 1 equations in the n — 1 unknowns x 2, x^, ..., x m.
From the induction assumptio n thi s system possesses a nontrivial Solution . Usin g
this Solutio n w e complete th e Solutio n o f [3 ] by substitutin g th e firs t equatio n t o
obtain x\. Th e proof of the theorem is in no way changed when the coefficients ar e
multiplied on the right. D
EXERCISE 2. Prov e by an induction similar to that of Theorem 2.1 :
THEOREM 2.2 A system ofn equations in n unknowns,
L\ = a\\X\ + ai2* 21
H - ain*n = b\
L2 = 021 * 1 + <222* 2 H h a 2nXn = b 2
has a Solution for any choice of b\, b 2, • • •, bn e F if and only if the System of
homogeneous equations
[ L\— a\\X\ + fli2*2 H V a\ nxn = 0
L2 = a 2\X\ + a 22x2 H h a 2nxn = 0
/ f e ) = ßi
xt ^ Xj for i i=- j
f(xn) = ß n,
where x\, x 2,..., x n, ß\, ß 2,..., ß n ar e an y preassigne d numbers . Thi s follow s
from the fact that the system
/(*i) = 0 ,
f(x2) = 0 ,
/(*») = 0,
of homogeneous linear equations has only the trivial Solution since no polynomial
of degree less than n can have n distinct roots.
(a) W e have
aO + aO = a(0 + 0) = aO.
Adding — aO on both sides we obtain
a0 = 0 .
(b) Similarly ,
0A + 0 A = ( 0 + 0) A = 0A .
Adding — OA on both sides gives
0A = 0 .
(c) Fro m ( 1 — 1 ) A = O A = 0 we have
lA + ( - l ) A = 0 ,
or (—1)A is the inverse of A, i.e.,
(-1)A = - A .
From these results we can prove that aA = 0 implies either a = 0 or A = 0 .
If a i=- 0 then from a A = 0 we have
a~l(aA) =0 = (a~ la)A = A .
Hence A = 0 .
The fi vector s Ai , A 2 , . . ., A n ar e said to be linearly dependent i f there exist
x\, *2 , • •.> xn € F wit h not all x; = 0 such that
[1] xi Ai + x 2A2 H h x„An = 0 .
Take n = 1 . A vector Ai i s said to be linearly dependent if there exists an x 7 ^ 0
in F suc h that x A = 0 , i.e., if A = 0 . I f the vector is not zero it is independent .
Assume that [1 ] holds for nontrivia l X(. The n we have, say, xn ^ 0 . I t is possible
to write
An = -x~ lxiAx - x~ 1 x2A2 x~ l
xn-\An-i.
A sum of the form
c\A\ +c 2A2-i h
is called a linear combination of the vectors A\, A 2,..., A n. Th e Statement that n
vectors are linearly dependent i s equivalent to the Statement that one of them is a
linear combination of the others.
The dimension o f a vector spac e V i s the maximum number of linearly inde-
pendent vectors in V . I f no such maximum exists the dimension of V is said to be
infinite.
EXAMPLE. Th e polynomials for m a vector spac e over the field of real num-
bers. I n particular, the polynomials 1 , x,..., x n ar e linearly independent. Clearl y
the dimension of the vector space of all polynomials is infinite.
The definitio n give s n o hin t o f a way t o obtai n th e dimensio n o f an y give n
vector space. In order to attack this problem we introduce
2.2. VECTOR SPACE S 17
7=1
where
I L\ = X\Ü\\ + x 2a2\ + • • • + x mam\
L2 = Xi<2i 2 + X2«2 2 + • ' * + X mam2
[3]
It suffices t o find nontrivial x; tha t make all L/ = 0 . Since m > n , the system L; =
0 of n equations in m unknowns has a nontrivial Solution according to Theorem 2.1 .
It follows tha t there are je,-, not all of them zero, such that [2 ] holds and therefor e
the theorem is proved. D
COROLLARY If V is a vector space in which all vectors are linear combinations
ofn given vectors, then the dimension ofV is less than or equal to n.
The vector space V i s said to be spanned by the vectors A\, A 2,..., A n €V
if every vector B e V is a linear combination of the A t.
THEOREM 2.4 If V is spanned by n linearly independent vectors, then the dimen-
sion Ny ofV is precisely n.
By the corollary t o Theorem 2.3 we have Ny < n. Bu t there exist n linearl y
independent vector s (e.g. , A\, A 2 , . . . , A n) i n V an d Ny i s the maximum numbe r
of independent vectors in V. Consequently , Ny > n. Therefor e Ny = n.
THEOREM 2.5 IfVisa vector space offinite dimension n, then there are n linearly
independent vectors in V which span the space.
PROOF: I f n i s the dimension o f V , then V contain s a set of n independen t
vectors; cal l the m A\, A 2,..., A n. Le t B b e an y vecto r o f V. Th e n + 1 vec-
tors B, Ai , A 2 , . . . , A n ar e linearly dependent sinc e n is the maximum number of
independent vectors. Thus there are xt e F , not all zero, such that
x0B + x\A\ H h x nAn = 0 .
18 2. RINGS AN D FIELD S
C/„ = ( 0 , 0 , . . . , 0 , 0 , 1 ) ,
are linearly independent sinc e
n
Y2CiUi = ( c l> c 2, • . . , Cn )
i=\
is not zero unless all the ct are zero. Furthermore , the n vectors span V n since any
vector (c\, c2,,.., c n) € V n can be written as a linear combination YH=i ci Ui• Th e
result follows from Theorem 2.4.
EXERCISE 5. Sho w that any vector field of finite dimension n over F i s iso-
morphic t o V n. B y "V is isomorphic t o V„ " we mean tha t V ha s essentially the
same structur e a s V n. I n other words , to each elemen t o f one space ther e corre -
sponds an element of the other which behaves in exactly the same manner unde r
2.2. VECTO R SPACE S 19
the Operations among vectors. Thi s concept will be dealt with later in a more pre-
cise manner.
In V n, consider the equations
[1] x\A\ + x 2A2 H h xnAn = B.
Setting the components on one side equal to the corresponding components on the
other, w e obtain n linea r equation s i n n unknown s a s in Theore m 2.2 . Equatio n
[1] has a Solutio n fo r al l B e V n i f an d onl y i f th e A / ar e linearl y independen t
and therefore spa n V n. But this is equivalent to the assertion that the homogeneous
equation
x\A\ + x 2A2 H h xnAn = 0
has only the trivial Solution, all Xf = 0 . I n terms of the components this is exactly
the Statement of Theorem 2.2.
This page intentionally left blank
http://dx.doi.org/10.1090/cln/015/03
CHAPTER 3
v b %v +b
J2®vx +Y^ v = X^ ^xV-
v=0 v=0 v=0
From this definitio n i t follows a t once that th e power serie s for m a commutativ e
group under addition with the zero element YlT=o ® ' %v-
All fields are assumed hereafter t o be commutative unless the contrary is stated.
21
22 3 . POLYNOMIALS. FACTORIZATION INT O PRIMES . IDEALS.
EXERCISE 1 . Sho w that the polynomials are a subgroup of the group of power
series under addition.
The product of two power series is defined by
00 00 OO
7 CyX 7 @>ßX — 7 C nX
with
n
&n = / J Cv&ß =
/ J CyCln—v
v/tx> 0
By proving the distributive law of multiplication ove r addition and the associative
law o f multiplication , w e now sho w tha t th e se t o f powe r serie s ove r F form s a
ring.
The distributiv e la w follow s fro m th e linearit y o f th e produc t an d fro m th e
distributive law for the field elements. We prove this in general. Le t {a n} and {b n}
be two sequences of elements in F. Defin e the product {a n} -{bn} = d n to be linear
in the a's an d b's bu t otherwise arbitrary . Thu s d n i s of the form d n = Ha^üibj
with ciij e F. Consequently ,
kn}[{^} + {b n}] = {d n} with dn = Y^OtijCiiüj + bj) = Y^OiijCiüj + OtijCibj
or
{dn} = {c n} • {a n} + {c n} • {b n}.
EXAMPLES. Th e product of vectors in physics.
The scalar product is a • b = a\b\ + 0^ 2 + #3^ 3 and is therefore distributive .
The vector product a x b has components of the form ±(a tbj — ajb t) an d hence is
distributive.
The associative law follows immediately fro m
n—O \i6+v= w /
= m
E( E <vVpJ* .
Since the result is symmetrical, it is independent of the placing of the parentheses.
This completes the proof that the power series forms a ring.
It is a simple matter to enlarge this ring to a field. First we note that the ring
already has a multiplicative identity, namely
2
1 + 0 - J C + 0-JC + -.-.
To obtain inverses with respect to multiplication, w e have only to include all ele-
ments of the form
VJ a nxn, m > 0.
3.1. POLYNOMIALS OVER A FIELD 2 3
v-\-ß—n v=—oo
k=0
w+1
= J2äk(xf.
Since computation with the barred symbols is essentially the same as eomputation
with the polynomials, the bar may be omitted withou t danger of confusion. Thu s
we have created new symbols for the polynomials fo r whic h the signs of additio n
and multiplication have meaning.
A polynomial ao + a\x + • • • + a nxn ca n be used to dehne the function f(x)
which assigns to any c e F an d f(c) e F where f(c) — ao + a\c + • • • + a ncn.
EXERCISE 4. I f f(x), g(x) ar e polynomials and c e F, sho w that
f(x) + g(x) = h(x) = » f(c) + g(c) = h(c)
and
3.3. Ideal s
Consider a ring R. A subset of R is called an ideal 21 if
(a) 2 1 is a group with respect to addition
„. a e 2t ] . nr
(b) b€R\=>abe*.
THEOREM 3.1 In the ring of integers there are no other ideals than those consist-
ing ofthe multiples of a given integer and the set consisting ofzero ahne.
PROOF: Le t 21 be an ideal in the ring of integers.
Case 1 . 2 1 consists of zero alone.
Case 2. Ther e is a nonzero a e 21 . If a < Othen — l-a = — a > Oand — a e 21.
Thus if an ideal contains nonzero elements it also contains positive elements. From
the set of positive integers in 21 take the least and call it d. B y (b) every multiple of
d i s an element of 21 . We prove that 21 is precisely th e set of multiples of d. Tak e
any a e 2t . By the division algorithm we have
a = qd + r , 0 < r < d.
3.4. GREATEST COMMO N DIVISO R 27
where x/, >v, z € iv . 2 1 is a principal ideal by postulate (4) applied to /?; therefore
2t consists of the multiples of a Single element d. W e now write
% = dR = a tR + a 2R H h a„£;
that is,
I a i s a multiple of <i, and
I a is expressible in the form [1].
3
The sum of tw o set s denoted b y S + T i s the set of elements s + t wher e s e S, t e T. Th e
union (o r logical sum) of the two sets is denoted differently b y S U T.
3.4. GREATEST COMMO N DIVISO R 29
or
(ep2)--Pr =q2'-q r.
The theorem follows by induction. D
It is conceivable that there are elements which possess no decomposition int o
primes. In other words, an element might be factored i n such a way that nonprimes
are include d i n th e factorizatio n n o matte r ho w fa r th e proces s i s carried . Fo r
integers an d polynomials there is no such danger sinc e the number of elements in
3.4. GREATEST COMMON DIVISOR 31
the produet is limited by the "magnitude" of the element being factored. However ,
the result is true in general and therefore ever y elemen t o f R possesses a unique
factorization int o primes.
LEMMA Let a\, 0,2, . .. G R be a sequence ofnonzero elements such that a; + i|a;
for all i. Then all the at from a given element ort are equivalent.
PROOF: Le t 21 be the set of all multiples of the <z/. 21 is an ideal; for take any
a,b G 21 . We have
a G 2 t O a = ajC,
b G 21 O b = ajd.
Assume i > j, say. The n a r |a ; . Therefor e Bs e R with b = a ts. Hene e a ± b =
ai (c ± s); 2t is closed with respect to multiplication. Furthermore , 21 is closed with
respeet to multiplication by elements of R since for r G R, a • r = a t(cr). 2 t is a
principal ideal by postulate (4) and hence there is a d e R such that 21 = dR. Thu s
d • 1 G 2t and d is in 21. Therefore there is an an which divides d. Consequently ,
&n, ß „ + i , *W2 > . . . \d.
But ai\ai = > al• e 2 1 => a t G dR. Henc e d\a n, a„+i, ««+2, • • • • We have proved
that all the a\ for / > n are equivalent to d. D
THEOREM 3.4 Every a G /? w either zero, a unit, a prime, or a produet of primes.
PROOF: Suppos e a is none of these, i.e., a ^ 0 , af 1, and a is neither prime
nor a produet o f primes. Sinc e a is not prime i t can be expressed a s a produet
bc = a wher e neithe r b nor c is equivalent t o a. Clearl y i ^ 0 , c / 0 . I f c
and & wer e each eithe r a unit, a prime, o r a produet o f primes, the n a would be
in one of these categories. Thi s possibility is mied out . I t follows tha t one of the
divisors, say b, has the same property as a. But this reasoning could be carried out
indefinitely t o give a sequence of elements satisfying th e hypothesis of the lemma
but for which the terms do not eventually becom e equivalent. Thi s indirect proof
establishes the theorem. D
We have proved that every element can be factored uniquel y into primes. Sup-
pose a has the factorizatio n
a = pip2-- p r
where the pt ma y be the same or distinet. I t is possible that the same prime and
its equivalent s ma y appear mor e tha n onc e i n this expression . I f all equivalent
elements are taken together we may write
Vi Vi y cn
CHAPTER 4
4.1. Congruenc e
Consider th e notatio n a = b. Th e sig n o f equalit y mean s tha t a an d b ar e
merely two ways of writing the same element. In other words, the symbols a and b
are interchangeable in any discussion. We have already considered relations which
are somewhat like equality i n this respect. Fo r example, in the preceding sectio n
"a is equivalent to fc" means that a and b are interchangeable in any discussion of
divisibility properties. Let us investigate relations of this kind in somewhat greater
generality.
Assume we are given a set S of elements a,b,c,... . A relatio n
a= b
(read: "a congruent fc")between two elements of S is called congruence (or equiv-
alence or similarity) if it satisfies the postulates
(A) a = a (reflexivity) ,
(B) a = b => > b = a (symmetry) ,
(C) a = b, b = c => a = c (transitivity) .
ExAMPLES. A relation nee d not satisf y an y o f th e postulates. Fo r instance ,
let S be the set of human beings with the relation "a loves bV Ever y day "a love s
a" i s violated by some suicide. Furthermore , " a love s fc" is nonsymmetric a s any
reader of novels can teil. True, an argument can be made in favor of the transitivity
of thi s relation unde r th e principle o f "Lov e me , lov e my dog"—bu t th e logic i s
dubious. Fo r a se t o f peopl e gathere d i n a pitch dar k room a t a seanc e we hav e
the relation "a can see b" which vacuously satisfie s th e last two postulates, but not
the first. A more orthodox example is the relationship "a approximates fc" among
the real numbers. If we understand this to mean that the difference betwee n a and
b lie s within som e give n limi t o f error , w e se e then thi s relation i s reflexive an d
Symmetrie but not transitive. A relation which violates only the Symmetrie law is
"a < b" i n th e se t o f integers . W e have show n b y th e las t thre e example s tha t
the postulates o f a congruence relatio n ar e independent; i.e. , n o postulate ca n b e
derived logically from the other two.
By means o f the congruence relation th e elements o f S ca n be classified int o
nonoverlapping "species. " For define S a a s the set of all s e S such that s = a. I f
Sa and Sb overlap at all they are completely identical. For suppose 3c e S such that
33
34 4. RESIDUE CLASSES , EXTENSION FIELDS , AND ISOMORPHISM S
and
By means of this new congruence relation we may now define a n ideal So , the set
of all elements a = 0. But
aeSoOa = 0<^a-0 = ae^i.
Clearly, the speeification o f a eongraenee relation of this type and the specificatio n
of an ideal are completely equivalent .
The congruence defined i n R by means of the ideal 2t is denoted by
a = b (modSl).
The classes S ö, S&,... are called the residue classes (mod 21). In a principal ideal
ring, 21 consists of the multiples of one element d. I n that case we use the notation
(mod d) instea d of (mod 21).
ExAMPLES. Conside r the congruence defined in the set of integers by the ideal
consisting of the multiples of 7
a = b(modl) =»7|(a-fe) .
Thus a = b means a = Im + b: A n intege r i s congruent t o its remainder afte r
division b y 7 . Th e ring o f integer s i s spli t thereby int o the seve n residue classe s
So, S\,..., Sg . These classes are the elements of a commutative ring. We have, for
example, S2 + S 4 = SO , S2 • S4 = S\ 9 S3 + S 5 = S\. I t is convenient to omit the S' s
and denote the elements o f the ring by the subscript s 0, 1 , . . . , 6 alone. Th e ring
contains a multiplicative identity 1 . We further not e that all nonzero elements have
inverses:
dement 0 1 2 3 45 6
inverse 1 45 23 6
Thus the residue classes (mo d 7 ) form a field. Linear equations may be solved in
the usual way; if 3x — 4, then x = 3 - 1 4 = 5- 4 = 6 . Quadrati c equations can be
solved by completing the Square; thus
x2 + x + l=0= (x + lj) + ^ = ( j t + 4 ) 2 - l = 0 ,
The residue classes can therefore be described in terms of only two types, Sa where
a e F , an d the class S x. Al l others ca n be obtained fro m thes e by addition s an d
multiplications.
38 4 . RESIDUE CLASSES , EXTENSIO N FIELDS , AND ISOMORPHISM S
so that the computation wit h the elements o f F i s in no way different fro m com -
putation with the elements of F. W e have shown that F i s isomorphic to F. 1 No w
consider the equation
p(x) = a o + a\x + • • • + a nxn = 0.
The corresponding equation with coefficients i n F is
Sa0 + Sa j X + • • • + S ünX = So-
This possesses the Solution X = S x i n F, for we have
n
\*0 "I " ^ Ö I ^ X + * * * + b anöx = o fl0 + aiJC _| \-a nx — Sp(jc ) = oo -
If 50^ ) = S^x), i t follows tha t p(x)\[(p(x) — \/r(x)] and hence (p(x) — \//(x) = 0
or 4>(x) — \jf{x).
EXAMPLE. a + bi = c + di => a = c, b — d. Th e sum S«^) + S ^ ) =
S(p(x)+i/f(x) is at once in the prescribed form sinc e d[(j)(x) + xl/(x)] < n.
For the product, however, the result is not so simple. We may write
(j)(x)\jf(x) = q(x)p(x) + r(x).
This yields
S(f)(x) ' S\ff(x) = S r(x).
Only in those cases where 9[000 • if(x)] < n is S^).^*) immediately an element
of the prescribed form .
We have shown that the elements of E are expressible in the form
^c 0 +cix+---+c n _ix n - 1 = c
0+ C
\SX + ' ' ' + C n-\SX
Two such elements are equal if and only if corresponding coefficients ar e equal.
EXAMPLE. Le t F b e the field of integer s (mo d 7). Th e equation p(x) =
x3 — x + 2 = 0 has no Solution in F. Consequently , sinc e p(x) i s of third degree
it is irreducible; for any factorization o f p(x) woul d have to contain a linear factor.
(On the other hand a fourth-degree polynomia l might have two quadratic factors. )
The extension field E consists of all elements a + ba + ca 2 where a,b,c e F and
p(a) = a 3 - a - 2 = 0.
We have a3 — a + 2, a4 — a 2 + 2a, a5 = a 3 + 2a 2 = 2a 2 + a + 2 , . .. . In this
manner any power of a—and henc e any polynomial in a—can b e reduced to one
of degree not greater than two.
Let F be a field, E a n extension o f F. Suppos e ot e E. W e distinguish two
cases:
Case 1 . Ther e is no nonzero polynomial over F which has a as a root.
EXAMPLE. Tak e F to be the field o f rational numbers. Th e real number e =
2.718 . .. i s the root of no polynomial with rational coefficients. I f a is an element
of this type it is said to be transcendental with respect to F.
(Lemma 4.3) , o r
(f){x) = y/(x) (mo d p(x)).
Conversely, i f </)(x) — \jr{x) = p(x)q(x), the n
(j)(a)-i/(a) = 0.
Thus w e hav e prove d
(p(a) = \//(a) O <f>(x) = xj/(x) (mo d p(x)).
The rule s fo r additio n an d multiplicatio n o f thes e element s ar e obviousl y th e
same as for polynomials. Thu s we see that the set consisting of the elements 4> («) i s
isomorphic t o the se t of the residue classe s (mo d p{x)) o f the polynomials ove r F.
4.3. ISOMORPHISM 41
4.3. Isomorphis m
The notion of isomorphism has already been touched upon here and there in the
text. We have mentioned an "essential sameness" of two mathematical Systems . It
has been implied that two Systems which are isomorphic differ in no important way;
Operations on the elements of one are "the same as" Operations on the elements of
the other. Th e purpose of this section is to replace this descriptive terminology by
a precise formulation .
Implicit i n the idea o f th e "essentia l sameness " of tw o set s is the knowledg e
that each element of one set has an "image" in the other. Specifically, conside r two
sets S and T. Th e set S is mapped into the set T if to each s e S there corresponds
ar e T, th e image of s. Th e Statements "S i s mapped into T" an d "f i s the image
of s" are denoted by
S — > T an d s — > t,
respectively. A mapping is nothing more than a single-valued functio n wit h argu-
ments in the set S and values in the set T. W e could have written t = f(s) instea d
of s - > t.
If ever y elemen t o f T i s a n imag e fo r som e elemen t o f S , w e sa y tha t S i s
mapped onto T an d write S — > T.
EXAMPLE. Fo r the set S take a group G. Le t the elements of T b e the cosets
of som e subgroup H c G. B y means of the mapping f(x)=xH,G i s mapped
onto the cosets of H. Fo r an isomorphism we require more, as this example shows.
Different element s of S should have distinct images. Thus if s\ ^ S2
From a mapping 5 — > T of this kind we can derive an inverse mapping T — > S.
For any t e T ther e is a Single s e S suc h that s— > t. Fo r the inverse mappin g
take t -+ s. Th e mapping has furnished a method of pairing off th e elements of S
and T. I n other words there is a one-to-one correspondence between the elements
of S an d th e element s o f T. I n thi s cas e w e sa y tha t th e mappin g i s 1 - 1 (read :
"one-to-one" for "1 -1 ". )
EXAMPLES. Th e ordinary Photographie imag e of a three-dimensional objee t
does not provide a 1 - 1 mapping , if , say , the objeet i s transparent. Tak e p(x), a n
irreducible polynomial over a field F, and let a be one of its roots in some extension
field E D F. Fo r S take the se t consisting o f al l the elements o f E whic h deriv e
from additio n an d multiplicatio n o f a wit h th e element s o f F\ S i s th e se t o f al l
polynomials (p(a) over F. Fo r T take the set of residue classes T^ X) wher e \j/{x) e
T^x) mean s i/(x) = (f>(x) (mo d p(x)). W e have shown that
(j)(a) = \l/(a) => 4>{x) = yjr{x) (mo d p{x)).
From this Statement it is easy to see that there is a 1-1 correspondence between the
elements of S and T, namely , </>(cc) ^ > T^x). Furthermore , we have
42 4 . RESIDUE CLASSES, EXTENSION FIELDS, AND ISOMORPHISM S
and a simila r resul t hold s fo r multiplication . Thu s th e sum (or product) o f the
images of two elements is the image of the sum (or product) of the elements. This
is what is meant by the essential sameness of two fields.
What do we mean when we say that two mathematical System s have the same
structure? Before we answer this question it is necessary to specify wha t we mean
by a mathematical System . A mathematical System is concerned with fundamenta l
elements of various classes Si, S2,... . Everything eise is defined in terms of these
elements. I n analysis , th e fundamental element s ar e real numbers ; i n geometry,
points, lines , planes, Fo r simplicity, le t us assume tha t the fundamental ele -
ments are all of one dass S. Relations are defined for the elements of S. A relation
R(x\, X2, .. • , xn) i s a Statement involving the elements x\, x 2,..., x n. W e do not
mean to imply by this notation tha t the number of elements in a relation is finite.
For example, the Statement that a sequence of real numbers has a limit is a perfectly
good relation.
To write a relation for specific elements does not mean that it is true.
EXAMPLES. I f S i s the set of integer s an d R(x\,x 2) mean s x\ = x 2 the n
R(5, 5 ) is true bu t R(5, 7) is not. Labe l th e vertices o f a Square in accord wit h
the diagra m o n page 8 . Defin e R(x, y) t o mean tha t th e vertices x, y ar e adja -
cent. /?(1 , 3) i s false , bu t /?(1 , 4) i s true . A n Operatio n ca n be considere d a s
a relatio n connectin g thre e elements . Fo r example, th e Operation o f multiplica -
tion can be considered completely in terms of the relation R(a,b,c) whic h means
a-b = c. The relations of a mathematical System are defined by their special prop-
erties. Thes e ma y be clumsy t o write down. Fo r example, th e special propertie s
of multiplication in a group are given by the postulates on page 3. Using the nota-
tion abov e we see that the third postulate gives the property R(e, a, a) is true. A
mathematical System, then, consists of elements and relations defined among these
elements.
Two mathematical System s S and T ar e said to be isomorphic if there is a 1-1
correspondence betwee n the elements an d relations o f S and T suc h that truth of
a relatio n i n one System implie s trut h o f the corresponding relatio n i n the other
System, and falsity o f a relation in one System implies falsity o f the corresponding
relation in the other.
EXAMPLES. Fo r both S and T take the set of real numbers. Let R(x, y) be the
relation x < y for S. For the corresponding relation R r{x', y f) i n T take x' > y''. S
can be mapped isomorphically on T by the transformation x' — — x.
EXERCISE 4. Le t S be the set of vertices of the cube with the relation R(x, y),
x an d y hav e a n edge in common. Sho w tha t thi s i s isomorphic t o the set T of
faces of the octahedron where the relation R'(x', y r) means that the faces x', y' are
adjacent. (Se e Figure 4.1.) Label the faces of the octahedron accordingly .
For the projective plan e take the relation R(P,l) t o mean that the point P is
on the line l. T o set up an isomorphism between two planes, use central projection
from a point outsid e both. R(P,l) <& R(P',l f). Fo r ordinary Euclidea n plane s
only parallel projection wil l give an isomorphism.
4.3. ISOMORPHIS M 43
FIGURE4.1
the rational numbers (Dedekind cut). Since the rational numbers remain fixed, it is
clear that each element can go only into itself. The only automorphism is identity.
The field of complex numbers, on the other hand, has at least one nonidentical
automorphism, a + bi o a — bi. I n fact, the set of automorphisms of the complex
number field has the cardinal number
22*0.
Let p{x) b e an irreducible polynomial ove r a field F, an d E b e an extensio n
field of F whic h contains a root a of p(x). W e denote the smallest extension field
between E an d F by F(a), E c F(a) c F. I t has been demonstrated (p . 41) that
F(a) i s isomorphic to the field of residue classes of polynomials mo d p(x) unde r
the mappin g (j)(a) <- > 0(JC) . I f d[p(x)] = n + 1 , it i s unnecessar y t o conside r
polynomials of degree greater than n (by the proof on p. 38). Thus any element of
F(a) ca n be written in the form
Co + c\ot H V c n an
where Co , c i , . . ., c n e F. Th e su m of two suc h elements i s a t once o f th e sam e
form. Th e product can be handled by the method of the followin g
EXAMPLE. Th e polynomial p(x) = x 5 — x — 1 is irreducible ove r the field
R o f rational numbers. Henc e if a i s a root of p(x), al l elements of R(a) ma y be
written as above:
Co + c\(x + - • • 4- C4G? 4.
The product o f tw o suc h elements i s a polynomial i n a o f degree < 8 . I t can be
reduced to the prescribed form by means of the rules
a5=:l+a, a 6
= a + a 2, a 7
= a 2 + a 3, a 8
=a3+a4.
This method is applicable to any irreducible polynomial.
The onl y difficult y i n th e demonstratio n o f th e field properties o f F(a) lie s
in writing the quotient ^(a)/x//(a), x//(a) 7 ^ 0, as a polynomial o f the prescribed
form. Pu t
(f)(a)/\lf(a) = Co + C\OL H h c nan,
where the coefficients co , c\, . .., c n ar e to be determined. We write
(j)(a) = xl/(a)(c 0 + C\OL H h c nan).
If the right side is reduced according to the rules for multiplication of two elements,
we obtain
<p(a) = L 0 + L\OL H h L nan,
where Lo , L i , . . ., L n ar e linear combination s o f th e c t. Equatin g correspondin g
coefficients w e obtai n n linea r equation s i n n unknowns . Thi s Syste m o f linea r
equations always has a Solution since the System of homogeneous equations given
by <j>(a) = 0 ha s only the trivial Solution
co = c\ = • • • = c n = 0 .
We have obtained a method o f handling Operation s o n the elements o f F(a).
This treatment is based on the assumption tha t a i s the root of a given irreducibl e
46 4. RESIDUE CLASSES, EXTENSION FIELDS, AN D ISOMORPHISMS
( a\ - , a
C
8
It s vertices are given by the roots of the polynomial x17 — 1 ,
then
9 = c 0 + c xs3 + c 2s6 + • • • + c l5en,
where each power of s abov e the 1 5 th is reduced by using s 1 1 — 1 and p(s) = 0 ;
for example,
(s 3 ) 11 = (s) 33 = (s) 1 6 = _ ( i + s + . . . + £ 1 5 ).
If, instea d o f £ 3, w e tak e 6: u wher e v ^ 0 (mo d 1 7 ) w e ca n find an x suc h tha t
vx = = 1 (mod 1 7) . Henc e (s y)* = s . Therefore , b y the same reasoning a s above,
each value of v gives a different automorphis m of the field R(s). I t will be shown
that the nature of these 1 6 automorphisms permits us to see that the polygon of 1 7
sides possesse s a construction i n th e Euclidean sense . B y th e sam e methods w e
will be able to see that no construction exists for the 1 3-side d polygon.
http://dx.doi.org/10.1090/cln/015/05
CHAPTER 5
Galois Theory
THEOREM 5.1 There is an extension field E D F such that f(x) is the product of
linear factors in E. It is then said that fix) split s in the field E.
THEOREM 5.2 Let fix) be any polynomial in F and Q any extension field Q D F,
in which fix) can be split into linear factors,
The smallest field in Qfor which fix) splits is the field E obtained by the method
of Theorem 5.1.
combinations o f sums and products of the a t wit h the elements of F\ i.e. , E con -
tains all polynomials i n the a*. 1 I f this set of polynomials is a field—and w e shall
prove that it is—it is certainly the smallest Splitting field of f(x) betwee n Q and F.
Consider the set of all polynomials
0(ai,a 2 , ...,«* )
with coefficients i n F. W e now prove that this is the field E o f the previous theo-
rem. Since ot\ i s algebraic over F the polynomials (j>{ot\) over F form a field F(pt\).
Furthermore, sinc e F C F(a\), a 2 i s algebraic in F(pt\). (I t satisfies th e equation
f(x) = 0 over F(qt\).) Therefore , th e set of al l polynomials i n a 2 whos e coeffi -
cients are elements of F(ct\) —that is , polynomials in ct\—form a field F(ct\, a 2).
It follows by induction that
E = F(ai,a 2, . . . , a w),
the set of all polynomials in the a t. Not e that this field is, in fact, th e same as the
field obtained in Theorem 5.1 . D
The field E i s called the Splitting field of f(x) betwee n F an d Q.
EXAMPLE. Le t F b e the field R of rational numbers, Q the field of real num-
bers. Take
f(x) = (x 2 - 2)(x 2 - 3 ) = (J C + V2)(x - y/2)(x + V3)(x - V3) .
Clearly, E = / f (V2, A/3 ) an d hence consists of elements
(a + bVl) + (c + dV2)V3 = a + bV2 + cV?> + dV6,
a, b, c,d e R.
The dimension o f th e vector spac e E o f th e polynomials 2 in the GL[ ove r F i s
called the degree of E ove r F. Thu s the degree of 7?(\/2, V3) i s at most 4.
EXERCISE 1 . Sho w that the degree of R(Vl, y/3) ove r R is exactly 4.
THEOREM Let /(JC) be a polynomial over F. Any two Splitting fieldsoff(x) over
F are isomorphic.
We shall prove this result in the more general form :
THEOREM 5.3 If f(x) is any polynomial over F and f(x) is the corresponding
polynomial over an isomorphic field F, and if E is the Splitting field of f(x), E of
f{x), then the isomorphism between F and F can be extended to E and E.
PROOF: Writ e the factorization o f f(x) int o irreducible factors ove r F:
f(x) = c(x - a\)(x -a 2)--'(x- a r)pi(x)p2(x) • • • p s(x)
where the pi (x) ar e the irreducible factors o f degree higher than 1 . Since F an d F
are isomorphic this gives the factorizatio n
The reason for provin g the more general theorem is that in order to be able to
use induction one must make the Statement of the theorem for r + 1 identica l with
that for n.
= Co + CiCXr+t H h C r _iQrJ + J .
The ordinary rules for operating with derivatives, however, remain the same.
It is easy to verify tha t taking the derivative is a linear Operation, i.e.,
f
[af(x) + bg(x)]' = af(x)+bg (x)
where a, b € F.
LEMMA 5.7 For the derivative ofa product we have the usual rule
[f{x) • gix)] f = fix) • gix) + fix) • g\x).
PROOF: I f th e Statemen t o f th e lemm a i s tru e fo r tw o choice s f\ an d f o f
fix), i t is true for any linear combination af\ + fc/2, a,b e F: Assum e
(hg)' = f[g + fig', (f2g)' = fig + hg'-
It follows that
[(fl/i + bf 2)g]' = aifgY + bifgY = iaf + bf)g + (a/ i + bf 2)g'
r
= iaf l+bf2y§ + iaf l+bf2)g .
Since a polynomial i s a linear combination o f powers o f x, i t is sufficien t t o take
fix) t o be a power o f x. Moreover , sinc e the product o f fix) wit h gix) woul d
then be a linear combinatio n o f products o f powers o f x, w e need onl y prove the
lemma for products of two powers of x. Se t
fix) — x r an d gix) = x s.
This yields
ifgY = ix r+s)f = ( r + s)x r+s'1 = irx r~l)xs + x risxs~l). D
This lemma may now be used for the proof of the following:
56 5. GALOIS THEORY
LEMMA 5.8 Let a be a multiple root of fix); then it is a root of fix). Conversely,
ifa is a simple root of fix) then fiot) ^ 0 .
PROOF: T O say that a i s a multiple root of f(x) mean s that the factor x — a
appears at least twice in the factorization o f f(x) int o irreducible polynomials over
F(a). Consequently , we may write
2
f(x) = (x-a) (j>(x)
in the extension field F(a). Applyin g Lemma 5.7 we find that
fix) = 2(x - a)(/>(x) + (x- of)V(jc) ,
whence f\ot) = 0 . Conversely, if a i s a simple root we may write
fix) = (x — a)(f>{x) wher e 0(a) ^ 0 .
Lemma 5.7 yields
f(x) = <Kx) + (x-a)4>'(x)
whence
/'(<*) = 0(<*)^O . D
We now use Lemma 5.8 to determine the nature of those irreducible polynomial s
which may have multiple roots.
Let P(x) b e a polynomia l irreducibl e ove r a field F. Construc t F(a) 9 th e
extension o f F b y a root a o f P(x). I f th e multiplicit y o f a i s greater tha n 1 , it
follows tha t P'(a) = 0 or P\x) i s another equation i n F fo r a. Bu t we assume d
that P(x) i s irreducible; therefor e P(JC)|P'(JC ) (Lemm a 4.3, p. 40). However , by
definition, th e degree o f P'(x) i s less tha n tha t o f Pix) an d necessarily w e have
P'(x) = 0 . Thusifwese t
P(x) = «o + a\x + • • • + a nxn, a n ^ 0,
we obtain
P'(x) = a\ + la-ix + 3ß3X 2 + • • • + na nx
n l
~.
We conclude that P(x) ca n have a multiple root only if
a\ = 2^ 2 = 3^ 3 = • • • = na n = 0.
But n > 2 > 0 and an ^ 0 . Thus if F is a fieldof characteristic zero it is impossible
that na n = 0. This result yields the
COROLLARY An irreducible polynomial over a field of characteristic zero can
have only simple roots.
Assume now that F ha s the characteristic p. Unde r wha t conditions wil l the
polynomial P'{x) b e zero ? I f a vxv i s th e genera l ter m o f P(x), w e mus t hav e
vav = 0. Thus either a v = 0 or v — 0; for each nonzero coefficient a v, th e index v
must be a multiple of p. W e conclude that P(x) ma y be rewritten in the form
P{x) = c 0 + cix p + c 2x2p + • • • + c mx
mp
.
Setting
fix) = co + c\x -\ h c mx m
5.4. DERIVATIVE O F A POLYNOMIAL: MULTIPL E ROOT S 57
we obtain P(x) = f(x p). I t may very well happen that f(x) ha s the same form as
2
P, i.e. , f(x) = g(x p) o r P(x) = g(x p ) . However, it is clear that there is a largest
integer r for which P(x) ca n be expressed in the form P(x) = 0(x p ) . Evidentl y
0'(jt) 7 ^ 0, for otherwise we would write c/>(x) = \j/{x p) an d hence r woul d not be
the largest. Furthermore, 0(JC) is irreducible, for otherwise
4>(x) = g(x)h(x) = > 0 ( * 'r ) - g(x pr)h(xpr) = P(x),
which contradicts the assumption that P(x) i s irreducible.
To recapitulate: <p(x) is irreducible and 0'(JC ) 7 ^ 0; the roots of 0(JC ) ar e then
all distinct. Tak e for the domain of the discussion the Splitting field of P(JC)0(JC) ,
i.e., the field which contains al l roots of both polynomials. Denotin g th e roots of
<t>(x) by ßi, ß 2, . . . , ß s, w e may write
<P(x) = (x-ß l)(x-ß2)-.-(x-ßs)
= a p± ( P
\ap~lb + • • • (±1)' O^aP-ttf + • • • + (±l) p p
b,
Thenfor n + 1 ,
(a ± by" +l = (a± bY"" = [(a ± b)?Y = [a"" ± b»"]" = a^ ± ^" +'.
We note further tha t
[(a + b)+ cY = (a + b) p + c p = a p + b p + c p.
REMARK. I t follows fro m thi s las t resul t tha t i f m i s an integer i n a field of
characteristic p, i.e. ,
ra = 1 + 1 + h l (m times),
then
mP = lP + lP + ... + lP = m .
But this is a simple generalization of the famous Fermat theorem in arithmetic:
mp = m (mo d p).
Now let us return t o the problem: Th e Splitting field of P(x)(j)(x) i s also of char-
acteristic p sinc e 1 e F i s a n element o f an y extension field. I t follows tha t th e
extraction o f th e (p r)th root s o f ßt give s a unique result . Fo r i f oti is a root o f
xp — ßi then af = ßt. Consequently ,
xpr -ßi=x pr
-af =(x-a t)
pr
.
The Splitting of P(x) ma y now be written in the form
-]Pr
P(x) = \y[(x-a i)
•-1=1
which displays the fact that all roots appear with equal multiplicity p r.
EXERCISE 3. Verif y that the multiplicity p r o f the roots of an irreducible poly-
nomial P{x) ove r a field of characteristic p is the exponent of the greatest common
divisor of the nonconstant terms of P(x).
6
Read: " £ ove r F" for " E / F . "
60 5. GALOIS THEORY
But the cot ar e independent wit h respect to F ; w e conclude that each a^ = 0 and
the proof is complete. D
COROLLARY IfQDEDFand (E/F) = ( ß / F ) , then Q = E.
This is a direct consequence of the following :
PROPOSITION
(E/F) = 1 => E = F .
PROOF: I f the degree of E/F i s one, then E i s generated by any singl e de -
ment whic h i s independent (i.e. , nonzero). Bu t 1 is independent; therefor e ever y
dement of E i s in F. Consequentl y E = F . D
This corollary ca n also be obtained a s a special case of the corollary t o Theo-
rem 2.5 (p. 17).
LEMMA 5.11 Let X\(x), X 2(x), ..., X n(x) be n distinct characters of G with val-
ues in E. Then ifa linear combination
c\k\{x) + c2X2(x) H \-c nXn(x) = 0
for all x G G, itfollows that C[ — 0, / = 1 , 2, . . ., n.
PROOF: Le t us assume the contrary, tha t there are nontrivial linea r relation s
among the Xt. Selec t one of these for which the number of nonzero coefficients i s
least, say
[1] c\ki(x) + c 2X2(x) H h crXr(x) = 0
where c * ^ 0 , / = 1 , 2, . . ., r. Evidentl y r / 1 , for c\X\(x) = 0 implies tha t
c\ = 0 since A.I(JC) is never zero. Therefore r > 1.
The relation [1 ] is assumed to be true for all x e G. Consequently , it must be
true if we Substitute for x an y argument in G. Replacing x by ax where a e G , we
find
[2] ciXi(a)Xi(x) + C2X2(a)\ 2(x) H h crXr(a)Xr(x) = 0 .
Multiply by Ar(a) in [1] and subtract the result from [2] . This yields the relation
[3] c\[\\(a) - X r(a)]Xi(x) -\ \- c r_i[Ar_i(a) - X r(a)]Xr_i(x) = 0
which is shorter than [1 ] . If it can be shown that not all these coefficients ar e zero,
then this result contradicts the assumption that r is least and we are through.
We have assumed tha t X\(x) an d Xr{x) ar e distinct functions. Henc e there is
an a e G for which X\(x) ^ X r(a). Le t this be the a we have chosen above . I n
that cas e c\[X\(a) — Xr(a)] ^ 0 and [3] is a nontrivial relatio n whic h i s shorte r
than [1 ] . D
REMARK. Th e symbol a— > ä for a mapping is replaced by a functional no-
tation; in particular, the image of a field dement a through an automorphism a is
denoted by a(a). Clearl y
cr(l) = 1 ^ 0
and a(xy) = o(x)o(y). Therefor e a i s a character fo r the multiplicative grou p
consisting of the nonzero elements of the field. Lemma 5.1 1 provides us with the
important
THEOREM 5.12 Let E be afield, andco\, &>2, .. . ,co n distinct automorphisms ofE.
Then if
c\<D\(x) + c 2o)2(x) H h cncon(x) = 0
for all x G E, itfollows that C{ — 0, / = 1 , 2, . . ., n.
LEMMA 5.1 3 The set F ofall elements of E which remain fixed under the auto-
morphisms 0[ (i.e., the set consisting ofall a G E such that at (a) — a) is a subfield
ofE.
62 5. GALOIS THEORY
[3]
Uiö*i(crö>r) H h ^o-„(cr ^ r ) = 0.
By adding the left side s in [3] we find
W o ) +$2*2(0 ) + ' • ' +$nOn(ß) = 0
where $i , $2, • • •» ?« a re n ot a H z e r o an d 0, given by [1], may be any elemen t of
E. Bu t this is contrary to the result of Theorem 5.1 2 an d therefore th e hypothesis
r < n is inadmissible. D
The rational functions ove r F are the symbolic quotient s of polynomials (p(x)/\ls(x), \jf{x) 7^
0, which ar e defined t o equate, add, and multiply in the same manner a s fractions .
5.7. AUTOMORPHIC GROUPS OF A FIELD 63
'Cf. Exercis e 1 , p. 1 .
64 5. GALOIS THEOR Y
x{a) — a \
There are two possibilities:
(1) W e may in this manne r be able to obtain an y number o f automorphism s
we please. In that case (E/R) — 00 .
EXAMPLE. Le t R(x) b e the field of rational function s ove r the field R of ra-
tional numbers . Fro m th e automorphism a r , f(x) - > / ( * + 1 ) , we obtain the
automorphisms
cr[f(x)] = f(x + l),
o2[f(x)] = f(x + 2),
which are all distinct. The degree of R(x) ove r R is therefore infinite .
(2) O n the other hand, the set of all possible products of the a t ma y be finite.
In that event they form a group from the result of
9
See p. 2.
10
Cf. Exercis e 4, p. 62.
5.7. AUTOMORPHIC GROUP S O F A FIELD 65
Since these are n distinct elements of G , it follows tha t they are merely the a's i n
another arrangement. Consequently , if
[1] a = o r 1 ( 0 ) + G r 2 ( ö ) + .--+or fI(0)
Xj = Y^ ai(xj)> j = l,2,...,m,
<I>(X) = Y\( X ~ a
k).
k=\
It follows tha t
rr
5.16. If the roots of an irreducible polynomial are all simple, the polynomial is said
to be separable. I n general, any polynomial will be called separable if each of its
irreducible factor s i s separable . Whe n the roots o f a polynomial ar e simple, it is
certainly separable . However , this is not a necessary condition since , e.g., [<t>(x)] 2
is separable. It will be recalled that Theorem 5.5 on fixed fields requires the use of
separable polynomials.
(R(y/2)/R) = 4 an d (E/R) = 8.
What are all automorphisms of E which leave R fixed? We have shown (p. 44)
that the rational numbers always remain fixed. Hence we have only to find all au-
tomorphisms of E. Sinc e f(x) i s separable, Theorem 5.5 teils us that no elements
but thos e o f R remai n fixed. Bu t th e se t o f al l automorphism s i s a group an d R
is the fixed field of this group. W e conclude that there are exactly eight automor -
phisms. It is not difficult t o write these down:
n i EXERCISE 8 . Demonstrat e tha t thi s grou p o f eigh t
I n i automorphisms i s isomorphi c t o th e grou p o f sym -
0 m i metries o f a Square . Denot e b y er th e automorphis m
a1 -n i powers o f er form a cycli c
subgroup o f order 4. I f we denote by r th e automorphis m
a3 -in i
{y/2, i) -+ (y/2, - i ) , then w e can describe al l produets in
X n —i terms o f o an d r b y mean s o f the rules o 4 = / , r 2 = / ,
ax in —i xo — cr 3r, r<7 2 = a 2 r , rcr 3 = ax.
a2x -n —i
ah -in —i
5. GALOIS THEOR Y
splits in E since each factor splits in E. Moreover , among the roots are co\, CL>2, • •.,
con. Hence no smaller field than E ca n possibly be the Splitting field of f(x). Th e
proof is complete. D
COROLLARY 5.18 IfE/F is normal and ifQ is any field intermediate between E
and F, F C Q C E y then E/Q is normal.
PROOF: E is the Splitting field of a polynomial f(x) ove r F and consequently
is the Splitting field of the same polynomial over £2. D
COROLLARY 5.1 9 If G is the group of E/F (i.e., F is the fixed field under the
group G of automorphisms of E), then there is a 1-1 correspondence between the
subgroups ofG and the subfields of E which contain F:
S C G O 3Q such that F C Q C E where S is the group ofE/ Q.
The proof is obvious.
EXAMPLE. (Cf . wit h exampl e o n p . 67. ) Conside r th e field E = R(J/2, i)
over the field of rational numbers. Eac h intermediate field between E an d R cor -
responds t o a subgrou p o f G . Thu s ther e ar e thre e subfield s o f degre e 2 corre-
sponding to the subgroups of order 4 and five subfields o f degree 4 corresponding
to the subgroups o f order 2. I t is easy to find the fields of degree 2: R(i) <e > G4,
R(V2) *± G41 , R(iy/2)<- > G42. O f degree 4 we have the fields R(i, y/l) o G 2u
R(V2) *> G 22, R(iy/2) ** G 24. However , i t is not alway s easy t o teil on sigh t
which field corresponds to a given group. What are the fixed fields of G23 and G25?
The fixed field o f G2 3 consist s o f element s whic h ar e no t change d b y a r ,
(v^2, i) - » (i\/2, —i). The general element of E ma y be put in the form
9 = c 0 + ciy/2 + C2V2 + c 3(J/2)3 + c 4i
+ c 5i^/l + c 6iV2 + c 7 i(v^) 3 ,
70 5 . GALOIS THEOR Y
whence
(0) = c 0 + c xiV2 + c 2(-y/2) + c 3(-iV2?) + c 4(-i) + c 5Ü2
+ c 6 /V2 + c 7 (-v / 23).
If 6 remains unchanged
Co arbitrary, c-i = 0, c 3 = — c-j, Q = 0 , ce arbitrary.
Namely,
6 = c 0 (l + i)\/2 + ceiy/l + c 3 (l - i)Vl?
of the above equations. Th e root \/2 ca n have only three possible images, CD only
two. Ther e are si x possible combinations an d sinc e there ar e si x automorphism s
all combinations occur. The automorphic group is given by the table:
I a a2 X ax a2x
^2 n o)\fi co2^/2 1/2 £«v2 co2^/2
CO CD CD CO CO1 CO1 a>2
The group multiplication follows the rules
o I, x 2 = / ; ax = a 2x, xa 1
= ax.
It is easy to verify that this is the group of symmetries of the triangle (cf. Exercis e 1,
P.D.
The group of E/R possesse s one subgroup of order 3, G3: I,a,a 2; an d three
subgroups of order 2,
G21: / , r ; (722 : /, ox\ G23 : /, a 2x.
The subgroup s correspon d t o the one quadratic field R(co) and three cubic fields,
R(y/2), R(coy/2), and R(co 2^/2), respectively.
EXERCISE 9 . Determin e the automorphisms o f the polynomial for which \fl
is a root.
Consider th e genera l proble m o f a n arbitrar y irreducibl e cubi c Pix) ove r a
field F wit h distinct roots a\, 01 2, a 3. Th e Splitting field
E = F(ai,a 2,a3)
is in general of degree six . If , however , the equation alread y split s in F(o?i) , then
(E/F) = 3 (Lemma 5.9) . I n this specia l cas e the group of E/F bein g o f prirne
order i s therefor e cycli c an d consists o f th e powers o f on e demen t I,a,cr 2. W e
conclude that a mus t permute the roots cyclically; for all other permutations would
leave one root fixed, and hence would consis t o f a t most the transposition o f two
roots and be therefore o f period 2. Thus a i s represented either by the permutation
(231) or (31 2 ) anc* ^ere ar e no other possibilities.
EXERCISE 1 0 . Fin d th e irreducibl e polynomia l ove r R wit h roo t 2 co s ^f.
Show tha t thi s i s a cubic o f th e abov e specia l type . (Hint: Us e sevent h root s o f
unity.)
On the other hand, suppose F(a\) i s not the Splitting field. Since x — a\ ma y be
factored ou t in F(ai) , w e must have (E/F{ct\)) = 2 and consequently (E/F) =
6. Ther e mus t the n b e si x automorphism s o f E whic h leav e F fixed. Thes e ar e
determined by the way they permute the roots. Since there are six permutations of
three elements, all permutations ar e possible. Thu s we have shown that the group
of automorphisms o f the Splitting field of an irreducible cubic is either the triangle
group or the cyclic group of order 3. N o other cases occur. W e shall see that this
result implies the solvability of the general cubic by radicals.
72 5. GALOIS THEOR Y
For the irreducible equation of fourth degre e there are 24 possible permutation s
of th e roots . I n mos t case s th e Splittin g field wil l actuall y b e o f degre e 24 . Th e
special case s correspon d t o subgroups o f the permutation grou p o f fou r objects . I n
order t o kno w al l possibilities, tak e th e grou p o f 2 4 element s an d find al l possibl e
subgroups correspondin g t o th e irreducibl e case . I n general , th e analysi s o f th e
general equatio n o f n th degre e involve s th e grou p o f permutation s o f n objects .
The Splittin g field i s mos t ofte n o f th e highes t possibl e degree— n\. I t wil l appea r
later tha t thi s metho d o f treatin g th e Solutio n enable s u s t o tei l whethe r o r no t an y
given equatio n i s solvabl e in terms o f radicals .
THEOREM 5.2 0 Let U be a field containing
(1) F, the ground field.
(2) E, the Splitting field of any polynomial f(x) (not necessarily separable)
over F .
(3) Q, a field intermediate between E and F , E D Q D F .
(4) Q\ an extension field of F which is isomorphic to Q in a mapping which
leaves the elements of F fixed.
Itfollows that Q f C E and that the isomorphism between £2 and Q ; is contained in
some automorphism of E.
Thus w e se e that it is possible t o generalize ou r argument s t o al l polynomials .
PROOF: Denot e th e roots o f f(x) b y a\, #2 , • • • , «„. Th e Splittin g field E o f
f(x) i s obtained b y adjoinin g th e roots:
E = F(ai,a 2, . . . , « „ ).
Since E D Q D F , E i s also the Splittin g field o f f(x) ove r Q. Furthermore , f(x)
is a polynomial i n Q' an d the Splittin g field o f f(x) ove r Q f i s som e field i n U
E' = ß ' ( a i , a 2 , . . . , « « ) .
By Theore m 5. 3 th e isomorphism betwee n Q an d Q f ca n be extended t o E an d E'.
Now le t CD be an y demen t o f Q , co f its imag e i n Q\ Sinc e co e E i t can b e writte n
in th e for m
co = cp(ai,a 2, . . . , < * „ )
where 0 i s a polynomial wit h coefficients i n F . I n the isomorphism betwee n E an d
E\ co can onl y b e mapped o n
Two elements are equivalent if and only if they lie in the same coset
er = x <s > o(x) = r(x) fo r all x e Q O er e rS.
In this manner we dehne a new group G/S calle d the factor group of G with respect
toS.
EXERCISE 1 1 . Sho w that an equivalence relation which preserves multiplica -
tion in a group G,
a = b, c = d^a-c = b'd,
may be defined i n terms of a unique invariant subgroup S so that two elements are
equivalent if and only if they are contained in the same coset of S.
REMARK. Th e right and left coset s of an invariant subgroup with respect to a
given element are the same and, conversely, if the right and left cosets of a subgroup
with respec t t o an y elemen t ar e the same , the n th e subgrou p i s invariant , fo r w e
have
aSa~x =S OoS = So.
Let us determine al l the subgroup s an d factor group s o f an y cycli c group G .
Since the group is commutative, it follows by the remark above that every subgroup
is invariant. If G is of order N, w e may write its elements as follows:
CT, C T , (7 , . . . , C T , CT = 1 .
Gö = {a , er , . . . , ( 7} =< 7 o .
The cosets are better written
crS, (crS)2, . . . , (aSy~\ (crS) r
= S.
Hence the factor grou p G/S i s a cyclic grou p o f orde r r. I n brief, fo r an y cycli c
group, all subgroups and all factor groups are cyclic.
As an example, suppose the group of E/F i s cyclic of order 1 2 . The hierarchy
of fields normal with respect to F i s best described by the diagram in Figure 5.1.
5.9. FINITE FIELDS 7 5
FIGURE5.1
Thus to reach a cyclic extension of degree 1 2 from F we must make two quadratic
extensions and a cubic extension.
Namely, ab = 0 = » a = 0 or b = 0 .
76 5. GALOIS THEORY
We have proved
LEMMA 5.23 The number q of elements in afinitefield F is the nth power ofthe
characteristic where n = (F/R p).
The q — \ nonzer o elements of F for m a multiplicative group of order q — 1 .
Hence for all nonzero a e F
q l
[1] a ~ = 1.
Therefore, for all a i n F w e have
q
[2] a = a,
a generalization o f the Fermat theore m o f arithmetic . I t follows tha t th e polyno-
mial x q — x ha s q roots—th e totalit y o f element s o f F . Sinc e th e degre e o f th e
polynomial is q, it can have no other roots. Hence
q
[3] x -J C = J~[(JC-GO .
aeF
or
xq - 1 = J^(jc-a) .
a^O
Settins x = 0 we obtain
-^c-D^n« a^O
whence
Y\a = (-l)q.
Since either q is odd, or the characteristic is 2 and — 1 = + 1 , we have
n«—i.
aeF
a^O
5.9. FINITE FIELD S 77
(p-l)\ = -l ( m o d p ) .
We have see n tha t F i s the Splittin g field o f x q — x ove r R p. No t onl y doe s F
contain th e roots o f this polynomial—it consist s entirel y o f the roots. Ther e ca n b e
no smalle r Splittin g field tha n F. S o ther e i s essentiall y n o mor e tha n on e field o f
degree n ove r R p. Fo r i f F r als o ha s p n elements , the n i t too i s a Splittin g field o f
the polynomia l
x""-x
and i s hence isomorphi c t o F.
Conversely, i f q = p n i s given , w e ca n construc t a field o f q elements—th e
Splitting field F o f f(x) = x pH — x ove r R p.
/'(*) = - l # 0 ;
therefore ther e ar e n o multipl e roots . W e ma y the n writ e f(x) = n f = i ( x ~ a t)
where th e « / ar e distinct . F contain s n o othe r element s tha n th e a t sinc e th e a t
already constitut e a field. I n proof , conside r an y tw o o f th e a's , sa y ot\ an d a 2 -
From oi\ = ct\, a^ = <#2 > w e deriv e the rules:
Multiplication: ((X\(X2) pn = ^1 ^ 2
pn
Division: iß\/^i) — ot\/ai1
{0 2 7^ 0)
Sums, products, an d quotient s ar e again roots a.
We have prove d
THEOREM 5.2 4 To each power p n ofa prime p there is exactly one field (apart
from isomorphism) with p n elements. There are no other fields.
In th e further investigatio n o f finite fields w e shal l require a number o f group -
theoretic lemmas .
LEMMA 5.2 5 Let a, b be elements of a commutative group and denote their peri-
ods by a and ß, respectively. Then there is an element c oftheform c = a vbß such
that the period of c is the least common multiple ofa and ß.
PROOF: Conside r th e factorization o f a an d ß int o primes (a s on p. 31 )
ffi Vi ffm
ff? '
we then obtain
ff? ff?
But, by our assumption, /x has the factor qf* = ^. It follows tha t the exponent of
b is certainly divisible by ß. W e conclude that
v/ffl__ffm_
4
whence
vy q{ • • • a tm
<ti
or, in particular,
tm
vy'q? • •• a
4?
But r r = £;, 5/ = 0. Therefor e
<lo\Y'-
We immediately conclude that
where s, = ?,. We have now proved that for the period y' of c
product o f two linear factors. Ther e ar e five possible factors , henc e five polynomi -
als wit h doubl e roots , te n wit h distinc t roots . Th e remainin g te n polynomial s ar e
irreducible. W e not e immediatel y tha t x 2 — 2 an d x 2 — 3 ar e irreducible . W e ar e
thereby provide d wit h tw o ways o f constructin g th e field o f 5 2 elements . W e mus t
then b e abl e t o expres s V 3 i n term s o f >/2 . Le t 0 denot e a root o f x 2 — 2. Rs(9)
consists of the elements a + bO wher e a, b e R$, 0 2 = 2 . T o represent %/ 3 in Rs(9)
consider th e equation (a + b6) 2 = 3 , which i s equivalent t o
a2 + 2b 2 + 2ab9 = 3 .
We can onl y hav e a = 0 , b = ± 2 .
Let F b e th e field o f n th degre e ove r R p. Wha t ar e th e automorphism s o f
F/Rp? On e automorphis m i s a(a) = a p. B y th e propositio n o n pag e 5 7 w e se e
that
a(a ±ß) = (ot± ß) p =a p±ßp = cr(a) ± <x(ß),
and a simila r resul t hold s fo r multiplication . W e have onl y t o sho w tha t th e corre -
spondence i s 1 -1 :
a(ä) = a(ß) = ^ a p = ß p = » (et - ß) p
= 0 = ^ a = ß.
Having on e automorphis m w e may iterat e until w e ge t repetitions :
a=ap, a 2
= a p\ . . . , cr d
= a pd, . . . .
If d i s the period o f a, the n o d = I, th e identity automorphism . Thu s d i s the leas t
integer fo r whic h
er = a
for al l q element s a i n F. Th e equation x p — x = 0 must then have all q element s
as Solution s an d therefor e p d > q = p n. Henc e d > n. But , o n th e othe r hand ,
apH = ÖL fo r al l a. Consequently , cr n(a) = a fo r al l a, i.e. , a n = / . Th e perio d o f
o ca n be nothing othe r than n.
We have show n tha t the automorphism s
1, <T , (7 , . . . , O
CHAPTER 6
There is yet one question which has occurred repeatedly and has not been dealt
with in any adequate way. This is the question as to whether any given polynomial
in th e rationa l field i s irreducible . W e cannot dela y th e answe r t o thi s questio n
any longer—fo r otherwis e w e shal l no t b e abl e t o solv e an y specia l equations .
Therefore w e shal l deviat e fro m th e mai n cours e o f thes e lecture s t o discuss th e
topic of irreducibility.
Let R be any commutative ring with
(1) n o divisors of zero,
(2) a unit element,
(3) uniqu e factorization int o primes; 1 i.e., every element o f R i s either zero,
a unit, a prime, or a product of primes.
Let R[x] denot e the ring o f all polynomials wit h coefficients i n R. W e write
a\f(x) if a e R i s a divisor of all the coefficients o f fix); i.e. , if we have fix) —
ag(x) wher e g(x) ha s coefficients i n the ring.
THEOREM 6.1 (Gauss) If p e R is a prime and f{x), g(x) e R[x], then
Now p\üjbk bu t p divide s all the other terms. It follows that p\cj+k- D
The greatest common divisor of the coefficients o f a polynomial f(x) e R[x]
is called the content of f(x). I f the content of f(x) i s 1 , fix) i s said to be primitive.
Denoting the content of f(x) b y d, we write
f(x) =dgix).
Thus any polynomial may be written as the product of a ring element with a prim-
itive polynomial.
l
R i s therefore less special than a principal ideal ring.
81
82 6. POLYNOMIALS WIT H INTEGRA L COEFFICffiNT S
If for fix), gix) e R[x] there is an hix) e R[x] such that fix) = gix) -hix),
we say that gix) divide s fix) i n the strong sense, or simply gix) divide s fix) an d
we write gix)\fix). I f there is a ring element a such that gix) divide s afix), the n
gix) i s said to divide fix) i n the weak sense.
LEMMA 6.3 If gix) is primitive and gix) divides fix) in the weak sense, then
gix) divides fix) in the strong sense.
PROOF: I f gix) divide s fix) i n the weak sense , then there is an a e R an d
an hix) e R[x] suc h that
<*f(x) = g(x) -hix).
We may put
hix) = dhoix)
where d is the content of hix) an d hoix) i s primitive. Similarly , we may write
fix) = bfoix)
where foix) i s primitive. Thus we obtain
abfoix) = dgix) • hoix)
where d is the content of the right side and ab is the content of the polynomial on
the left . I t follow s tha t d\ab an d ab\d\ d an d ab ar e equal excep t fo r perhap s a
unit factor. B y including the proper unit factor i n one of the polynomials, we may
ensure ab = d. Th e fact tha t there ar e no divisors o f zer o permits the use of the
cancellation law so that
foix) = gix) • hoix)
from which follow s
fix) = bfoix) = gix) - bhoix).
This is the desired result. D
The ring R may be extended to the so-called quotient field F by the method of
Lemma 5.22 (p. 75). We shall now refer to the elements of R as integers and those
of F a s rationals. Th e ring R[x] o f polynomials wit h integral coefficients i s con-
sidered to be imbedded in the ring F[x] o f polynomials with rational coefficients .
If fix), gix) e R[x], gix) primitive , and if gix) divide s fix) i n F[x]\ i.e. , if
there is an hix) e F[x] suc h that
fix) = gix) -hix),
then gix) divide s fix) i n the weak sense and therefore i n the strong sense.
6. POLYNOMIALS WITH INTEGRAL COEFFICIENTS 8 3
EXERCISE 1 . Wha t are the units in R[x]l Complet e the proof tha t R[x] i s a
unique factorization ring .
EXAMPLE. R[x] satisfies the conditions of a unique factorization ring . There-
fore we may adjoin a new variable to obtain a new ring in which unique factoriza -
tion holds. Thi s is the se t of all polynomials i n y whos e coefficient s ar e polyno-
mials in x—the rin g R[x, y] of polynomials in two variables. Apparently, the ring
of polynomials in n variables over a unique factorization domai n is again a unique
factorization domain . A field is a unique factorizatio n domai n (ever y elemen t i s
either zero or a unit) so this remark applies to fields. We have proved even that the
polynomials ove r a field form a principal idea l ring. Thi s i s not tru e for polyno -
mials in more than one variable, however. Fo r example, the set of all polynomials
in two variables which are zero at the origin, i.e., have no constant term, definitel y
form an ideal—but it is not principal. For let & be the set of all polynomials f(x,y)
which vanish for x = 0, y = 0. Clearly x,y e &. But if £ consist s of the multiples
of a Single element 0 the n <p\x. Hence , either 0 = x o r 0 = c , a constant. Sinc e
y i s als o a multiple o f 0 w e must have 0 = c. W e cannot tak e 0 = c sinc e no
nonzero constant is in the ideal.
6.1. Irreducibilit y
Let us consider a specific example, the polynomial x 5 — x — 1 , in order to see
what difficulties occu r in proving irreducibility. Se t
p(x) = x 5 — x — 1 .
Does p(x) hav e a linear factor? I f so, it must have integer coefficients. Sinc e the
leading coefficient i s 1 w e may write
p(x) — (x -a)(x 4 H )
where a is an integer. Comparin g terms, we see that a\ 1 so that a = 1 or a = — 1.
Neither is a root so the possibility of a linear factor is excluded.
The only remaining possibility is that p{x) i s the product of a quadratic and a
cubic, say
p(x) = g(x) -h(x)
where
g(x) = x 2 + ax + b, h(x) = x 3 + ex 2 + dx + e,
a, b, c, d, e are integers. Le t us see what possibilities ther e are for value s of g{x)
for different value s of x:
X p(x) g(x)
2~ 29 ±1 o r ± 2 9
1 -1 ±1
0 -1 ±1
-1 -1 ±1
- 2 -31 ±1OT ±3 1
6.1.IRREDUCIBILITY 85
X fix) 8(x)
*0 /o d° d° d°
l
*1 /l d 1 ^Z 1 d
xr /r J r ^/ r
^/ r
a a
lj 2 '3 '* ' *
where the S ar e the divisors of fk. I t is clearly to our advantage to choose values of
x for which f(x) i s prime and large. The method is now to interpolate polynomials
through the possible values of g(x), e.g. ,
r
g(x0) = d^, g(xi) = d\, . . ., g{x )=d[.
If an interpolation doe s not lead to a polynomial with integral coefficients, w e can
reject it at once. Otherwise we must test by long division into f(x) o r by expanding
our table. The method must be repeated for all possible combinations of the d's t o
be a sufficient proo f of irreducibility.
This approac h i s obviousl y th e las t resor t o f th e desperate . W e shal l soo n
discuss certain sufficient condition s for irreducibility which are often o f great use.
EXAMPLE. Le t us determine the values of a for which
f(x) = x 5 — ax — 1
is irreducible. If f(x) ha s linear factors i t must have either +1 or — 1 a s a root
1- a - 1 =0;a =0
- l + a - l = 0 ;a = -2 .
If f(x) ha s a quadratic factor we may write
f(x) = je 5 - ax — 1 = (x 2 + bx + c)(x 3 + dx 2 + ex + f).
86 6. POLYNOMIALS WIT H INTEGRAL COEFFICffiNT S
Equating the coefficients o f the terms of equal degree we obtain the relations
[1] b + d = 0,
[2] e + bd + c = 0,
[3] f + be + cd = 0,
[4] bf + ce = -a,
[5] cf = -l.
Equation [5 ] yields
c = -f = ±l.
Using b = —d (from [1 ] ) in [3] we obtain
d(c-e) = -f = ±h
whence d = + 1 an d (c — e) = ± 1 . From [2]
e + c = l, e — c = ± 1,
whence either c = 0 or c = 1 . The first case is impossible if we are to satisfy [5] ,
So we obtain at once:
c= l,e = 0,/ = -l,d = l, b = -l.
From (4) we have a = — 1. There is then only one possibility for a quadratic factor:
x5 +x - l = (x 2 -x + l)(x 3 +x 2 - 1 ) .
There are only three reducible cases: a = 0 , a = 2 , a = — 1. In the first two cases
there cannot be a quadratic factor s o f(x) factor s int o a linear and a fourth degre e
factor.
Consider another example. For what values of a is the polynomial
/(JC) = x5 — x — a
reducible? There are obviously an infinite number of possibilities for linear factors
for we need only take a = b 5 — bis an y integer.
EXERCISE 2. Fo r how many values of a does the polynomial
f(x) = x 5 — x — a
have a quadratic factor ?
Hint: This problem leads to a diophantine equation which has only a few Solutions.
THEOREM 6.7 (Eisenstein) Iffor the polynomial
f(x) = a nxn + a n-\xn~x H h a0
there is a prime p such that
(1) p\dn,
(2) p\at for i = 0, 1 , 2, . . ., n - 1 ,
(3) p 2Wo,
then f(x) is irreducible.
6.2. PRIMITIVE ROOTS OF UNITY 87
is irreducible. Put
<P(x) = f(xP"~ l) = f(y)
where
yP 1
y=x'-\ f(y) = -
y-l
We have
f(y + 1 ) = y p~l + (term s divisible by p) + p.
Now
0(JC + 1 ) = /([J C + l]^" 1 )
=
f([ xpn " ^ (terms divisible by /?)] + l )
= [x p" + (term s divisible by p)]p + (term s divisible by p) + p
= x^ p~X)p + (term s divisible by p) + p.
The proof is immediate by Eisenstein's criterion.
The proof show s that
n
xP - 1 = (x pn~l - 1 )0(JC )
<M*)= n <*-*") •
(y,m)=l
0<v<m—1
6.2. PRIMITIVE ROOTS OF UNITY 8 9
EXERCISE3.
(a) Prov e that $>i m(x) = <f> m(—JC) for m odd.
(b) Prov e that if p\m, wher e p i s prime, then <b pm(x) = $> m(xp)/<&m(x).
Discuss the case when p\m.
The polynomials O m ar e called th e cyclotomic (circle-dividing) polynomials .
The field R(e), wher e s is a root of a cyclotomic polynomial, is sometimes called
a cyclotomic field.
This page intentionally left blank
http://dx.doi.org/10.1090/cln/015/07
CHAPTER 7
constructible an d sinc e any construction whic h cannot be performed wit h this re-
stricted choice of points is certainly impossible for a completely arbitrary choice.
Given any lengths i j we ca n
easily construct the sum x + y and
the difference x — y 9 and (see fig-
ure) th e produc t an d quotient , xy
and x/y. I t follow s tha t w e ca n
construct an y elemen t o f th e field
R(x,y) generate d b y thes e ele -
ments. In general, if x\, X2, ..., x n
are the given data, we can certainly1
construct an y elemen t i n the field
R(x\, X2, . . . , x n). W e may extend
the se t o f constructibl e element s furthe r b y considerin g th e intersection s o f cir -
cles with straight lines or with circles. Th e intersection o f two straight lines gives
nothing new. Th e coordinates of a point on a straight line satisfy a linear relation.
Hence the determination of a point as the intersection of two straight lines involves
the Solutio n o f a pair o f linea r equation s an d doe s no t tak e us ou t o f th e groun d
field. The problem of two intersecting circles can be reduced to the intersection of
a straight line on a circle and this will usually necessitat e goin g outside the field.
The determination o f a point a s the intersection o f a circle with a straight line in-
volves the Solution of a quadratic equation. Shoul d this be irreducible, we append
the Solution to the ground field and thereby obtain an extension of degree 2.
Thus, in any construction, w e begin with certain accessibl e elements, the ele-
ments of the ground field R(x\, X2, • • •, xn), an d at each step we obtain a new field
of accessible elements where, if say F n is the field at the nih step , we have
(Fn+l/Fn) > 2.
Consequently, if a construction can be performed i n n steps, the degree of F n over
the ground field must be a power of 2,
(Fn/F) = 2 v (v<n).
Suppose th e Solutio n require s a segmen t o f lengt h a, e.g. , a i s th e chor d o n th e
unit circle subtended by the angle 6/3. Th e length a must be an element of F n and
therefore th e field F(a) i s intermediate between F an d F n,
accept the sid e length o f th e given cube a s the unit, thi s problem i s equivalent t o
solving the irreducible equation
x3 = 2.
Since (R(y/l)/R) = 3 , the construction cannot be performed .
We have found a necessary condition that a length a b e constructible, namely,
that
(F(a)/F) = 2\
Thus, in many cases we can prove the impossibility o f a construction. I t is natural
to ask for a condition that is both necessary and sufficient. Sinc e any constructible
length can be derived from th e data of the problem by the rational Operations and
extractions o f squar e roots , thi s conditio n i s tha t i t b e possibl e t o find a field E
containing a fo r whic h ther e i s a chain o f fields with E a t th e to p an d F a t th e
bottom,
F = F X C F 2C • •• C F n = E,
and (iVfi/F v ) < 2 .
Let us examine the problem of constructing the m-gon, the regulär polygon of
m sides. Consider the factorization o f m into primes,
m
= P\ Pl '"Pr '
If the m-gon can be constructed, then plainly we can construct any d-gon where d
is a divisor of m. I n particular, we can construct the polygons of p\ [, p^ 2,..., p v/
sides. Conversely , i f i t i s possible t o construc t thes e p^-gon s ( / = 1 , 2 , . . ., r) ,
then we can construct the m-gon. For the numbers
v
m/p\\m/p%, ...,m/p /
are relatively prime and therefore the diophantine equation
mx\/p\x + mx 2/pV2 H h mx r/pvrr = 1
has a Solution in integers x\, x 2, . . . , x r. Dividin g b y m we obtain Y^( xi/P?) —
1/m. Henc e a n m th par t o f a circle consist s o f a su m o f (p^) th parts . W e need
therefore conside r only powers of primes.
EXAMPLE. Th e problem of constructing a 15-gon reduces by these considera-
tions to the problem of constructing an equilateral triangle and a regulär pentagon.
We must find integers x, y wit h
5x + 3y = 1 .
The numbers x = — 1, y = 2 work, —\ + \ = ^ - T o construct the 1 5 th part of a
circle we first construct an angle of 1 44 ° and then subtract 1 20° .
REMARK. W e shall say an imaginary number a + ib is constructible if the real
and imaginar y parts separatel y ar e constructible . Th e introductio n o f imaginar y
elements does not affect ou r theory. Th e sum, difference, product , and quotient of
94 7. TH E THEOR Y O F EQUATIONS
, \a + Va 2 + b 2 . -a + ja 2 + b 2
v^+^ = y — -— + *y .
The proble m o f constructin g a n m-go n i s equivalen t t o th e proble m o f con -
structing the length cos ^. I f this length is constructible, so is sin ^ an d also
2n 2iz
8 = cos h / sin — .
mm
e is a root of the equation 8 m = 1 . If m is a power of a prime, m = p v, then , by the
results starting on page 87, 8 is a root of the irreducible polynomial
x"v-l
%P — 1
which is of degree p v~l(p — 1). Bu t we can construct 8 only if the degree of the
extension field R(8) i s a power of 2, i.e., only if
pv-\p-\) = 2».
We can have only p = 2 or v = 1 . Except for 2 , no power of a prime higher than
the first can be admitted. In particular, the polygon of 9 sides cannot be constructed
and it is therefore impossible to trisect the angle of 1 20° . As a side result, we have
shown that the trisection problem cannot have a general Solution.
For v = 1 we have p = 2 ß + 1 . We are interested in all primes of this form. If
ix has an odd divisor, /x = X(2n + 1 ) , then
X
2^ + 1 = 2 x(2n+l) + 1 - (2 + 1 )( - • •)
is not a prime. The only primes which are allowable must therefore have the form
p = 2 2k + l.
The numbers o f this form ar e prime for values of k up to 4. Thes e are the primes
3, 5, 17, 257, 65537. Fermat' s famou s conjectur e i s that these numbers ar e prime
for al l values of k. Actually , this breaks down at 5; 2 32 + 1 is divisible by 641 . It
is not known whethe r there are an infinite numbe r of primes o f this form. I n any
case, the only constructible m-gons ar e those for whic h m i s a product of power s
of 2 with Ferma t primes, none of th e latter appearin g t o a power highe r tha n th e
first. We have really only shown that no other polygons are constructible. I n order
to sho w tha t th e constructio n o f thes e polygon s i s possible , w e shal l nee d mor e
refined tools.
Furthermore, we have
ai a2
[aaxol+a2o2+...+anony = r [cr1 (a )a2(a ) • • -an (^")]
a Tcr
=a l l+a2T(J2+-+anT:crn
ß ß
Now, if the group of E/F is G = {<J\,cr2, . . ., or„}, we dehne the norm Na for
a e E to be the product of the images of a under G,
(Tl+ai+ +(Tn
Ate = o x{a)a2{a) • • -(^(a) = a '" .
We must have Na e F since
(iVa)T = a aTl+aT2+-+(7Tn
and the r^ ar e simply the elements of G permuted. If G is cyclic of order n,
G = {/,<x,<r 2, ...,(T n - 1 },
we have (using N(aß) = A/a • Afß)
M * 7 " * = (MX) 7 "* - a (/-a)(l+cr +fl r 2 +..-h,»-1 ) = „/-* » =a 0= l
EXERCISE 1 .
(a) Suppos e E/F normal , (E/F) = n. Conside r the n2 equation s
X(ji — X(jX T
an-l(ß) = e n~lß,
and these elements ar e distinct. Consequently , ß satisfie s a n equation o f degree n
over F (Theore m 5.1 6 , p . 66) an d therefor e E = F(ß). Furthermore , o(ß n) =
[cr(ß)]n = s nßn = ß n\ i.e. , ß n i s fixed in the automorphisms of E/F an d therefore
ßn G F. Consequently , ß i s the root of an equation
xn=b (beF).
The field E cycli c o f degre e n ove r F ma y b e obtained simpl y b y adjoinin g on e
radical, ß = \fb. Sinc e ß i s the root of an irreducible equation o f n th degree , we
conclude that
xn - b = (J C - ß)(x -eß)---(x- e n l
~ ß). D
EXAMPLE. Conside r th e field R(^/Tl) o f degre e 2 ove r R. Th e primitiv e
square root of 1 is in R, s = — 1. N(—l) = ( — l)2 = 1 . Thus Ns~ l = 1 . We can
therefore writ e —l=ß l~a,ß^z0, wher e ß has the form
ß = 0 + 0 ae = 0-e tT
.
6 = 1 is not acceptable . Consequently , 9 = VT T must work , an d i n fact yield s
ß = 2vTT .
98 7. THE THEORY OF EQUATIONS
The result we have just obtained is constructive in that we may single out the
element which generates the field. For ß w e take
ß = e + s- xe° + S- 29G1 + • • • + 8- {n-l)ean~{
since a(s) = e. Now ß cannot be zero for all n basis elements of E over F. Henc e
we need to try at most n values of 0 to determine ß. Havin g found ß, w e obtain ß n
and this must be an element of F .
Let us attac k the converse problem. I f F i s a field containing a primitive n th
root of unity, what is the Splitting field of the equation
Let ß be any root of ijf(x) and form the field E = F(ß). Sinc e the distinct elements
ß, eß, ..., s n~lß ar e al l roots o f \jr(x), w e conclude tha t E i s the Splittin g field.
Now two possible cases may arise:
Case 1 . x/s(x) i s reducible.
In that event let 0(x) b e the irreducible facto r ove r F , wit h (ß(ß) = 0 . (p(x)
will be responsible for a number of the roots
0w= n (*-^)-
some v
The constan t ter m i n 0(JC) , a n element o f F , mus t b e o f th e form ±s^ß r wher e
r = d[(p(x)]. Consequently, ß r = c e F ; i.e., ß satisfie s th e equation x r — c = 0
of r th degre e over F. Sinc e 0(JC) is irreducible of degree r, it follows (Lemm a 4.5,
p. 40) that
(j)(x) = x r — c.
Now E = F()8 ) = F(sß) = • • • = F(£ n~lß) sinc e £ e F . I t is clea r tha t
every s vß satisfie s a n irreducible equation of r th degree ,
xr - (s vßY = x r - s vrc = 0 .
We conclude that \j/(x) factor s int o polynomials al l of the same degree r an d con-
sequently that r\n. W e have ß r = c e F . Puttin g n = rs, w e obtain
ßn = ß rs
= c
s
=b.
Hence the reducible cas e occurs onl y i f b is a power o f c\ s\n. Conversely , i t is
evident if b = <r , s\n, tha t the polynomial yjr(x) ma y be factored int o polynomials
of equal degree r = n/s.
The reducible case is included i n the irreducible cas e since F mus t contain a
primitive r th roo t of unity if it contains a primitive nth root .
A wor d o f caution . Thes e criteri a ca n onl y b e applie d whe n F contain s a
primitive n th roo t of unity. Consider , for example, the polynomial x 4 + 4 over the
rational field R. Th e number —4 cannot be expressed as the Square or fourth power
of any element in Ä, yet we have
x4 + 4 = (x 2 - 2x + 2)(x 2 + 2x + 2).
7.2. SOLUTION OF EQUATIONS BY RADICALS 9 9
Nevertheless, a germ of the result remains even when the roots of unity are not
contained in the ground field. W e digress to discuss the one case in which we may
dispense with the roots of unity.
Let
x//(x) = x p — b, p prime,
and suppose that p is not the characteristic of F. Unless b is a p th power ofsome
element in F, \/f(x) is irreducible.
PROOF: Le t E b e the Splitting field of \jr{x). The roots of \//(x) ar e certainly
distinct since \j/\x) = 0 only if x = 0 and zero is not a root. Le t ß b e any one of
the roots. The roots of \/f(x) ar e then
ß, eß, s 2ß, .. . , e p~lß,
where s i s a primitive p th roo t o f unity. Henc e the Splittin g field of x p — b must
contain the p th root s of unity.
Now, if if(x) coul d be factored i n F, we could find some irreducible factor of
lower degree, say
<t>(x)= Y\(x-e vß).
some v
The constan t ter m i n (/>(x), a n elemen t o f F , i s o f th e for m ±s ßßr wher e r =
3[0(JC)] < p. Wehav e
eßßr =c e F , ß p
= be F .
Now r < p, p prim e =>• (r, p) = 1 . It follows that the equation
rx + py = 1
has a Solution in integers x, y,
cxby = (s ßßr)x(ßp)y = e^ß e F.
Clearly, then, if -^r(jc) is reducible it has a root in the ground field and b is after al l
a p th powe r in F. D
or briefly
100 7. THE THEORY O F EQUATIONS
LEMMA 7. 3 If p is prime and F is any field whatever which does not have the
characteristic p, then a necessary and sufficient condition that the polynomial x p —
b be irreducible is that b is not a p th power of any element in F.
E = F ( a i , a 2 , ...,<**) .
7.2. SOLUTION OF EQUATIONS BY RADICAL S 101
The relatio n betwee n th e fields is given b y the scheme i n the figure below. De -
note the groups of E/F an d E/F b y G and G, respectively. Wha t relation exist s
between G and Gl
An elemen t ö o f G is an automorphisms o f E
that leaves all F fixed and consequently al l F. Now
each ä i s define d b y a permutation o f th e roots of
f(x). Consequently , ä map s th e field E = F(a\,
. . . , a n) ont o itself. Thu s eac h ä provide s a n auto-
morphism of E in which F is fixed. In this way we
associate an element of G with every element of G.
Furthermore, onl y on e element o f G is determine d
by an y ä sinc e ä an d its imag e ar e both uniquel y
determined by the same permutation of the roots.
Now if two successive automorphism s ä , f produc e a certain permutatio n of
the roots, then clearly so do their images, i.e.,
är = p <s > ox — p.
We conclude that G is isomorphic to a subgroup S of G. I t is easy enough to see
which subgrou p i t is. W e shall describ e S b y determining th e field to which i t
belongs.
Let Q be the field corresponding t o S. Q is
some field between E and F, and it consists of ex-
actly thes e element s o f E whic h ar e left fixed by
S an d hence consist s o f thes e element s o f E lef t
fixed by G . Bu t G leaves no other elements fixed
than those of F. Consequently , Q consists exactly
of thos e elements o f E whic h ar e also in F. Q is
the intersection or common par t of the two fields.
(We write Q = E n F.) A schematic diagra m is
shown.
In a measure this result is disappointing. Th e
extension F is helpful onl y insofar a s it contains a
part o f E. Whateve r econom y i s achieved b y the
introduction o f F alread y coul d hav e been achieve d wit h ß . However , w e may
console ourselve s wit h the knowledge tha t we have eliminated an y possibility of
introducing mysterious Solutions.
REMARK. O f course, there are other methods of obtaining roots than algebraic
extension. In the field of real numbers, for example, every fifth degree equation has
a root, the root being defined b y some limiting process. I f we adjoin on e element
to this field, the root of the equation x2 + 1 = 0, then all equations are immediately
solvable in the ground field. How does the introduction of limits tie in with solving
equations?
We shall not digress to answer this question here but leave it as a provocative
query.
102 7. THE THEORY OF EQUATIONS
Suppose the Splitting field of an equation is cyclic. If the ground field does not
contain n distinct n th root s of unity, then these may be adjoined excep t in the case
where the characteristic i s a divisor of n. I n that instance it is useless to hope fo r
a primitive n th roo t of unity. Fo r if p i s the characteristic w e may write n = p vm
where p\m. Bu t we have
xn - 1 =x pVm
- 1 = (x m - l) pV
(proposition o n p. 57). Thu s the n th root s of unity ar e the sam e as the m th. Fo r a
Solution in terms of radicals we must therefore assum e (n, p) = 1 . In that case the
equation x n — 1 == 0 plainly has n distinct roots.
However, something remains to be said for the case when the characteristic is
a divisor of the degree of the Splitting field. If the group is cyclic, we do not expect
a Solution in terms of radicals, but the Solution remains simple.
THEOREM 7. 4 Consider a field F of characteristic p. An extension field E is
normal of degree p over F if and only if E is the Splitting field of an irreducible
equation of the form
p
[1] x -x-a = 0.
This condition is equivalent to the requirement that E be an extension of F by
means of a Single adjunction E — F(a) where a is a root ofthe equation [1].
The root a behave s somewhat like the radical in Theorem 7.1 . W e shall occa-
sionally refer to such elements as "modified radicals. "
PROOF:
(1) If x p — x — s is irreducible in F 9 then the Splitting field E i s obtained by
the adjunction o f a Single root. I t follows tha t (E/F) = p an d therefore tha t the
group is cyclic.
The roots of the polynomial
f(x) — x p — x — a
are distinct sinc e f'(x) — —1. Th e Splitting field must therefore b e normal. No w
f(x) i s periodic of period 1 , for
f(x + 1 ) = ( x + 1 ) * - (j e + 1 ) - a = x p + 1 - x - 1 - a = f(x).
Consequently, i f a i s a root, f(a) = 0 , th e othe r root s ar e obtaine d simpl y b y
repeated addition of 1 :
a, a + 1 , .. . , a + p — 1 .
Having p roots, we have all.
The Splitting field of f(x) i s obviously F(a). Sinc e F(a) = F(a + /x) , each
root a + jii mus t satisf y a n irreducible equatio n o f th e sam e degree a s a ove r F.
Consequently, f(x) factor s into polynomials of equal degree. Clearly, then, if f(x)
is no t irreducible , i t mus t reduce int o linea r factor s an d al l the roots ar e alread y
in F.
7.2. SOLUTIO N OF EQUATIONS BY RADICALS 103
v=0
Since cr(b) = b it follows that b e F. Settin g
P-\
v=0
we obtain
o(ß) = a 2{9) + 2a\6) + • • • + (p - l)a p
-\0)
= a{9) + 2o 2{6) + 3a 3(9) + •• • + ( / ? - \)o {p l)
-0
2 {p l)
-[0 + a{6) + o (0) + a\9) + • • • + a ~ (e)l
that is,
a(ß) = ß-b.
If we then set a — —ß/b w e obtain
a(a) = -{ß)/b = (-ß + b)/b - -ß/b + 1 = a + 1 .
We have constructed a n element a fo r which
av{a) =a + v (v = 0, 1 , . . . , / ?- 1 ) .
The p images of a are plainly distinct and consequently a is a root of an irreducible
polynomial o f degre e p ove r F (Theore m 5.1 6 , p . 66) . W e conclude tha t E =
F{a). I t is only necessary to show that a satisfie s an equation of the form xp —x —
a= 0.
Set a — ap — a. W e have
a(a) = (a + l) p - (a + l) = a p + l - a - 1 = a.
Since a{a) — a it follows that a e F. Thu s a i s a root of the equation
xp - x - a = 0. •
EXAMPLE. Th e polynomial f(x) = x 5 — x — 1 is irreducible in the rational
field R. For , if f(x) wer e reducible in R it would certainly be reducible in R$. But,
from the foregoing, we see that x5 — x — a is reducible in Rs only if a = 0 (mod5).
104 7. THE THEORY OF EQUATIONS
The general case in which the characteristic of a cyclic field is a divisor of the
degree is handled in stages. Suppose E/F i s normal and cyclic of degree n — p vm
where p i s the characteristic an d p\m. Le t o b e the generator o f the group. Th e
element r = op v~xm generate s a cycli c subgrou p S o f orde r p. Th e grou p S
corresponds to a field Q\ whic h is normal and cyclic of degree p v~lm ove r F (cf .
p. 73 ff.). B y repeating this process we arrive at a chain of cyclic normal fields
E = Q 0DQiD '"DQ VD F
where (Q t/ ßf+i ) = p (i = 0 , . . ., v — 1 ) and (Q v/F) = m. I t follows that we can
obtain E fro m F b y adjunction o f m th roots of unity an d one other element, som e
combination of radicals and modified radicals .
EXAMPLE. Conside r the field R(\/2). Th e element \fl satisfie s the irreducible
equation
f(x) =x 4-2 = 0
over R. I n R(^/2), f(x) ha s the factorizatio n
f(x) = (x- V^)(J C + V2)(x 2 + A/2) .
Using the fact that the field of the product of any two factors is the same as the field
of the remaining factor, w e obtain three different cases :
P(x)=x-y/2, Q = R(^/l),
P(x) = x + V2, Q = R(V2),
4
P(x)=x -2, ß = R.
PROOF OF SUFFICIENCY : I f (E/F) i s finite and the number o f fields inter -
mediate betwee n E an d F i s finite, then E ma y b e obtained fro m F b y a Single
adjunction E = F(a). W e consider two possible cases:
(a) F consists of a finite number of elements.
Set n = (E/F). I t is easy to show (see Lemma 5.23, p. 76) that E contains q n
elements. We have proved (Corollary, p. 79) that the nonzero elements of E form a
cyclic group with respect to multiplication. I f a i s the generator of the group, then
E = F(a). Th e field consists simply of the powers of a Single element.
(b) F contains infinitely man y elements.
Since (E/F) i s finite, E can be obtained from F by a finite number of adjunc -
tions, trivially in fact, a s the set of linear combinations of the basis elements. I t is
therefore sufficien t t o prove that an extension of F b y means of two elements ca n
always be obtained by the adjunction o f a Single element, i.e.,
F(a,ß) = F(y)
for any a, ß, an d suitable y i n E.
Consider the elements
yc = a + cß,
where c e F. Sinc e w e have infinitel y man y c' s w e have a n unlimited numbe r
of the y c a t our disposal. Ther e are only a finite number of fields F(y c), however ,
since there are only a finite number of fields between E and F. Consequently , there
must be a pair c,d e F suc h that
F(yc) = F(y d)cF(a,ß).
Now
Yc, Yd G F(y c).
Hence,
(c-d)ß = (y c-yd)eF(yc).
106 7. THE THEORY O F EQUATIONS
reduces to the Solution of a quadratic, the introduction o f cube roots of unity, and
the adjunction o f a cube root of some element of £2.
To show that the general equatio n o f fourth degre e ca n be solve d in terms of
radicals would require a more intimate study of the permutation group of order 24.
But the same method would work. Fo r the general quintic equation, however , we
get the permutation group of order 120, and this cannot be broken down into a chain
of cycli c invarian t subgroups . Thi s doe s not yet prove that a Solution i n radicals
is impossible. S o far w e have only completed th e positive side of the proof; if we
can construct a chain of cyclic invariant subgroups, then an equation with rational
coefficients i s solvable in terms of radicals. It remains to prove the converse.
THEOREM 7.6 If E/F is normal with a commutative group G, then E is a tower
over F.
PROOF: B y takin g th e power s o f an y a e G , a ^ / , w e ca n easil y pic k
out a cyclic subgrou p S. Th e subgrou p S correspond s t o a field Q, E D Q 2> F.
Now Q is normal over F sinc e every subgrou p of a commutative grou p is invari-
ant. Th e group of £l/F i s simply the factor grou p G/S. Sinc e the entire group is
commutative, the factor grou p must again be commutative.
Repeating the process we can determine a field between £ 2 and F whic h has a
cyclic group under Q. In this fashion we can construct a tower of fields between F
and E. D
COROLLARY IfE/F is normal with a commutative group, then E can be ohtained
from F by root extractions and modified radicals.
Is it possible to effect som e economy i n generating E ou t of Fl W e employ
certain root s o f unit y an d othe r radical s an d i t i s desirabl e t o use radical s o f th e
smallest possible index. Thi s can be accomplished b y breaking dow n each cyclic
step into Step s of prim e order . Fo r example , i f a i s a n elemen t o f perio d 1 2 we
could take for the first step either the group of order 2 generated by a6 o r the group
of order 3 generated b y <r 4. I t is clear, then, that we need onl y use roots o f unity
and radicals for which the index is a prime divisor of the order of the group.
Let us consider the special case of R(s) wher e s is a primitive nth root of unity.
If n is a power of a prime, n = p ß, the n s satisfies an irreducible equation of degree
pß~l(p — 1). What , i n general, i s the nature of the group o f R(e)/R7 Th e only
possible automorphisms have the form
(*i(e) = s l,
where we must have (i,n) = 1 , for otherwise the period would be less than n.
EXERCISE 2. Prov e that all the transformation s
at(s) = e l, (i,n) = 1,
are actuall y automorphism s o f th e field R(e). Th e grou p o f R(s)/R i s clearl y
commutative sinc e we have
<Ticrk(e) = cfi(e k) = s ik.
7.4. TOWERS O F FIELDS 109
If the characteristic of F is /?, then we can get rid of all p th root s in the expres-
sion for a. Fo r F(a p) i s a subfield o f F(a) an d a satisfie s th e equations
fix) = 0,
p p
x - a = (J C - a) p = 0,
in F(a p). Sinc e a i s a simple root of fix), th e greatest common divisor of fix)
and xp — ap i s x — a. Bu t both polynomials are in Fia p) an d so, therefore, is their
greatest common divisor. W e conclude that a e Fia p)\ i.e. , a ca n be written as a
polynomial expression in ap. Now , if a pth roo t occurs in a i t can be eliminated in
ap. W e need only employ the relations
(a + b) p =a p + b p an d (ab) p
= a pbp
together wit h th e fact tha t ^ = « / ml . Usin g th e polynomial relatio n be -
tween a an d a p w e get a new expression for a. B y repeating this process w e can
rid the expression fo r a o f al l radicals o f index p. I n other words, if a n equatio n
is solvable by radicals i t can be solve d withou t th e use of p th roots . W e assume,
therefore, that p th root s do not appear in the expression for a.
If pi, p2, . . ., p s ar e th e distinc t indice s o f th e radical s appearin g i n th e ex -
pression for a, w e adjoin to F the ip\) th, ipi) th, • • •, ip s)th root s of unity. The field
Fi i s clearly normal and a tower over F.
Pick out an innermost radical p ^fä in the formula fo r a. For m the field Fi —
p
F\i <i/ä). F2/F i s normal; namely, it is the Splitting field of the polynomial
ixPl - a)ix Pl
- l)ix P2
- 1 ) • • • (je* - 1 ) .
Furthermore, F2/F1 is cyclic (Theorem 7.1). D
110 7. THE THEORY OF EQUATIONS
"i + \
l+l
G
J+l )
(b)
(a)
FIGURE7.1
and
E = F(ai,a 2, ...,«*) .
An automorphism a o f £ i s completely determined if the images of the at unde r a
are known. Now
<*(f(<Xi)) = /(cr(ofi) ) = 0 =^ a(a/) = a k,
7.5. PERMUTATION GROUPS 113
We shall denote the roots by their subscript s alon e an d er wil l then be defined b y
the notation
/l,2,...,n\
\ V l , V 2, . . ., V n)
The product o f two permutations i s defined a s the result of applying them in suc-
cession. In the example of (x 2 — 2)(x 2 — 3 ) we have
/l 2 3 4\/ l2 3 4 \ / l2 3 4 \
\21 4 3J \2 3 4 l) " \l 4 3 2) '
In thi s instanc e the permutation grou p doe s not carr y ever y digi t ove r int o ever y
other digit since, e.g., ot\ canno t go into a^. If, o n the contrary, a group of permuta-
tions carries every digit over into every other digit, we say the group is transitive.
LEMMA 7.9 A necessary and sufficient condition that a group of permutations be
transitive is that the digit 1 can be carried into any other; i.e., as er runs through
the group, <r(l) runs through the digits 1 ,2, ... ,n.
PROOF: Th e conditio n i s obviousl y necessary . I n orde r t o prove i t i s suffi -
cient, we show it is possible to carry any digit j ove r into any other digit k. No w j
and k both appear as images of 1 . Thus there are permutations er, r wit h
cr(l) = j , r(l)=k.
In the inverse to er w e have cr~l(j) = 1 , whence r(a~ l(j)) = k. D
Suppose, for a given group G , that 1 is carried over into the digits 1 ,2,... ,r
and no others. (Thi s is no restriction sinc e we may label our elements s o that the
images of 1 are the first r indices.) B y the proof of Lemma 7.9 it follows tha t any
one of these digits can be mapped onto any other. Furthermore , no permutation of
the group will move any one of the digits 1,2,... ,r int o a digit k > r. Fo r assume
j < r with cr(j) — k. Sinc e j i s an image of 1 , j = r(l) , we have a r ( l) = k.
The grou p permute s r o f th e digit s i n a transitiv e way . Th e se t o f digit s
1,2, ... ,r i s calle d a domain of transitivity. W e ma y divid e al l th e digit s int o
domains o f transitivity , a domain o f transitivit y consistin g o f al l the integers tha t
can be carried into each other by the permutation of the group.
LEMMA 7.1 0 There is a one-to-one correspondence between the irreducible fac-
tor sofa separable polynomial f(x) and the domains of transitivity ofits Galois
group.
PROOF: Le t
/ o r P
V 2 ^ V2 -V2 V2 -V2
V 3 ^ V3 V3 -V3 -73
The effect o f the group on the roots of both equations is given in the tables
x4 - 5x 2 + 6 JE 4 - IOJC 2
+ 1
1234 1234
/1234 / 1234
a 2134 (7 2143
X 1243 r 3412
P 2143 P 4321
In the first case we have two domains of transitivity, each containing two ele-
ments. In the second case we obtain a transitive group of order four. Th e Situation
is entirely different ye t the structure of the abstract group is the same in both cases.
We are particularly intereste d i n group s whic h ar e no t solvable . Apparentl y
most group s ar e solvable , the smalles t nonsolvabl e grou p being o f order 60. Th e
nonsolvable group of order 60 is a simple group; a simple group is a group which
has no invariant subgroup s othe r than itself an d the identity. Clearly , every cyclic
group o f prime orde r i s simple . Apar t fro m these , th e simpl e group s see m to be
very rare . Th e nex t nontrivia l simpl e grou p i s o f orde r 1 6 8 an d i s give n b y th e
symmetries of the abstract geometry of seven points.
This is a projective geometry defined by the postulates:
(a) ther e exists at least one line;
(b) ever y line contains exactly three points;
(c) ther e is at least one point not on a given line;
(d) tw o points lie on exactly one line;
(e) tw o lines intersect in exactly one point.
This geometry i s represented i n the table below an d in the accompanying fig-
ure:
116 7. THE THEORY O F EQUATIONS
l\ 124
h 235
h 346
u 457
h 561
h 672
Li 713
The simpl e grou p o f 1 6 8 elements consist s o f al l permutation s o f th e digit s
1, 2 , . . ., 7 for which the collineation relations of this geometry remain unchanged.
Thus, for example, the eyclic permutations belong to this group.
EXERCISE 4. Determin e all the permutations of the simpl e group of order 168.
The simpl e grou p o f orde r 6 0 also has a geometrical Interpretation . I t is th e
group of rotations of the icosahedron.
There has not yet been an y Solution to the problem of determining al l simpl e
groups.4 Th e first few hav e the Orders 60, 1 68 , 360, 504, 660, 1 092 , there bein g
one simpl e grou p fo r eac h order . However , ther e ma y b e mor e tha n on e simpl e
group of a given order ; ther e are two of orde r 20160. Th e most commo n simpl e
groups appear to be those of order p (p2 — 1) /2 where p is any prime greater than 3.
In particular, the group of the matrices ( acbd) wit h elements in R p an d determinant
equal t o 1 has a factor grou p o f orde r p(p 2 — l)/ 2 wit h respect t o th e invarian t
subgroup
(o ij ; ( o -i )
and this is simple.
It is probable that every group appears as the group of some equation over the
rational field but most equations have certain special groups. The general equation
of n th degre e leads to the so-called Symmetrie group of order n\. Thi s is the group
of all permutations on n objects. In general, the Symmetrie group is nonsolvable as
we shall prove.
A eyclic permutation wil l now be described by writing the digits in the eyclic
order, e.g.,
"= ( 23 \ 4 6 7 5 ) = (1 23)(4)(567) .
axa'1 = (1 24)(36)(75) .
There i s another representatio n whic h i s very useful. Ever y permutatio n ca n
be written as a product of transpositions (2-cycles) but here the terms are not nec-
essarily disjoint. Thus , in particular, we have for a cycle
(0, 1 , 2, . . ., n) = (0 , yi)(0, n - 1 ) • • • (0, 1).
Note. A n (n + l)-cycl e ca n b e writte n a s th e produc t o f n transpositions . Th e
representation a s a product o f transpositions fo r an y permutation ca n be obtained
by writing each ofits disjoin t cycles as above. This is not the only conceivable way
of writing a permutation as a product of transpositions but all such representations
have something in common.
118 7. THE THEORY O F EQUATIONS
\-%n — 1 X n).
If we permute the indices of the x;, then each factor goes into some other factor or
its negative. Thu s a permutation has either the effect o f changing the sign of P o r
leaving it the same. D
EXERCISE 5. Sho w that the effect o f a transposition is to change the sign of P.
If a permutation leaves the sign of P unchanged, it is said to be even; otherwise
it is called odd. The product of two even permutations or two odd permutations is
even. Th e product of an even and an odd is odd. I n view of Exercise 5 it is clear
that an even number of transpositions is required to represent an even permutation,
an odd number of transpositions to represent an odd.
COROLLARY The inverse of a permutation is even or odd according to whether
the original permutation is even or odd.
The even permutations, being closed under multiplication and taking inverses,
form a subgroup of the füll Symmetri e group. Th e subgroup of the even permuta-
tions is clearly invariant:
aSa'1 =S.
The order of the subgroup is easily determined :
There ar e as many eve n permutations a s odd, fo r th e produets o f th e distine t
even permutations wit h an y fixed odd permutation ar e distinet od d permutations .
Hence the number o f od d permutations i s at least a s great a s the number o f eve n
permutations. Bu t if we multiply the distinet odd permutations by any Single odd
permutation we get distinet even produets. Hence the number of even permutations
is not less than the number of odd. We conclude that the even permutations are half
the Symmetrie group. The y constitute a n invariant subgrou p of order \n\. Thi s is
the so-called alternating group on n elements.
The Symmetrie group contains an invariant subgroup of order \n! an d index 2.
The corresponding facto r grou p must then be cyclic. Bu t this, in general, is as far
as the decomposition o f the Symmetrie group can be carried. Excep t i n the case s
n < 4, the alternating group on n elements is nonsolvable.
7.5. PERMUTATION GROUP S 119
Case 4. r = (1 23 ) (456) • • •.
Taking o — (1234) we have by the rule on page 11 7
oxo~x = (1 34)(256 ) (al l other terms remain the same),
axa-xx~x = (1 4235X6) /
= (1 4235) .
Thus we have reduced case 4 to
Case 5. x = (1 234 5 •••/?)••• .
Taking o — (234) w e obtain axa~ x — (13425 • • • p) • • •, the dots indicatin g
elements which remain the same,
orror- 1 r- 1 = (l)(352)(4)(6)...(p) /
= (352) .
Hence if G contains an element of the type of 4 or 5 it contains a 3-cycle. It remains
to complete the discussion of
Case 2. x = (1 2 ) (34) • • •.
)
Editor's note: This is true only for n > 4.
120 7. THE THEORY OF EQUATIONS
we have
E = K(x ux2, . . . , * „ ) .
The xt ar e not interdependent, as we shall prove. Hence the Solution of the general
equation of n th degre e will be given by the field of rational functions o f just n fre e
variables!
Let us investigate this field. Set
E = K(yi,y 2,..., y n)
a = (0 , 1 , 2 , . . . , / ? - 1 ) .
av = (z-+ z + v).
These transformations, w e shall see, are the only linear transformations whic h
leave no digit fixed. Hence , if the lemma is true, the only p-cycle s i n G ar e ele-
ments of G2 . What are the fixed digits of the transformation z — > az + bl Thes e
are Solutions of the equation
z = az + b (mod p)
or, equivalently, of
(a — \)z = —b (mo d p).
There are two cases:
Gase 1. a f £ 1 .
In this case the equation always has a Solution and it is unique.
Case 2. a = 1 .
The transformation ha s the form
z - > z + b.
This is the permutation cr b. It has either all digits fixed or no digits fixed. Thus in
a group of linear transformations th e p-cycles can have only this form.
Now, i f th e lemm a i s true for G t i t is true fo r G/ + i. Firs t o f all , the only p-
cycles in G/ ar e elements of G2 . Hence if r G Gi+\ the n rcrr - 1 i s a p-cycle (se e
rule on p. 1 1 7 ) in G/, r a r - 1 = a b. W e then have
XG — o bx.
whence
r{k + 1 ) = r(k) + b,
r(k + 2 ) = r(Jf c + 1 ) + 1 = r(k) + 2b,
Choose any pair of OL{ , ot^ an d form the field Q = F(a t, o^) . Wha t is the group of
E/Ql Thi s is the subgroup of permutations whic h leave ott an d a^ fixed. But, by
the proof o f Lemma 7.1 9 the only permutation which leaves two elements fixed is
the identity. Hence the group of E/ Q is the identity and E = £2 . D
This theorem has an interesting consequence:
COROLLARY 7.21 A solvable irreducible equation of primedegree which has two
real roots has all roots real.
COROLLARY 7.22 If an irreducible equation is of primedegree greater than three
and possesses precisely three real roots, it cannot be solvable.
EXERCISE 6 . Fin d a n equatio n o f fifth degre e wit h intege r coefficient s an d
with precisely three real roots.
There ar e n o prove n result s concernin g th e frequenc y wit h whic h solvabl e
equations occur , bu t experienc e indicate s tha t mos t equation s hav e th e fül l Sym -
metrie group . Clearl y an y grou p ma y occu r a s th e Galoi s grou p o f a n equatio n
provided w e do not preassign th e groun d field. O n the other hand, i f th e groun d
field is the field of rational numbers , i s it alway s possible t o determin e a norma l
extension which has that group? Th e answer is probably yes, but nobody has suc-
ceeded in finding a proof. Naturally, we cannot expect such a result for any ground
field. For example, every polynomial in the field of complex numbers possesses a
Splitting in the ground field. In the field of real numbers any equation can be solved
126 7 . THE THEORY O F EQUATION S