Home/Math reference/Fields & Galois theory

Math reference · Part 6 of 9

Fields & Galois theory

Field extensions, the Galois correspondence, the algebra↔geometry dictionary (Nullstellensatz), and impossibility-by-invariant — when something simply cannot be done.

~15 min read P4P6RD7

Galois theory is the part of algebra that turns “can you reach this target with these tools?” into a question about symmetry — and answers it with a hard no when the symmetry says so. Its two great dualities — the Galois correspondence (fields ↔︎ symmetry groups) and the Nullstellensatz (equations ↔︎ solution sets) — are both Galois connections that snap shut into exact bijections precisely when a sharp side-condition holds. It is also the native algebra of engineering: the finite fields \(\mathrm{GF}(2^n)\) behind Reed–Solomon codes, AES, and CRCs live and breathe here. The headline is impossibility-by-invariant: the same machinery that proves you cannot trisect an angle with compass and straightedge is the template for proving this state is unreachable with this control law.

Fields, and the ladder of capability upgrades

A field is a number system where you can add, subtract, multiply, and divide by anything nonzero — \(\mathbb{Q}\), \(\mathbb{R}\), \(\mathbb{C}\), and the finite field \(\mathbb{F}_p=\mathbb{Z}/p\) are canonical. Formally it is a commutative ring with \(1\ne 0\) in which every nonzero element is a unit, and it carries a characteristic — either \(0\) or a prime \(p\). (\(\mathbb{Z}\) is not a field: you cannot divide, so it is only a ring.)

The action begins when you enlarge a field. A field extension \(F/k\) (read “\(F\) over \(k\)”) is a bigger field \(F\) containing a smaller one \(k\) — a capability upgrade: \(\mathbb{R}(i)=\mathbb{C}\) adds a square root of \(-1\); \(\mathbb{Q}(\sqrt 2)\) adds a square root of \(2\); and \(\mathrm{GF}(2^8)/\mathrm{GF}(2)\) adds enough roots that AES can multiply bytes and Reed–Solomon can locate errors. Decisively, \(F\) is automatically a vector space over \(k\), so “how much bigger is \(F\)?” has a one-number answer.

Degree \([F:k]=\dim_k F\) — the dimension of \(F\) as a \(k\)-vector space, a single integer (or \(\infty\)). This is the workhorse numerical invariant for every impossibility proof below.
Tower law — for \(k\subseteq E\subseteq F\), degrees multiply: \[[F:k]=[F:E]\,[E:k].\] Each stage degree therefore divides the total. A prime total degree forbids any proper intermediate field.

Intuition. A field is a calculator you can fully divide on; an extension is a firmware update that unlocks new buttons (a root, an \(i\)). The degree counts how many independent “new coordinates” the update adds, and the tower law says coordinates of a two-stage upgrade multiply — exactly like composing linear maps stacks dimensions.

In the synthesis. The tower law is a clean mediation-through-intermediates example (P3): a two-stage factorization \(k\to E\to F\) has its overall “size” pinned down multiplicatively, and the reachable intermediate stages are constrained by divisibility. The degree itself is a dimension invariant (cf. rank and module dimension) repurposed as a reachability budget.

Pitfall. The clean multiplicativity belongs to field-extension degrees, which inherit it from vector-space structure. Do not assume an arbitrary cascade of subsystems composes its “sizes” multiplicatively — the dimensions of cascaded state spaces add, not multiply; that is a property of this specific algebra, not a law of pipelines.

Constrained vs free: algebraic elements and minimal polynomials

Inside an extension, each new element is one of two types. An element \(\alpha\in F\) is algebraic over \(k\) if it satisfies some nonzero polynomial equation with coefficients in \(k\) — it is constrained by an equation — and then \([k(\alpha):k]<\infty\). Otherwise \(\alpha\) is transcendental: utterly free, with \(k(\alpha)\cong k(x)\) the (infinite) field of rational functions. The number \(\sqrt 2\) is algebraic over \(\mathbb{Q}\) (it solves \(x^2-2=0\)); \(\pi\) is transcendental.

For an algebraic \(\alpha\) there is a single equation that says everything:

Definition (Minimal polynomial). The minimal polynomial \(m_\alpha\in k[x]\) is the unique monic, irreducible polynomial over \(k\) having \(\alpha\) as a root. It divides every polynomial that vanishes at \(\alpha\), and \([k(\alpha):k]=\deg m_\alpha\).

Why it exists and is unique

The polynomials vanishing at \(\alpha\) form an ideal of \(k[x]\). Because \(k[x]\) is a principal ideal domain, that ideal is generated by one element; choosing the monic generator gives \(m_\alpha\), and irreducibility follows because \(k(\alpha)\) being a field forces the generator to be prime.

Example (LFSR / AES bytes). Build \(\mathrm{GF}(2^8)\) as \(\mathrm{GF}(2)[x]/(m)\) where \(m\) is an irreducible degree-\(8\) polynomial over \(\mathrm{GF}(2)\) — exactly the minimal polynomial of a generator \(\alpha\), so \([\mathrm{GF}(2^8):\mathrm{GF}(2)]=\deg m=8\) (one bit per coordinate). AES fixes \(m=x^8+x^4+x^3+x+1\); a maximal-length LFSR or PRBS generator is the same object run as a recurrence, its feedback polynomial being a primitive minimal polynomial whose root \(\alpha\) cycles through all \(2^8-1\) nonzero states.

In the synthesis. The minimal polynomial is the field-theoretic exemplar of a canonical form / minimal invariant (P13): the smallest piece of data that decides reachability questions about \(\alpha\) — the direct analogue of the minimal polynomial of a matrix. It is the least you must know to answer “is this target attainable, and at what cost in degree?”

Pitfall. “Minimal” is always relative to the base \(k\). The element \(i\) has minimal polynomial \(x^2+1\) over \(\mathbb{R}\) but \(x-i\) over \(\mathbb{C}\). Change the base — change the toolkit — and the canonical form changes with it.

The two worlds: symmetry, fixed fields, and the Galois-ness gap

To get a correspondence we need a second world to pair against the fields. That world is symmetry. The automorphism group \(\mathrm{Aut}_k(F)\) — the Galois group when the extension is nice — consists of all field automorphisms of \(F\) that fix \(k\) pointwise, made into a group under composition. Its elements permute the roots of every \(k\)-polynomial: this is the “symmetry content” of the extension.

Running the construction backwards, any subgroup \(G\le\mathrm{Aut}(F)\) has a fixed field \(F^G\) — everything left unmoved by every symmetry in \(G\). This is a subfield sitting above \(k\), and it is antitone: a bigger group of symmetries fixes a smaller field. That order-reversal is the heart of the whole theory.

Two structural facts control how tightly the two worlds fit:

Artin’s theorem. For a finite group \(G\) of automorphisms, \([F:F^G]=|G|\), and \(F/F^G\) is automatically a well-behaved (Galois) extension with group \(G\).
The general inequality. Always \(|\mathrm{Aut}_k(F)|\le[F:k]\). Equality is special.

Definition (Normal, separable, Galois). \(F/k\) is normal if, whenever it contains one root of an irreducible \(k\)-polynomial, it contains all of them (it is a splitting field — see below). It is separable if minimal polynomials have no repeated roots (automatic in characteristic \(0\) and over finite fields). It is Galois if it is finite, normal, and separable — and then \(|\mathrm{Aut}_k(F)|=[F:k]\) exactly.

The Galois-ness gap. Define \([F:k]-|\mathrm{Aut}_k(F)|\ge 0\). This gap is zero exactly when the extension is Galois — the two worlds match perfectly. A large gap signals decoupling: the field has more internal distinctions than its symmetries can resolve.

In the synthesis. The Galois group is the symmetry world of a duality (P9), and “fixed by every symmetry” is the precise model of an invariant / observable that survives all symmetry (cf. impossibility invariants, P11). Normality + separability are the sharp side-condition that upgrades a lossy Galois connection into a perfect bijection — the structural origin of the gap, which itself is a candidate diagnostic for degeneracy and decoupling (RD7).

The Galois correspondence

Splitting fields are the raw material: the splitting field of a polynomial \(p\) is the smallest extension in which \(p\) factors completely into linear pieces (e.g. \(\mathbb{Q}(\sqrt 2)\) for \(x^2-2\)), and the algebraic closure \(\overline{k}\) — modelled by \(\mathbb{C}=\overline{\mathbb{R}}\) — is the smallest extension in which every polynomial splits. When \(F/k\) is Galois, the two worlds we built are not merely connected; they are the same lattice, read upside down.

Theorem (Fundamental theorem of Galois theory). For a Galois extension \(F/k\) with group \(G=\mathrm{Gal}(F/k)\), the maps \[E\;\longmapsto\;\mathrm{Aut}_E(F),\qquad H\;\longmapsto\;F^{H}\] are mutually inverse, inclusion-reversing bijections between intermediate fields \(k\subseteq E\subseteq F\) and subgroups \(H\le G\). They satisfy \([F:E]=|H|\) and \([E:k]=[G:H]\). Moreover \(E/k\) is normal iff \(H\) is a normal subgroup, and then \(\mathrm{Gal}(E/k)\cong G/H\).

Reading a tower of fields top-to-bottom corresponds to reading a chain of subgroups bottom-to-top — the order flips, intersections trade places with joins, and it is a genuine lattice anti-isomorphism. The simplest non-trivial picture is \(\mathbb{Q}(\sqrt2,\sqrt3)/\mathbb{Q}\), whose group is \(G=\{1,\sigma,\tau,\sigma\tau\}\cong(\mathbb{Z}/2)^2\) (with \(\sigma:\sqrt2\mapsto-\sqrt2\), \(\tau:\sqrt3\mapsto-\sqrt3\)). The middle layer of three fields matches, in reverse, the three order-2 subgroups:

\[ \begin{CD} F=\mathbb{Q}(\sqrt2,\sqrt3) @>{\text{Aut}_{(-)}F}>> \{1\} \\ @AAA @VVV \\ \mathbb{Q}(\sqrt2) @>>> \langle\tau\rangle \\ @AAA @VVV \\ \mathbb{Q} @>>{F^{(-)}}> G \end{CD} \]

The left column climbs through fields (inclusions pointing up); the right column descends through the matched subgroups (inclusions pointing down). The horizontal arrows are the two inverse maps of the correspondence — passing from a field to the subgroup that fixes it, and back. The full lattice, written out, is:

Intermediate field \(E\)	Subgroup \(H=\mathrm{Aut}_E(F)\)	\([E:\mathbb{Q}]\)	\(\lvert H\rvert\)
\(\mathbb{Q}\)	\(G=\{1,\sigma,\tau,\sigma\tau\}\)	\(1\)	\(4\)
\(\mathbb{Q}(\sqrt2)\)	\(\langle\tau\rangle=\{1,\tau\}\)	\(2\)	\(2\)
\(\mathbb{Q}(\sqrt3)\)	\(\langle\sigma\rangle=\{1,\sigma\}\)	\(2\)	\(2\)
\(\mathbb{Q}(\sqrt6)\)	\(\langle\sigma\tau\rangle=\{1,\sigma\tau\}\)	\(2\)	\(2\)
\(\mathbb{Q}(\sqrt2,\sqrt3)\)	\(\{1\}\)	\(4\)	\(1\)

Notice the columns of degrees and orders multiply to \(4=[F:\mathbb{Q}]=|G|\) on every row — the tower law and Lagrange’s theorem in lockstep.

The native engineering instance is even cleaner. Every extension \(\mathrm{GF}(p^n)/\mathrm{GF}(p)\) is Galois with a cyclic group generated by a single symmetry, the Frobenius automorphism \(\varphi:x\mapsto x^{p}\). Subfields then correspond to subgroups of \(\mathbb{Z}/n\) — i.e. to divisors of \(n\): \(\mathrm{GF}(p^d)\subseteq\mathrm{GF}(p^n)\) exactly when \(d\mid n\), matched to \(\langle\varphi^{d}\rangle\). Frobenius also names the conjugates: the roots of an irreducible factor are the cyclotomic coset \(\{\beta,\beta^{p},\beta^{p^2},\dots\}\), the orbit of \(\varphi\). This is precisely the bookkeeping behind BCH/Reed–Solomon code design, where one chooses roots by their conjugacy classes under \(x\mapsto x^p\).

In the synthesis. This is THE archetype of a duality between two structured worlds gated by a sharp condition — the template the synthesis borrows for the sensory↔︎physical duality (P9). It is the special case of the abstract Galois connection in which the Galois hypothesis makes both round-trips exact identities (no closure slack), and it is the algebraic cousin of the concept lattices of formal concept analysis (RD6).

Pitfall — it is a template, not a literal model. The Galois correspondence is a self-duality of one object: subfields of \(F\) versus automorphisms of that same \(F\). It is not a bridge between two independent worlds. Borrow its shape — an antitone lattice bijection gated by a condition — but do not claim that physics is the automorphism group of sensation. The gap has a literal meaning only once you build a single object whose subobjects and symmetries are the two lattices you care about. (Infinite extensions need the Krull topology and closed subgroups.)

The other duality: the Nullstellensatz dictionary

Galois theory has a geometric twin that translates algebra into geometry. Fix an algebraically closed field. Polynomial equations (packaged as an ideal \(I\)) and their solution sets (varieties) correspond, again antitonely, via \[V(I)=\{p: f(p)=0\ \ \forall f\in I\},\qquad I(S)=\{f: f|_S=0\}.\] Here \(V(I)\) is the feasible set cut out by the constraints \(I\), and \(I(S)\) is the set of constraints forced by a region \(S\). More constraints carve out a smaller feasible set: order-reversing, a Galois connection once more.

Theorem (Hilbert’s Nullstellensatz). Over an algebraically closed field, \(I\bigl(V(I)\bigr)=\sqrt I\), the radical of \(I\). Consequently \(V\) and \(I\) restrict to a bijection between radical ideals and varieties. The geometry is carried by the coordinate ring \(k[V]=R/I(V)\); irreducible varieties correspond to prime ideals and points to maximal ideals.

The radical \(\sqrt I\) identifies \(f=0\) with \(f^2=0\): it models a constraint satisfied robustly, the closest algebraic gesture toward “spec satisfied with margin.” The whole dictionary is functorial — a duality in the categorical sense.

Example (inverse kinematics). A robot arm’s inverse kinematics — find joint angles placing the end-effector at a target pose — is a polynomial system once each \(\cos\theta_i,\sin\theta_i\) is a variable tied by \(c_i^2+s_i^2=1\). The achievable configurations form the variety \(V(I)\) of that ideal \(I\); “is this pose reachable?” is “is \(V(I)\) nonempty?”, and a Gröbner basis of \(I\) triangulates the system to decide it and count solutions. Reachability ↔︎ a nonempty variety is the equality-constrained face of feasibility.

In the synthesis. This is the formal home of constraints ↔︎ feasible sets (P5): a spec becomes an ideal, the achievable region becomes its variety, and the radical encodes margin. It is adjacent to feasibility modeling rather than a drop-in for it (see the pitfall).

Pitfall — two strong hypotheses. (1) Algebraic closure is required. Over \(\mathbb{R}\), the polynomial \(x^2+1\) generates a proper radical ideal whose real variety is empty, so the bijection fails — you need the Real Nullstellensatz. (2) Equalities only. Varieties are cut by \(f=0\), never by inequalities \(f\ge 0\); real engineering feasible sets are inequality-defined semialgebraic sets / Pareto fronts, the province of real algebraic geometry and the Positivstellensatz. Use the Nullstellensatz as the equality, algebraically-closed idealization of P5 — and abandon it the moment you need genuine trade-off fronts.

Impossibility-by-invariant and solvability by radicals

Now the payoff — both dualities feed one methodology: a target is unreachable by a fixed set of operations if attaining it would violate a numerical invariant those operations cannot change. Enlarge the toolkit and the boundary moves, but never past the invariant.

The engineering reflex is the same. For a linear system \(\dot x=Ax+Bu\), no control input \(u\) can ever steer the state out of the reachable subspace \(\mathcal{R}=\operatorname{im}[\,B\ AB\ \cdots\ A^{n-1}B\,]\) — an \(A\)-invariant subspace is the obstruction, and a target outside it is unreachable with this actuation, full stop, just as \(\sqrt[3]{2}\) is unreachable with compass and straightedge. Adding actuators enlarges \(B\) and moves the boundary; it never lets you escape the invariant subspace the current \((A,B)\) defines.

The classical case is straightedge-and-compass construction. Every constructible number sits atop a tower of degree-\(2\) extensions, so \([\mathbb{Q}(\alpha):\mathbb{Q}]\) must be a power of \(2\) — one fact that kills three ancient problems at once:

Problem	Number involved	Degree over \(\mathbb{Q}\)	Verdict
Doubling the cube	\(\sqrt[3]{2}\)	\(3\)	impossible (not a power of \(2\))
Trisecting a general angle	root of a cubic	\(3\)	impossible
Squaring the circle	\(\pi\)	transcendental	impossible

The invariant doing all the work is the degree — a dimension invariant pressed into service as an obstruction. Swap in a marked ruler or paper-folding (origami), and the reachable degrees grow: the boundary shifts, exactly as enlarging an operation set should.

The deepest instance is solvability by radicals — the precise sense in which a “closed-form expression” exists, built from the coefficients by \(+,-,\times,\div\) and \(n\)-th roots. Adjoining an \(n\)-th root is, in symmetry terms, a single cyclic (abelian) step. Stacking such steps means the Galois group must decompose into abelian pieces:

Theorem (Galois’ criterion). A polynomial \(p\) is solvable by radicals over \(k\) iff its Galois group \(G=\mathrm{Gal}(F/k)\) is solvable — i.e. admits a chain \(\{e\}=G_0\trianglelefteq G_1\trianglelefteq\cdots\trianglelefteq G_m=G\) with every quotient \(G_{i+1}/G_i\) abelian. Since the symmetric group \(S_n\) is non-solvable for \(n\ge 5\), the general quintic has no radical formula, while degrees \(\le 4\) are solvable — recovering the classical quadratic, cubic, and quartic formulas.

In the synthesis. This is the field-theoretic archetype of reachable by atomic operations ⟺ symmetry decomposes into atomic pieces — a striking reachability ⟺ decomposability principle, and the template for diagnosing when a design target is attainable from a given library of process steps. Impossibility relative to a capability set is exactly P11, and the whole pattern is the mathematical seed of an impossibility/degeneracy diagnostic (RD7): find a quantity preserved by every allowed move, then exhibit a forbidden value.

Bridge. The materials counterpart is the goal of an impossibility certificate for recipes: see the R&D system and the materials bridge. The aspiration is to attach to a process family an invariant — preserved by every step in the allowed library — and to read off “no recipe here reaches this spec” the way the degree reads off “no compass construction reaches \(\sqrt[3]2\).” As in Galois theory, the certificate is only as sharp as your explicit list of atomic moves and the symmetry object attached to them.

Pitfall. Every impossibility proof rules out only the specific operation set whose invariant you used. “Impossible” here always means “impossible given these operations,” never absolutely — so always state the invariant and the operation set it certifies against.

Recap

A field lets you divide; a field extension \(F/k\) is a capability upgrade measured by one number, the degree \([F:k]\), which multiplies along towers and constrains intermediates by divisibility (P3).
The minimal polynomial is the canonical minimal invariant (P13) deciding an algebraic element’s reachability — always relative to the base field.
The Galois correspondence is an order-reversing bijection between intermediate fields and subgroups, exact precisely when the extension is normal + separable; the Galois-ness gap measures how far a duality falls short and flags decoupling (P9, RD7).
The Nullstellensatz is the twin duality, equations ↔︎ feasible sets (P5), valid over algebraically closed fields and for equalities only — a false friend for real, inequality-defined trade-off fronts.
Impossibility-by-invariant is the unifying message: degree forbids three classical constructions; non-solvability of \(S_{n\ge5}\) forbids any closed-form quintic formula; an \(A\)-invariant subspace forbids steering a linear system off its reachable set. The materials analogue (P11, RD7) is a provable “no recipe in this family can hit this spec” — sharp only relative to the stated operation set.

Part of a four-document set: the ARiSE draft (problem + AI solution), this modular Mathematics reference, the companion materials reference, and the synthesis. Generated from modular Markdown with a custom static-site builder.

Mathematics is typeset with MathJax (loaded once from a CDN with Subresource Integrity; needs network on first view). Diagrams are inline SVG and follow the light/dark theme. Keyboard: / search · [ ] prev/next · t theme.