Algebraic Geometry Codes

March 15, 2025

Recall the definition of Reed-Solomon codes.

Let n,k0n,k \geq 0 and qnq \geq n. Let α1,,αnFq\alpha_1,\ldots,\alpha_n \in \bbf_q and define the Reed-Solomon Code with parameters q,n,k,αq,n,k,\alpha as

RSq(n,k)={(f(α1),,f(αn))fFq[x],degf<k}RS_q(n,k) = \{(f(\alpha_1), \ldots, f(\alpha_n)) \mid f \in \bbf_q[x], \deg f < k\}

That is, RSq(n,k)RS_q(n,k) codes are defined as evaluation maps of polynomials of degree at most kk on nn points α1,,αn\alpha_1,\ldots,\alpha_n in Fq\mathbb{F}_q. Geometrically, Fq\mathbb{F}_q is an affine line which we can projectivize to PFq1=:P1\bbp^1_{\bbf_q} =: \bbp^1.

We can view f(x)Fq[x]f(x) \in \bbf_q[x] equivalently as a rational function on P1\bbp^1. Then the condition degf<k\deg f < k is restated equivalently as ff having a pole of order at most k1k-1 at \infty.

In the notation of divisors, div(f)+(k1)[]0\div(f) + (k-1)[\infty] \geq 0, or equivalently fOP1((k1)[])f \in \calo_{\bbp^1}((k-1)[\infty]). Thus what if instead we try curves CC other than P1\bbp^1, and divisors GG other than (k1)[1:0](k-1)[1:0]? This is exactly the intuition behind algebraic geometry codes.

Algebraic Geometry Codes

Throughout this post, we will fix the following notation. CC will denote a smooth, projective, connected, irreducible algebraic curve over Fq\bar{\bbf}_q of genus gg. We will let p1,,pnp_1,\ldots,p_n be pairwise distinct Fq\bbf_q-rational points of CC. We will let D=p1++pnD = p_1 + \ldots + p_n denote the corresponding divisor, and GG will denote another divisor on CC such that SuppDSuppG=\supp D \cap \supp G = \emptyset.

We will use Riemman-Roch for curves particularly often, so we restate it below.

Theorem (Riemann-Roch) For a smooth, projective, connected, irreducible curve CC of genus GG with canonical divisor KCK_C, and a divisor DD on CC,

(D)(KCD)=degD+1g\ell(D) - \ell(K_C-D) = \deg D + 1 - g

We can now define the primary object of interest in this post, which is our error correcting code constructed using our theory of algebraic curves.

Definition (AG Code) The algebraic geometry (AG) code associated with the curve CC and divisors D,GD,G above is denoted C(D,G)\calc(D,G) and is given by

C(D,G):={(f(p1),,f(pn)):fΓ(C,OC(G))}Fqn\calc(D,G) := \{(f(p_1),\cdots,f(p_n)) : f \in \Gamma(C, \calo_C(G))\} \subseteq \mathbb{F}_q^n

Note that f(pi)f(p_i) is well-defined since div(f)+G0\text{div}(f) + G \geq 0 and SuppDSuppG=\supp D \cap \supp G = \emptyset, i.e. if ff has a pole at pip_i then ordpi(f)<0\ord_{p_i}(f) < 0, which would imply div(f)+G≱0\text{div}(f) + G \not \geq 0 since GG is not supported on pip_i.

So now having defined AG codes, we can ask ourselves about the properties of such codes, for example, the distance and the message length.

Theorem The following hold.

  • C(D,G)\calc(D,G) is a linear [n,k,d]q[n,k,d]_q code such that
dndegGandk=(G)(GD)d \geq n- \deg G \quad and \quad k = \ell(G) - \ell(G - D)
  • If degG<n\deg G < n then kdegG+1gk \geq \deg G + 1 - g. If in addition 2g2<degG<n2g -2 < \deg G < n, we have equality. That is,
k=degG+1gk = \deg G + 1 - g
  • If {f1,,fk}\{f_1,\ldots,f_k\} is a basis of Γ(C,OC(G))\Gamma(C,\calo_C(G)) then the n×kn \times k matrix
M=(f1(p1)f2(p1)fk(p1)f1(pn)f2(pn)fk(pn))M = \begin{pmatrix} f_1(p_1) & f_2(p_1) & \ldots & f_k(p_1) \\ \vdots & \vdots & \ddots & \vdots \\ f_1(p_n) & f_2(p_n) & \ldots & f_k(p_n) \end{pmatrix}

is a generator matrix for C(D,G)\calc(D,G).

Proof. If we consider the evaluation map, ev:Γ(C,OC(G))Fqnev: \Gamma(C,\calo_C(G)) \to \mathbb{F}_q^n,

ev(f)=(f(p1),,f(pn))ev(f) = (f(p_1),\cdots,f(p_n))

we see that ff is in the kernel of evev if and only if div(f)+GD0\div(f) + G - D \geq 0, i.e. ff has at a zero of order at least 11 at each pip_i. Equivalently, fΓ(C,OC(GD))f \in \Gamma(C, \calo_C(G-D)) and so the kernel of evev has dimension (GD)\ell(G-D). By rank-nullity we indeed get

k=dim(C(D,G))=(G)(GD)k = \dim(\calc(D,G)) = \ell(G) - \ell(G-D)

If degG<degD=n\deg G < \deg D = n then (GD)=0\ell(G-D) = 0 (that is, evev is injective) and k=(G)k = \ell(G) which can be computed directly using Riemann-Roch, where KCK_C is the canonical divisor.

(G)=deg(G)+1g+(KCG)\ell(G) = \deg(G) + 1 - g + \ell(K_C-G)

In particular, (G)degG+1g\ell(G) \geq \deg G + 1 - g and we have equality iff 2g2=deg(KC)<deg(G)2g-2 = \deg(K_C) < \deg(G).

For the distance, let us consider a non-zero fΓ(C,OC(G))f \in \Gamma(C,\calo_C(G)) such that ev(f)ev(f) is of weight dd. This means ff vanishes on ndn-d points pi1,,pindp_{i_1},\cdots, p_{i_{n-d}}. Then

fΓ(C,OC(G(pi1++pind)))f \in \Gamma(C,\calo_C( G - (p_{i_1} + \cdots + p_{i_{n-d}})))

and so (G(pi1++pind))>0\ell(G - (p_{i_1} + \cdots + p_{i_{n-d}})) > 0, so in particular, we must have degG(nd)0\deg G - (n-d) \geq 0 which implies that dndegGd \geq n - \deg G.

It's clear that if f=c1f1++ckfkf = c_1 f_1 + \ldots + c_k f_k then f(pi)=c1f1(pi)++ckfk(pi)=[Mf]if(p_i) = c_1 f_1(p_i) + \ldots + c_k f_k(p_i) = [Mf]_i.

Dual Codes

We can construct another code using the two divisors GG,DD, using another point of view. The AG codes defined above used an evaluation point of view. We can also use the residual point of view to create some linear codes.

Fix the same notation for C,D=p1++pnC,D = p_1 + \ldots + p_n, and GG such that SuppGSuppD=\supp G \cap \supp D = \emptyset as in the previous subsection, and recall the notions of the rational differential 11-form ω\omega on CC, canonical divisor KC=div(ω)K_C = \div(\omega), and canonical bundle ΩC=OC(KC)\Omega_C = \calo_C(K_C).

Recall Γ(C,ΩC(Z))\Gamma(C, \Omega_C(Z)) is the space of global rational 11-forms η\eta such that div(η)Z0\div(\eta) - Z \geq 0. Note that

Γ(C,ΩC(Z))=Γ(C,OC(KCZ))\Gamma(C, \Omega_C(Z)) = \Gamma(C,\calo_C(K_C - Z))

since giving a rational ff such that div(f)+KCZ=div(f)+div(ω)Z0\div(f) + K_C - Z = \div(f) + \div(\omega) - Z \geq 0 is the same as giving a rational 11-form η=fω\eta = f\omega such that div(η)Z0\div(\eta) - Z \geq 0.

For a rational 11-form η\eta, suppose the local component at pp is given by ηp=f(z)dz\eta_p = f(z)dz. If ηp=f(z)dz\eta_p = f(z)dz, the residue Resp(η)\res_{p}(\eta) of η\eta at pp is defined as the coefficient of 1/z1/z in the Laurent series expansion of f(z)f(z) at 0.

Residue Theorem Let ω\omega be a rational differentiable 11-form on a smooth projective curve CC. Then

pCResp(ω)=0\sum_{p \in C} \res_p(\omega) = 0

This is the analogue of the residue theorem in complex analysis, but the proof is rather involved so we omit it.

Definition Let C,D,GC,D,G be as before with SuppDSuppG=\supp D \cap \supp G = \emptyset. Define the following code.

CΩ(D,G)={(Resp1(η),,Respn(η)) ⁣:ηΩC(GD)}Fqn\calc_{\Omega}(D,G) = \{(\res_{p_1}(\eta), \ldots, \res_{p_n}(\eta)) \colon \eta \in \Omega_C(G-D)\} \subseteq \bbf_q^n

Then we have an analogue of the Theorem in the previous section for AG codes whose proof contains essentially the same ideas.

Theorem The following statements hold.

  • CΩ(D,G)\calc_{\Omega}(D,G) is a linear [n,k,d]q[n,k',d']_q code such that
ddegG(2g2)andk=(KCG+D)(KCG)d' \geq \deg G - (2g-2) \quad and \quad k' = \ell(K_C - G + D) - \ell(K_C - G)
  • If 2g2<degG2g-2 < \deg G then k=(KCG+D)ndegG+g1k' = \ell(K_C-G+D) \geq n - \deg G +g-1.
  • If 2g2<degG<n2g-2 < \deg G < n we have equality, i.e.
k=ndegG+g1k' = n - \deg G + g -1

Proof. Consider the residue map Res ⁣:ΩC(GD)Fqn\res \colon \Omega_C(G-D) \to \bbf_q^n, defined as

Res(η)=(Resp1(η),,Respn(η))\res(\eta) = (\res_{p_1}(\eta), \ldots, \res_{p_n}(\eta))

The kernel consists of rational 11-forms η\eta with 00 residue at each pip_i. Since div(η)G+D0\div(\eta) - G + D \geq 0, each η\eta has a pole of order at most 11 at every pip_i and having residue 00 implies it has no pole, i.e. div(η)G0\div(\eta) - G \geq 0. Thus the kernel is Γ(C,ΩC(G))=Γ(C,OC(KCG))\Gamma(C,\Omega_C(G)) = \Gamma(C,\calo_C(K_C-G)) which has dimension (KCG)\ell(K_C-G).

Similarly, the domain has dimension (KCG+D)\ell(K_C-G+D). By rank-nullity,

k=(KCG+D)(KCG)k' = \ell(K_C-G+D) - \ell(K_C-G)

In particular, if 2g2=degKC<degG2g-2 = \deg K_C < \deg G then (KCG)=0\ell(K_C-G) = 0 and by Riemann-Roch

k=(KCG+D)=(GD)deg(GD)+g1ndegG+g1\begin{align*} k' &= \ell(K_C-G+D) \\ &= \ell(G-D) - \deg(G-D) + g - 1 \\ &\geq n - \deg G + g - 1 \end{align*}

If in addition degG<degD=n\deg G < \deg D = n then (GD)=0\ell(G-D) = 0 and we have equality.

For distance, suppose ndn-d' residues vanish pi1,,pindp_{i_1},\ldots,p_{i_{n-d}} for some η\eta. Then

ηΓ(C,ΩC(GD+pi1++pind))\eta \in \Gamma(C,\Omega_C(G-D + p_{i_1} + \ldots + p_{i_{n-d'}}))

so (KCG+D(pi1++pind))>0\ell(K_C - G+D - (p_{i_1} + \ldots + p_{i_{n-d'}})) > 0. We must then have degKCdegG+n(nd)0\deg K_C - \deg G + n - (n-d') \geq 0, i.e.

ddegG(2g2)d' \geq \deg G - (2g-2)

In the particular case 2g2<degG<n2g-2 < \deg G < n, the above theorem and the analogous theorem for AG codes imply

k+k=(ndegG+g1)+(degG+1g)=nk' + k = (n-\deg G + g-1) + (\deg G + 1 - g) = n

but it's not too hard to check this holds in general, i.e.

k+k=(KCG+D)(KCG)+(G)(GD)=nk' + k = \ell(K_C-G+D) - \ell(K_C-G) + \ell(G) - \ell(G-D) = n

(Hint: use Riemann-Roch). There's good reason for this, as we see with the next theorem.

Theorem (Duality) The codes C(D,G)\calc(D,G) and CΩ(D,G)\calc_{\Omega}(D,G) are dual to each other;

C(D,G)=CΩ(D,G)\calc(D,G) = \calc_{\Omega}(D,G)^{\bot}

Proof. We showed that k+k=nk + k' = n so it suffices to show any codewords satisfy ev(f),Res(η)=0\langle ev(f), \res(\eta) \rangle = 0. Note that since div(f)+G0\div(f) + G \geq 0 and div(η)G+D0\div(\eta) - G + D \geq 0, we have div(fη)+D0\div(f \eta) + D \geq 0 so fηf\eta has a pole of order at most 11 at each pip_i. The residue of fηf\eta at pip_i is f(pi)Respi(η)f(p_i)\res_{p_i}(\eta), hence

ev(f),Res(η)=i=1nf(pi)Respi(η)=i=1nRespi(fη)\langle ev(f), \res(\eta) \rangle = \sum_{i=1}^n f(p_i) \res_{p_i}(\eta) = \sum_{i=1}^n \res_{p_i}(f\eta)

Note that as div(fη)+D0\div(f\eta) + D \geq 0 these are the only possible poles, i.e. the only points with nonzero residues, hence by the residue theorem

i=1nRespi(fη)=pCResp(fη)=0\sum_{i=1}^n \res_{p_i}(f\eta) = \sum_{p \in C} \res_{p}(f\eta) = 0

Conclusion

It turns out that these codes have merits other than this interesting relationship between AG codes and their duals being constructed from the trivial bundle and its Serre dualizing sheaf.

There is a famouns bound on the rate and distance of error correcting codes called the Gilbert-Varshamov (GV) Bound established in 1957, which includes a proof that there are a family of linear codes at or above this bound. The proof was non-constructive and instead selected linear codes at random.

It was an open question for around 30 years as to whether there were a family of codes that could beat the GV bound. AG codes were the first family of codes found to beat the bound, which can be shown using results on counting the maximum number of Fq\bbf_q-rational points Nq(g)N_q(g) on curves of genus gg, similar to the Hasse-Weil bound. Nonetheless, these codes provide interesting mathematical objects outside the realm of coding theory which was my primary motivation in writing this blog post.