The subject of this post is an introduction to error correcting codes with the main goal of defining a class of codes called Reed-Solomon codes. I eventually want to discuss another class of codes called Algebraic Geometry codes in the next post, so this will serve as sort of an optional post giving a quick and dirty background on codes for those unfamiliar with the general theory.
The notion of an algebraic error correcting code over an alphabet of length and distance is a -dimensional linear subspace of . Codes are a way of efficiently introducing redudancy into messages in order to protect against interference by noisy channels.
For example, Alice and Bob are standing on opposite sides of a highway and Alice wants to tell Bob the message "Hello". If she just yells it once, Bob might not hear, so instead she can yell "HelloHelloHello" to make sure, even if parts of the message are lost, Bob is still likely to understand.
This is not very efficient, however. Consider Alice sending a binary message of length , ... and suppose that our noisy channel randomly flips bits, so Bob receives which has bits flipped. How can we encode Alice's message such that Bob can always correct at most errors?
Insead of binary, we can send messages in an alphabet of size , say . Then one way to encode the message is to linearly map the message space into higher dimensional space, i.e. if Alice is sending possilbe messages of length , we can construct a linear injective map .
The vectors are called codewords and the image is called the code. is called the generator matrix for the code. The integer is the dimension of the code , and is the message length of . The Hamming distance between any two vectors is the number of indices at which and differ.
Definition The distance of the code is defined as the minimum distance between any two codewords in ,
We usually call a code with dimension , message length , distance , and alphabet size a linear code.
Suppose Alice encodes her message as , and the noisy channel introduces an error vector such that Bob receives . Then the idea is for Bob to decode by taking the codeword closest to the received in terms of modified indices. Then Bob can recover such that . The hope is that , i.e. Bob recovers Alice's message .
How many errors can the channel introduce before Bob decodes the wrong codeword? Observe that the Hamming ball of radius centered at a codeword , defined as
is the largest such ball of received messages such that Bob corrects to , i.e. . This is because the midpoint between any two codewords is at least Hamming distance , which means if exceeds this distance from it might be decoded to another codeword.
Thus we can correct errors with at most nonzero entries. This is the reason why we usually want to construct codes with good distance, since the larger the distance the more errors we can correct. Of course, we also want good rate, which is the amount of redudancy we have to introduce.
Definition For a family of linear codes , the rate and distance are defined as
Our goal is to construct families of codes with asymptotically good rate and distance tradeoff.
We define the weight of a vector as the number of nonzero entries, so in particular we can correct as long as .
A very useful alternative definition for distance is the minimum weight of a codeword, . Note that is always a codeword, and is the Hamming distance between and the vector. Then one can check with not too much difficulty that . Since is a linear subspace, which means
We'll jump into the main example of a linear code that we're interested in, called the Reed-Solomon code.
Definition Let and . The Reed-Solomon code is defined as
This is indeed a linear code. If with and then
The generator matrix is given by
since if and then
Here's the interesting bit: computing the distance of . Note that a codeword has weight if and only if vanishes on points . Since , has at most roots which implies or
There is a theorem that establishes for any code the reverse inequality.
Theorem (Singleton Bound) For any linear code , i.e. a linear code over with dimension , message length , and distance ,
Any code that meets this bound is called maximum distance separable or MDS. As we have just seen, Reed-Solomon codes are MDS, i.e. which is a very nice property.
In particular, we note that
and therefore for MDS codes (and in particular RS codes),