Efficient Linear and Affine Codes for Correcting Insertions/Deletions

Cheng, Kuan; Guruswami, Venkatesan; Haeupler, Bernhard; Li, Xin

Computer Science > Information Theory

arXiv:2007.09075 (cs)

[Submitted on 17 Jul 2020 (v1), last revised 21 Jul 2022 (this version, v4)]

Title:Efficient Linear and Affine Codes for Correcting Insertions/Deletions

Authors:Kuan Cheng, Venkatesan Guruswami, Bernhard Haeupler, Xin Li

View PDF

Abstract:This paper studies \emph{linear} and \emph{affine} error-correcting codes for correcting synchronization errors such as insertions and deletions. We call such codes linear/affine insdel codes.
Linear codes that can correct even a single deletion are limited to have information rate at most $1/2$ (achieved by the trivial 2-fold repetition code). Previously, it was (erroneously) reported that more generally no non-trivial linear codes correcting $k$ deletions exist, i.e., that the $(k+1)$-fold repetition codes and its rate of $1/(k+1)$ are basically optimal for any $k$. We disprove this and show the existence of binary linear codes of length $n$ and rate just below $1/2$ capable of correcting $\Omega(n)$ insertions and deletions. This identifies rate $1/2$ as a sharp threshold for recovery from deletions for linear codes, and reopens the quest for a better understanding of the capabilities of linear codes for correcting insertions/deletions.
We prove novel outer bounds and existential inner bounds for the rate vs. (edit) distance trade-off of linear insdel codes. We complement our existential results with an efficient synchronization-string-based transformation that converts any asymptotically-good linear code for Hamming errors into an asymptotically-good linear code for insdel errors. Lastly, we show that the $\frac{1}{2}$-rate limitation does not hold for affine codes by giving an explicit affine code of rate $1-\epsilon$ which can efficiently correct a constant fraction of insdel errors.

Subjects:	Information Theory (cs.IT); Discrete Mathematics (cs.DM); Data Structures and Algorithms (cs.DS); Combinatorics (math.CO)
Cite as:	arXiv:2007.09075 [cs.IT]
	(or arXiv:2007.09075v4 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.2007.09075

Submission history

From: Kuan Cheng [view email]
[v1] Fri, 17 Jul 2020 15:56:05 UTC (40 KB)
[v2] Mon, 16 Nov 2020 10:07:17 UTC (41 KB)
[v3] Thu, 3 Jun 2021 12:54:40 UTC (41 KB)
[v4] Thu, 21 Jul 2022 03:42:10 UTC (475 KB)

Computer Science > Information Theory

Title:Efficient Linear and Affine Codes for Correcting Insertions/Deletions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Theory

Title:Efficient Linear and Affine Codes for Correcting Insertions/Deletions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators