Fully-Online Suffix Tree and Directed Acyclic Word Graph Construction for Multiple Texts

Takagi, Takuya; Diptarama; Inenaga, Shunsuke; Breslauer, Dany; Arimura, Hiroki


arXiv:1507.07622v4 (cs)

[Submitted on 28 Jul 2015 (v1), revised 23 Jan 2018 (this version, v4), latest version 12 Jul 2018 (v5)]

Abstract: We consider the construction of the suffix tree and the directed acyclic word graph (DAWG) indexing data structures for a collection of texts $\mathcal{T}$, where a new symbol may be appended to any text in $\mathcal{T} = \{T_1, \ldots, T_K\}$ at any time. This fully-online scenario, which arises when dynamically indexing multi-sensor data, is a natural generalization of the long-solved semi-online problem, where texts $T_1, \ldots, T_{k-1}$ are permanently fixed before the next text $T_k$ is processed. We present fully-online algorithms that construct the suffix tree and the DAWG for $\mathcal{T}$ in $O(N \log \sigma)$ time and $O(N)$ space, where $N$ is the total length of the strings in $\mathcal{T}$ and $\sigma$ is their alphabet size. The standard explicit representation of the suffix tree and the DAWG edges must be relaxed in the fully-online scenario, since maintaining it might require too many updates. Instead, we provide access to the frequently updated suffix tree leaf edge labels and the DAWG re-directable edges via auxiliary data structures, in $O(\log \sigma)$ time.
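To make the fully-online setting concrete, the following is a minimal sketch of a naive, uncompressed suffix trie over several texts that accepts an append to any text at any time. It is not the algorithm of the paper: the class name `NaiveFullyOnlineSuffixTrie` and its methods are hypothetical, and each append costs time proportional to the current length of the extended text, so the total cost can be quadratic rather than $O(N \log \sigma)$. It only illustrates why every suffix of the extended text has to be updated, which is the cost the paper's auxiliary data structures are designed to avoid.

```python
class NaiveFullyOnlineSuffixTrie:
    """Naive illustration only; NOT the O(N log sigma) algorithm of the paper."""

    def __init__(self, num_texts):
        # children[v] maps a symbol to the child node of v; node 0 is the root.
        self.children = [{}]
        # ends[i] holds the trie node of every suffix of text i.
        # Initially each text is empty, so its only suffix sits at the root.
        self.ends = [[0] for _ in range(num_texts)]

    def append(self, i, c):
        """Append symbol c to text i (any text, at any time)."""
        new_ends = []
        for node in self.ends[i]:
            # Every existing suffix of text i is extended by c.
            nxt = self.children[node].get(c)
            if nxt is None:
                nxt = len(self.children)
                self.children.append({})
                self.children[node][c] = nxt
            new_ends.append(nxt)
        new_ends.append(0)  # the new empty suffix starts at the root
        self.ends[i] = new_ends

    def contains(self, pattern):
        """Return True if pattern is a substring of some text."""
        node = 0
        for c in pattern:
            node = self.children[node].get(c)
            if node is None:
                return False
        return True


# Interleaved appends to two texts, as in the fully-online scenario.
index = NaiveFullyOnlineSuffixTrie(num_texts=2)
for text_id, symbol in [(0, "a"), (1, "b"), (0, "b"), (1, "a")]:
    index.append(text_id, symbol)
```

After these appends the two texts are "ab" and "ba", and `index.contains` can be queried between any two appends. In the semi-online setting, by contrast, all appends to text 0 would have to finish before the first append to text 1.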

Comments:	23 pages, 6 figures, LaTeX
Subjects:	Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:1507.07622 [cs.DS]
	(or arXiv:1507.07622v4 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.1507.07622

Submission history

From: Takuya Takagi
[v1] Tue, 28 Jul 2015 02:15:02 UTC (1,659 KB)
[v2] Wed, 29 Jul 2015 08:57:15 UTC (1,658 KB)
[v3] Mon, 1 Feb 2016 08:18:14 UTC (1,648 KB)
[v4] Tue, 23 Jan 2018 19:39:17 UTC (652 KB)
[v5] Thu, 12 Jul 2018 14:35:38 UTC (945 KB)
