LCP-Aware Parallel String Sorting

Ellert, Jonas; Fischer, Johannes; Sitchinava, Nodari

Computer Science > Data Structures and Algorithms

arXiv:2006.02219 (cs)

[Submitted on 3 Jun 2020]

Title:LCP-Aware Parallel String Sorting

Authors:Jonas Ellert, Johannes Fischer, Nodari Sitchinava

View PDF

Abstract:When lexicographically sorting strings, it is not always necessary to inspect all symbols. For example, the lexicographical rank of "europar" amongst the strings "eureka", "eurasia", and "excells" only depends on its so called relevant prefix "euro". The distinguishing prefix size $D$ of a set of strings is the number of symbols that actually need to be inspected to establish the lexicographical ordering of all strings. Efficient string sorters should be $D$-aware, i.e. their complexity should depend on $D$ rather than on the total number $N$ of all symbols in all strings. While there are many $D$-aware sorters in the sequential setting, there appear to be no such results in the PRAM model. We propose a framework yielding a $D$-aware modification of any existing PRAM string sorter. The derived algorithms are work-optimal with respect to their original counterpart: If the original algorithm requires $O(w(N))$ work, the derived one requires $O(w(D))$ work. The execution time increases only by a small factor that is logarithmic in the length of the longest relevant prefix. Our framework universally works for deterministic and randomized algorithms in all variations of the PRAM model, such that future improvements in ($D$-unaware) parallel string sorting will directly result in improvements in $D$-aware parallel string sorting.

Comments:	Accepted at Euro-Par 2020 and to be published by Springer as part of the conference proceedings
Subjects:	Data Structures and Algorithms (cs.DS); Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2006.02219 [cs.DS]
	(or arXiv:2006.02219v1 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.2006.02219

Submission history

From: Jonas Ellert [view email]
[v1] Wed, 3 Jun 2020 12:30:53 UTC (37 KB)

Computer Science > Data Structures and Algorithms

Title:LCP-Aware Parallel String Sorting

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Data Structures and Algorithms

Title:LCP-Aware Parallel String Sorting

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators