A survey of string orderings and their application to the Burrows-Wheeler transform

Jacqueline W. Daykin, Richard Groult, Yannick Guesnet, Thierry Lecroq, Arnaud Lefebvre, Martine Léonard, Élise Prieur-Gaston

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)

Abstract

For over 20 years the data clustering properties and applications of the efficient Burrows–Wheeler transform have been researched. Lexicographic suffix-sorting is induced during the transformation, and more recently a new direction has considered alternative ordering strategies for suffix arrays and thus the transforms. In this survey we look at these distinctly ordered bijective and linear transforms. For arbitrary alphabets we discuss the V-BWT derived from V-order and the D-BWT based on lex-extension order. The binary case yields a pair of transforms, the binary Rouen B-BWT, defined using binary block order. Lyndon words are relevant to implementing the original transform; the new transforms are defined for analogous structures: V-words, indeterminate Lyndon words, and B-words, respectively. There is plenty of scope for further non-lexicographic transforms as indicated in the conclusion.
Original languageEnglish
Pages (from-to)52-65
Number of pages14
JournalTheoretical Computer Science
Volume710
Early online date01 Mar 2017
DOIs
Publication statusPublished - 01 Feb 2018

Keywords

  • algorithm
  • bijective alphabet
  • block order
  • Burrown-Wheeler transform
  • B-word
  • data clustering
  • degenerate
  • GB-word
  • generic alphabet
  • generic block order
  • indeterminate Lyndon word
  • inverse transform
  • lexicographic order
  • linear
  • Lyndon word
  • string
  • suffix array
  • suffix-sorting
  • T-order
  • V-order
  • word

Fingerprint

Dive into the research topics of 'A survey of string orderings and their application to the Burrows-Wheeler transform'. Together they form a unique fingerprint.

Cite this