A hybrid algorithm for the longest common transposition-invariant subsequence problem

Authors

  • Sebastian Deorowicz
  • Szymon Grabowski

Abstract

The longest common transposition-invariant subsequence (LCTS) problem is a music information retrieval oriented variation of the classic LCS problem. There are basically only two known efficient approaches to calculate the length of the LCTS, one based on sparse dynamic programming and the other on bit-parallelism. In this work, we propose a hybrid algorithm picking the better of the two algorithms for individual subproblems. Experiments on music (MIDI), with 32-bit and 64-bit implementations, show that the proposed algorithm outperforms the faster of the two component algorithms by a factor of 1.4–2.0, depending on sequence lengths. Similar, if not better, improvements can be observed for random data with Gaussian distribution. Also for uniformly random data, the hybrid algorithm is the winner if the alphabet is neither too small (at least 32 symbols) nor too large (up to 128 symbols). Part of the success of our scheme is attributed to a quite robust component selection heuristic.

Downloads

Download data is not yet available.

Downloads

Published

2012-01-26

How to Cite

Deorowicz, S., & Grabowski, S. (2012). A hybrid algorithm for the longest common transposition-invariant subsequence problem. COMPUTING AND INFORMATICS, 28(5), 729–744. Retrieved from http://www.cai.sk/ojs/index.php/cai/article/view/59