English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Book Chapter

Efficient string mining under constraints via the deferred frequency index

MPS-Authors

Schulz,  Marcel H.
Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available
Citation

Weese, D., & Schulz, M. H. (2008). Efficient string mining under constraints via the deferred frequency index. In P. Perner (Ed.), Advances in Data Mining. Medical Applications, E-Commerce, Marketing, and Theoretical Aspects. Berlin/Heidelberg: Springer.


Cite as: https://hdl.handle.net/11858/00-001M-0000-0010-7F86-8
Abstract
We propose a general approach for frequency based string mining, which has many applications, e.g. in contrast data mining. Our contribution is a novel algorithm based on a deferred data structure. Despite its simplicity, our approach is up to 4 times faster and uses about half the memory compared to the best-known algorithm of Fischer et al. Applications in various string domains, e.g. natural language, DNA or protein sequences, demonstrate the improvement of our algorithm.