Help Privacy Policy Disclaimer
  Advanced SearchBrowse




Journal Article

Data Mining for Biologists


Tsuda,  K
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;
Max Planck Institute for Biological Cybernetics, Max Planck Society;

Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)
There are no public fulltexts stored in PuRe
Supplementary Material (public)
There is no public supplementary material available

Tsuda, K. (2012). Data Mining for Biologists. International Journal of Knowledge Discovery in Bioinformatics, 3(4), 1-14. doi:10.4018/ijkdb.2012100101.

Cite as: https://hdl.handle.net/21.11116/0000-0001-8492-4
In this tutorial article, the author reviews basics about frequent pattern mining algorithms, including itemset mining, association rule mining, and graph mining. These algorithms can find frequently appearing substructures in discrete data. They can discover structural motifs, for example, from mutation data, protein structures, and chemical compounds. As they have been primarily used for business data, biological applications are not so common yet, but their potential impact would be large. Recent advances in computers including multicore machines and ever increasing memory capacity support the application of such methods to larger datasets. The author explains technical aspects of the algorithms, but do not go into details. Current biological applications are summarized and possible future directions are given.