English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONS
  This item is discarded!Release HistoryDetailsSummary

Discarded

Journal Article

EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation

MPS-Authors
/persons/resource/persons210306

Buttigieg,  P.
HGF MPG Joint Research Group for Deep Sea Ecology & Technology, Max Planck Institute for Marine Microbiology, Max Planck Society;

/persons/resource/persons210665

Pereira,  E.
Microbial Genomics Group, Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

/persons/resource/persons210754

Schnetzer,  J.
Microbial Genomics Group, Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

(No access)

Supplementary Material (public)
There is no public supplementary material available
Citation

Pafilis, E., Buttigieg, P., Ferrell, B., Pereira, E., Schnetzer, J., Arvanitidis, C., et al. (2016). EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation. Database-the Journal of Biological Databases and Curation, baw005, pp. 1-7.


Abstract
The microbial and molecular ecology research communities have made substantial progress on developing standards for annotating samples with environment metadata. However, sample manual annotation is a highly labor intensive process and requires familiarity with the terminologies used. We have therefore developed an interactive annotation tool, EXTRACT, which helps curators identify and extract standard-compliant terms for annotation of metagenomic records and other samples. Behind its web-based user interface, the system combines published methods for named entity recognition of environment, organism, tissue and disease terms. The evaluators in the BioCreative V Interactive Annotation Task found the system to be intuitive, useful, well documented and sufficiently accurate to be helpful in spotting relevant text passages and extracting organism and environment terms. Comparison of fully manual and text-mining-assisted curation revealed that EXTRACT speeds up annotation by 15-25% and helps curators to detect terms that would otherwise have been missed.