English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Recommending plant taxa for supporting on-site species identification

Wittich, H. C., Seeland, M., Wäldchen, J., Rzanny, M., & Mäder, P. (2018). Recommending plant taxa for supporting on-site species identification. BMC Bioinformatics, 19: 190. doi:10.1186/s12859-018-2201-7.

Item is

Files

show Files
hide Files
:
BGC2856.pdf (Publisher version), 2MB
Name:
BGC2856.pdf
Description:
-
OA-Status:
Gold
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
-

Locators

show
hide
Locator:
http://dx.doi.org/10.1186/s12859-018-2201-7 (Publisher version)
Description:
OA
OA-Status:
Gold

Creators

show
hide
 Creators:
Wittich, Hans Christian, Author
Seeland, Marco, Author
Wäldchen, Jana1, Author           
Rzanny, Michael1, Author           
Mäder, Patrick, Author
Affiliations:
1Flora Incognita, Dr. Jana Wäldchen, Department Biogeochemical Integration, Prof. Dr. M. Reichstein, Max Planck Institute for Biogeochemistry, Max Planck Society, ou_3240484              

Content

show
hide
Free keywords: -
 Abstract: Background
Predicting a list of plant taxa most likely to be observed at a given geographical location and time is useful for many scenarios in biodiversity informatics. Since efficient plant species identification is impeded mainly by the large number of possible candidate species, providing a shortlist of likely candidates can help significantly expedite the task. Whereas species distribution models heavily rely on geo-referenced occurrence data, such information still remains largely unused for plant taxa identification tools.
Results

In this paper, we conduct a study on the feasibility of computing a ranked shortlist of plant taxa likely to be encountered by an observer in the field. We use the territory of Germany as case study with a total of 7.62M records of freely available plant presence-absence data and occurrence records for 2.7k plant taxa. We systematically study achievable recommendation quality based on two types of source data: binary presence-absence data and individual occurrence records. Furthermore, we study strategies for aggregating records into a taxa recommendation based on location and date of an observation.
Conclusion

We evaluate recommendations using 28k geo-referenced and taxa-labeled plant images hosted on the Flickr website as an independent test dataset. Relying on location information from presence-absence data alone results in an average recall of 82%. However, we find that occurrence records are complementary to presence-absence data and using both in combination yields considerably higher recall of 96% along with improved ranking metrics. Ultimately, by reducing the list of candidate taxa by an average of 62%, a spatio-temporal prior can substantially expedite the overall identification problem.

Details

show
hide
Language(s):
 Dates: 2018-05-142018-05-30
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: Other: BGC2856
DOI: 10.1186/s12859-018-2201-7
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: BMC Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: BioMed Central
Pages: - Volume / Issue: 19 Sequence Number: 190 Start / End Page: - Identifier: ISSN: 1471-2105
CoNE: https://pure.mpg.de/cone/journals/resource/111000136905000