ENVIRONMENTS and EOL : identification of Environment Ontology terms in text and the annotation of the Encyclopedia of Life
Frankild, Sune P.
Leary, Patrick R.
Parr, Cynthia Sims
Jensen, Lars Juhl
MetadataShow full item record
The association of organisms to their environments is a key issue in exploring biodiversity patterns. This knowledge has traditionally been scattered, but textual descriptions of taxa and their habitats are now being consolidated in centralized resources. However, structured annotations are needed to facilitate large-scale analyses. Therefore, we developed ENVIRONMENTS, a fast dictionary-based tagger capable of identifying Environment Ontology (ENVO) terms in text. We evaluate the accuracy of the tagger on a new manually curated corpus of 600 Encyclopedia of Life (EOL) species pages. We use the tagger to associate taxa with environments by tagging EOL text content monthly, and integrate the results into the EOL to disseminate them to a broad audience of users.
© The Author(s), 2015. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Bioinformatics 31 (2015): 1872-1874, doi:10.1093/bioinformatics/btv045.
The following license files are associated with this item:
Showing items related by title, author, creator and subject.
Mopper, Kenneth (Massachusetts Institute of Technology and Woods Hole Oceanographic Institution, 1973-06)The goal of this thesis is to examine the distribution and diagenesis of carbohydrates in aquatic environments. The following questions are studied: what is the carbohydrate composition of sediment in different environments ...