• Login
    About WHOAS
    View Item 
    •   WHOAS Home
    • Marine Biological Laboratory
    • Center for Library and Informatics (CLI)
    • CLI Data Sets
    • View Item
    •   WHOAS Home
    • Marine Biological Laboratory
    • Center for Library and Informatics (CLI)
    • CLI Data Sets
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of WHOASCommunities & CollectionsBy Issue DateAuthorsTitlesKeywordsThis CollectionBy Issue DateAuthorsTitlesKeywords

    My Account

    LoginRegister

    Statistics

    View Usage Statistics

    NetiNeti : Discovery of Scientific Names from Text Using Machine Learning Methods Table 2

    Thumbnail
    View/Open
    Results from FAT tool without those found by NetiNeti and Manual Markup (126bytes)
    Results from FAT tool without those found by NetiNeti or Manual Markup (13.48Kb)
    Results from FAT tool without those found by NetiNeti or Manual Markup words identified as geographical or common words (13.82Kb)
    Results of the FAT tool run against American Seashells http://www.biodiversitylibrary.org/item/31699 (37.38Kb)
    Results of NetiNeti without those found by the FAT tool or manual markup (2.603Kb)
    Results of NetiNeti without those found by manual markup or TaxonFinder with reasons (2.869Kb)
    All results produced by running NetiNeti against American Seashells http://www.biodiversitylibrary.org/item/31699 (51.43Kb)
    Results of TaxonFinder used on American Seashells http://www.biodiversitylibrary.org/item/31699 (35.46Kb)
    Results of TaxonFinder without those found by NetiNeti or manual markup and why (3.501Kb)
    Date
    2012-01-27
    Author
    Akella, Lakshmi Manohar  Concept link
    Metadata
    Show full item record
    Citable URI
    https://hdl.handle.net/1912/5002
    Related Material/Data
    https://hdl.handle.net/1912/6236
    DOI
    10.1575/1912/5002
    Keyword
    Precision and recall values
    Abstract
    A scientific name for an organism can be associated with almost all biological data. Name identification is an important step in many text mining tasks aiming to extract useful information from biological, biomedical and biodiversity text sources. A scientific name acts as an important metadata element to link biological information.We present NetiNeti, a machine learning based approach for identification and discovery of scientific names. The system implementing the approach can be accessed at http://namefinding.ubio.org we present the comparison results of various machine learning algorithms on our annotated corpus. Naïve Bayes and Maximum Entropy with Generalized Iterative Scaling (GIS) parameter estimation are the top two performing algorithms.
    Description
    A comparison of NetiNeti, TaxonFinder and FAT tool for the American Seashell Book book (http://www.biodiversitylibrary.org/item/31699) is presented in Table 2. The FAT approach has lower precision and recall values compared to NetiNeti and TaxonFinder approaches for this corpus. The names marked up by the FAT tool were compared with the manual mark up. 869 of the names identified by FAT did not match with the manually marked up set of names. Most of these unmatched names are species epithets with authorship information. We - 16 - further analyzed a random sample of 100 names out of these 869 names and examined genus information interpreted by the tool in the marked up tags. 32 of the 100 mismatched names have correctly interpreted genus names and the remaining are all true false positives with incorrect genus tags. We estimated that 278 of these 869 are correct identifications and the adjusted precision and recall values for the FAT approach were summarized in Table 2. For many of the true false positives, the FAT tool tags the species epithet, but does not seem to recognize the genus name immediately preceding the species name.
    Collections
    • CLI Data Sets
    Suggested Citation
    Dataset: Akella, Lakshmi Manohar, "NetiNeti : Discovery of Scientific Names from Text Using Machine Learning Methods Table 2", 2012-01-27, DOI:10.1575/1912/5002, https://hdl.handle.net/1912/5002
     
    All Items in WHOAS are protected by original copyright, with all rights reserved, unless otherwise indicated. WHOAS also supports the use of the Creative Commons licenses for original content.
    A service of the MBLWHOI Library | About WHOAS
    Contact Us | Send Feedback | Privacy Policy
    Core Trust Logo