NetiNeti : Discovery of Scientific Names from Text Using Machine Learning Methods Figure 1

dc.contributor.author Akella, Lakshmi Manohar
dc.date.accessioned 2011-12-30T18:31:38Z
dc.date.available 2011-12-30T18:31:38Z
dc.date.issued 2011-12-30
dc.description Figure 1 demonstrates a series of training experiments with the Naïve Bayes classifier using different neighborhoods for contextual features, different sizes of positive and negative training examples and evaluated the resulting classifiers with our annotated gold standard corpus. The data sets are the results of running NetiNeti on subset of 136 PubMedCentral tagged open access articles and with no stop list. en_US
dc.description.abstract A scientific name for an organism can be associated with almost all biological data. Name identification is an important step in many text mining tasks aiming to extract useful information from biological, biomedical and biodiversity text sources. A scientific name acts as an important metadata element to link biological information.We present NetiNeti, a machine learning based approach for identification and discovery of scientific names. The system implementing the approach can be accessed at http://namefinding.ubio.org we present the comparison results of various machine learning algorithms on our annotated corpus. Naïve Bayes and Maximum Entropy with Generalized Iterative Scaling (GIS) parameter estimation are the top two performing algorithms. en_US
dc.format.mimetype text/plain
dc.identifier.doi 10.1575/1912/4965
dc.identifier.uri https://hdl.handle.net/1912/4965
dc.language.iso en_US en_US
dc.relation.ispartof https://hdl.handle.net/1912/6236
dc.subject Naïve Bayes classifier, training experiments en_US
dc.title NetiNeti : Discovery of Scientific Names from Text Using Machine Learning Methods Figure 1 en_US
dc.type Dataset en_US
dspace.entity.type Publication
relation.isAuthorOfPublication 6e47b51b-97ba-4e96-b68c-16b48db54d20
relation.isAuthorOfPublication.latestForDiscovery 6e47b51b-97ba-4e96-b68c-16b48db54d20
Files
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
NPR-Experiment.txt
Size:
26.03 KB
Format:
Plain Text
Description:
Netineti precision and recall
No Thumbnail Available
Name:
NPR-Experiment-nosl.txt
Size:
26.02 KB
Format:
Plain Text
Description:
Netineti precision and recall no stoplost
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.89 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections