Options
Towards Gene Recognition from Rare and Ambiguous Abbreviations using a Filtering Approach
Hartung, Matthias; Klinger, Roman; Zwick, Matthias; u. a. (2014): Towards Gene Recognition from Rare and Ambiguous Abbreviations using a Filtering Approach, in: Kevin Cohen, Dina Demner-Fushman, Sophia Ananiadou, u. a. (Hrsg.), Proceedings of BioNLP 2014, Baltimore, Maryland: Association for Computational Linguistics, S. 118–127, doi: 10.3115/v1/W14-3418.
Faculty/Chair:
Author:
Title of the compilation:
Proceedings of BioNLP 2014
Editors:
Cohen, Kevin
Demner-Fushman, Dina
Ananiadou, Sophia
Tsujii, Jun-ichi
Conference:
BioNLP 2014 ; Baltimore, Maryland
Publisher Information:
Year of publication:
2014
Pages:
Language:
English
DOI:
Abstract:
Retrieving information about highly ambiguous gene/protein homonyms is a challenge, in particular where their non-protein meanings are more frequent than their protein meaning (e. g., SAH or HF). Due to their limited coverage in common benchmarking data sets, the performance of existing gene/protein recognition tools on these problematic cases is hard to assess. We uniformly sample a corpus of eight ambiguous gene/protein abbreviations from MEDLINEr and provide manual annotations for each mention of these abbreviations.1 Based on this resource, we show that available gene recognition tools such as conditional random fields (CRF) trained on BioCreative 2 NER data or GNAT tend to underperform on this phenomenon. We propose to extend existing gene recognition approaches by combining a CRF and a support vector machine. In a cross- entity evaluation and without taking any entity-specific information into account, our model achieves a gain of 6 points F1-Measure over our best baseline which checks for the occurrence of a long form of the abbreviation and more than 9 points over all existing tools investigated.
GND Keywords: ; ;
Maschinelles Lernen
Genetik
Computerlinguistik
Keywords:
Gene Recognition
DDC Classification:
RVK Classification:
Peer Reviewed:
Yes:
International Distribution:
Yes:
Open Access Journal:
Yes:
Type:
Conferenceobject
Activation date:
March 13, 2024
Versioning
Question on publication
Permalink
https://fis.uni-bamberg.de/handle/uniba/93994