IJMLC 2013 Vol.3(6): 494-498 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2013.V3.367
Disease Named Entity Recognition by Machine Learning Using Semantic Type of Metathesaurus
Zhong Huang and Xiaohua Hu
Abstract—Named Entity Recognition (NER) has been an active research fields in biomedical text mining. In the past years, much attention has been focused on semantic types related to protein, gene, and other named entities in biology domain. Human disease named entity recognition in literatures, however, has not received much attention. Comparing the NER solutions targeting protein/gene named entities, existing machine learning solutions lacks same level of precision and recall for disease named entity recognition. The development of machine learning based NER for disease named entity is largely focused on local features of tokens in the sentence, by integrating its linguistic, orthographic, morphological, local contextual characteristics. In this paper, we utilized the sentence level semantic contextual information as one of discriminative features for disease NE recognition. Our method takes advantage of semantic types related to disease in UMLS metathesaurus by fuzzy dictionary lookup. The results show promises to improve the performance of current disease NER methods.
Index Terms—Biomedical concept, disease, named entity, named entity recognition, NE, NER, semantic type, machine learning, conditional random fields, CRF.
The authors are with School of Information Science and Technology, Drexel University, Philadelphia, USA (e-mail: zhong.huang@drexel.edu).
[PDF]
Cite:Zhong Huang and Xiaohua Hu, "Disease Named Entity Recognition by Machine Learning Using Semantic Type of Metathesaurus," International Journal of Machine Learning and Computing vol.3, no. 6, pp. 494-498, 2013.