您当前所在位置: 首页 > 学者

朱小燕

  • 61浏览

  • 0点赞

  • 0收藏

  • 0分享

  • 100下载

  • 0评论

  • 引用

期刊论文

Various Features with Integrated Strategies for Protein Name Classification

朱小燕Budi Taruna Ongkowijaya Shilin Ding and Xiaoyan Zhu

ISPA Workshops 2005, LNCS 3759, pp. 213-222, 2005.,-0001,():

URL:

摘要/描述

Classification task is an integral part of named entity recognition system to classify a recognized named entity to its corresponding class. This task has not received much attention in the biomedical domain, due to the lack of awareness to differentiate feature sources and strategies in previous studies. In this research, we analyze different sources and strategies of protein name classification, and developed integrated strategies that incorporate advantages from rule-based, dictionary-based and statistical-based method. In rule-based method, terms and knowledge of protein nomenclature that provide strong cue for protein name are used. In dictionary-based method, a set of rules for curating protein name dictionary are used. These terms and dictionaries are combined with our developed features into a statistical-based classifier. Our developed features are comprised of word shape features and unigram & bi-gram features. Our various information sources and integrated strategies are able to achieve state-of-the-art performance to classify protein and non-protein names.

关键词:

【免责声明】以下全部内容由[朱小燕]上传于[2006年07月25日 17时38分44秒],版权归原创者所有。本文仅代表作者本人观点,与本网站无关。本网站对文中陈述、观点判断保持中立,不对所包含内容的准确性、可靠性或完整性提供任何明示或暗示的保证。请读者仅作参考,并请自行承担全部责任。

我要评论

全部评论 0

本学者其他成果

    同领域成果