
Term extraction based on the combination of statistics and rules
Cite this article: TANG Tao, ZHOU Qiao-li, ZHANG Gui-ping. Term extraction based on the combination of statistics and rules[J]. Journal of Shenyang Institute of Aeronautical Engineering, 2011, 28(5): 71-74.
Authors: TANG Tao  ZHOU Qiao-li  ZHANG Gui-ping
Institution: Knowledge Engineering Research Center, Shenyang Aerospace University, Shenyang, Liaoning 110136
Abstract: In domain-specific word segmentation, the quality of term extraction strongly affects segmentation precision, so high-precision term extraction is foundational work for domain segmentation. This paper proposes a term extraction method for a specific domain that combines statistics and rules. Starting from the 5-best results produced by Conditional Random Fields, terms are extracted using rules and a scoring mechanism, and the extracted results are then post-processed with rules. Experiments show that, compared with conventional term extraction based on the 1-best Conditional Random Fields output, the method clearly improves the recall of out-of-vocabulary terms.

Keywords: term extraction  Conditional Random Fields  out-of-vocabulary term  5-best
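
The abstract names the pipeline (5-best Conditional Random Fields output, then rules plus scoring, then rule-based post-processing) but does not give the actual rules or scoring formula. The following is only a minimal sketch of one way such a pipeline could look, under stated assumptions: each n-best result is a BIO label sequence with a sequence probability, a candidate's score is the summed probability of the sequences that contain it, and the post-processing rule is a simple length filter. The names `spans_from_bio`, `extract_terms`, `threshold`, and `reject` are hypothetical and do not come from the paper.

```python
from collections import defaultdict


def spans_from_bio(tokens, labels):
    """Collect (start, end, text) spans from one BIO-labelled sequence."""
    spans, start = [], None
    for i, label in enumerate(labels):
        if label in ("B", "O") and start is not None:
            # A new "B" or an "O" closes the span that was open.
            spans.append((start, i, "".join(tokens[start:i])))
            start = None
        if label == "B":
            start = i
        # "I" continues an open span; a stray "I" with no open span is ignored.
    if start is not None:
        spans.append((start, len(tokens), "".join(tokens[start:])))
    return spans


def extract_terms(tokens, nbest, threshold=0.5, reject=lambda term: len(term) < 2):
    """Score candidate terms over all n-best sequences, then filter.

    ASSUMPTION: the score is the probability mass of the sequences that
    contain the span; the paper's real scoring mechanism is not given.
    ASSUMPTION: `reject` stands in for the paper's post-processing rules.
    Overlapping candidates are not resolved in this sketch.
    """
    scores = defaultdict(float)
    for labels, prob in nbest:
        for span in spans_from_bio(tokens, labels):
            scores[span] += prob
    kept = [(span, score) for span, score in scores.items()
            if score >= threshold and not reject(span[2])]
    return sorted(kept, key=lambda item: (item[0][0], item[0][1]))


# Illustrative input: character tokens and five invented label sequences.
tokens = list("航空发动机叶片")
nbest = [
    (["B", "I", "I", "I", "I", "O", "O"], 0.40),  # the 1-best sequence
    (["B", "I", "I", "I", "I", "B", "I"], 0.30),
    (["B", "I", "B", "I", "I", "B", "I"], 0.15),
    (["O", "O", "O", "O", "O", "B", "I"], 0.10),
    (["O", "O", "O", "O", "O", "O", "O"], 0.05),
]
result = extract_terms(tokens, nbest)
for (start, end, term), score in result:
    print(start, end, term, round(score, 2))
```

In this invented example the 1-best sequence alone contains only 航空发动机, while pooling the five sequences also keeps 叶片 because three lower-ranked sequences agree on it. That is the kind of recall gain for out-of-vocabulary terms the abstract reports, although the numbers here are made up for illustration.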
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.