首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Wikipedia中的语义析取
引用本文:余旸,林漳希,夏国平.Wikipedia中的语义析取[J].北京航空航天大学学报,2009,35(10):1283-1286.
作者姓名:余旸  林漳希  夏国平
作者单位:北京航空航天大学经济管理学院,北京,100191;德克萨斯理工大学管理学院,德克萨斯,79410
摘    要:维基百科(Wikipedia)现有搜索模块采用关键词匹配方式导致搜索效率相对低下.为了提高Wikipedia中的知识获取效率,提出基于链接分析的词间距算法(TDL,Term Distance based on Linkage).利用可扩展的计算模型,通过内部链接结构分析发现词簇,并且引入排序和推荐机制.基于Wikipedia 2009年5月快照数据的实验表明,TDL有效增强了Wikipedia知识检索的准确性,经由用户评判检验证实TDL算法能有效提高用户意图识别度达7%.

关 键 词:Wikipedia  链接分析  知识发现
收稿时间:2008-11-30

Extracting thematic communities from Wikipedia
Yu Yang,Lin Zhangxi,Xia Guoping.Extracting thematic communities from Wikipedia[J].Journal of Beijing University of Aeronautics and Astronautics,2009,35(10):1283-1286.
Authors:Yu Yang  Lin Zhangxi  Xia Guoping
Institution:1. School of Economics and Management, Beijing University of Aeronautics and Astronautics, Beijing 100191, China;
2. The Rawls College of Business Administration, Texas Tech University, Texas 79410, U.S.A
Abstract:The current search module in Wikipedia has low search efficiency due to the search method, which is built on simple keywords matching. To improve the efficiency of knowledge retrieval from the Wikipedia spheres with more accurate links among them, the algorithm named term distance based on linkage (TDL) was proposed. TDL defines a new measure of distance between two keywords, which reorients and organizes those keywords into clusters. It is based on link structure analysis underpinned by computational models. The mechanism of ranking and recommending was imported. The experiment, which based on the snapshot of Wikipedia (May 2009), indicates that TDL would significantly increase the accuracy of knowledge retrieval in Wikipedia and this new algorithm can improve the users- satisfaction by 7% compared with the present one.
Keywords:Wikipedia  Wikipedia  link analysis  knowledge discovery in databases
本文献已被 万方数据 等数据库收录!
点击此处可从《北京航空航天大学学报》浏览原始摘要信息
点击此处可从《北京航空航天大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号