首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于潜在语义索引的谱聚类方法研究
引用本文:冯霞,闫冠男,李娟娟.一种基于潜在语义索引的谱聚类方法研究[J].中国民航学院学报,2011(3):47-51.
作者姓名:冯霞  闫冠男  李娟娟
作者单位:[1]中国民航大学计算机科学与技术学院,天津300300 [2]中国民航信息技术科研基地,天津300300
基金项目:国家自然科学基金项目(60776806,60672174);中国民航大学博士启动基金(06qd08s)
摘    要:传统的文本聚类算法存在文本向量维度过高,算法易陷入局部最优问题。针对上述问题,提出了一种适用于文本的基于潜在语义索引的谱聚类方法,该方法应用了潜在语义索引和谱聚类方法的优点。不仅分析了词与词之间的语义关系,而且适用于任意形状分布的样拳数据聚类。针对航空安全报告的聚类实验表明,该方法取得了较好的聚类效果。

关 键 词:文本聚类  潜在语义索引  奇异值分解  谱聚类

Research on Spectral Clustering Based on Latent Semantic Indexing
FENG Xia,YAN Guan-nan,LI Juan-juan.Research on Spectral Clustering Based on Latent Semantic Indexing[J].Journal of Civil Aviation University of China,2011(3):47-51.
Authors:FENG Xia  YAN Guan-nan  LI Juan-juan
Institution:1. College of Computer Science and Technology, CA UC, Tianjin 300300, China; 2. Information Technology Research Base, Civil A viation Administration of China, Tianjin 300300, China)
Abstract:There is a problem that the text vector dimension is too high and the algorithm is easy to fall into local optimum problem in traditional text clustering. About this problem, this paper presents a spectral clustering method based on Latent Semantic Index (LSI), which uses the advantages of both. Not only analyzed the words and semantic relations between words, but also applies to any shape of the distribution of sample data clustering. The clustering experiment of aviation safety report shows that this method has a good clustering result.
Keywords:text clustering  LSI  SVD  spectral clustering
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号