Over-Fitting and Its Countermeasure in Feature Selection Based on Rough Set
Cite this article: ZHANG Wendong, QI Hui, LIU Keyu, YANG Xibei. Over-Fitting and Its Countermeasure in Feature Selection Based on Rough Set[J]. Journal of Nanjing University of Aeronautics & Astronautics, 2019, 51(5): 687-692
Authors: ZHANG Wendong  QI Hui  LIU Keyu  YANG Xibei
Affiliation: 1. School of Computer, Jiangsu University of Science and Technology, Zhenjiang, 212003, China; 2. Computer Science and Technology Department, Taiyuan Normal University, Taiyuan, 030619, China
Funding: Supported by the National Natural Science Foundation of China (61572242, 61502211, 61503160).
Abstract:
In rough set methods, feature selection with a forward heuristic algorithm adds the most significant remaining feature step by step until a given constraint is satisfied. However, a feature subset selected by this strategy may over-fit. To address this problem, a new heuristic algorithm is designed: the significance of each feature is computed by cross validation, and an early-stopping (truncation) mechanism terminates the algorithm when over-fitting occurs. Using the neighborhood rough set model, the proposed algorithm is compared with the conventional heuristic algorithm on several UCI data sets. The experimental results show that the proposed algorithm effectively reduces the degree of over-fitting, and that the feature subset it yields can offer better classification performance.

Keywords: feature selection  heuristic algorithm  neighborhood rough set  over-fitting
Received: 2018-05-10
Revised: 2018-06-30
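
The idea summarized in the abstract (forward selection driven by a cross-validated significance, with early stopping) can be illustrated by a minimal sketch. It is not the authors' algorithm: the abstract does not give the significance measure or the exact stopping rule, so the sketch assumes the held-out accuracy of a simple neighborhood classifier as the significance and stops as soon as no candidate feature improves it. The names `predict`, `cv_score`, `forward_select`, the radius `delta=0.2`, the fold count `k=5`, and the toy data are all hypothetical.

```python
import math
import random
from collections import Counter


def predict(train_X, train_y, x, features, delta):
    """Majority label among training samples within distance delta of x."""
    dists = [math.sqrt(sum((x[f] - u[f]) ** 2 for f in features)) for u in train_X]
    votes = [lab for d, lab in zip(dists, train_y) if d <= delta]
    if not votes:  # empty neighborhood: fall back to the single nearest sample
        votes = [train_y[dists.index(min(dists))]]
    return Counter(votes).most_common(1)[0][0]


def cv_score(X, y, features, delta, folds):
    """Mean held-out accuracy over the folds (assumed stand-in for significance)."""
    accs = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in range(len(X)) if i not in held_out]
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        hits = sum(predict(train_X, train_y, X[i], features, delta) == y[i] for i in fold)
        accs.append(hits / len(fold))
    return sum(accs) / len(accs)


def forward_select(X, y, delta=0.2, k=5, seed=0):
    """Greedy forward selection, truncated when the cross-validated score stops improving."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    selected, remaining = [], list(range(len(X[0])))
    best = cv_score(X, y, selected, delta, folds)
    while remaining:
        # Score every candidate on held-out data; ties go to the lowest feature index.
        score, neg_f = max(
            (cv_score(X, y, selected + [f], delta, folds), -f) for f in remaining
        )
        if score <= best:  # no held-out gain: stop instead of fitting further
            break
        selected.append(-neg_f)
        remaining.remove(-neg_f)
        best = score
    return selected, best


# Toy data (hypothetical): feature 0 separates the two classes, feature 1 is noise.
rng = random.Random(1)
y = [i % 2 for i in range(40)]
X = [[lab + rng.uniform(-0.05, 0.05), rng.random()] for lab in y]

selected, score = forward_select(X, y)
print(selected, score)  # prints: [0] 1.0
```

On this toy data the noise feature is never added, because it cannot raise the held-out score above what feature 0 already achieves. A plain forward heuristic that scores candidates on the training data alone has no such check and would keep adding features until its constraint is met.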
This article is indexed in CNKI and other databases.