Feature selection for high-dimensional data in astronomy
Authors: Hongwen Zheng, Yanxia Zhang
Institution: (a) Institute of Mathematics and Physics, North China Electric Power University, Deshengmenwai, Zhuxinzhuang, Beijing 102206, China; (b) National Astronomical Observatories, CAS, 20A Datun Road, Chaoyang District, Beijing 100012, China
Abstract: As the volume of astronomical data grows exponentially, its complexity and dimensionality are growing rapidly as well. Extracting information from such data has become a critical and challenging problem. For example, some algorithms can only be applied in low-dimensional spaces, so feature selection and feature extraction have become important topics. Here we describe the difference between feature selection and feature extraction methods, and introduce the taxonomy of feature selection methods as well as the characteristics of each method. We present a case study comparing the performance and computational cost of different feature selection methods. For the filter method, ReliefF and the Fisher filter are adopted; for the wrapper method, improved CHAID, linear discriminant analysis (LDA), Naive Bayes (NB) and C4.5 are taken as learners. Applied to the sample, the results indicate that, in terms of computational cost, the filter method is superior to the wrapper method. Moreover, different learning algorithms combined with appropriate feature selection methods may achieve better performance.
Keywords: Method: data analysis; Feature selection; Astronomical catalogs; Sky survey
This article is indexed in ScienceDirect and other databases.
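
The filter/wrapper distinction drawn in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the paper's setup: the data are synthetic, a simple two-class Fisher score stands in for the filter, a nearest-centroid classifier stands in for the learners named in the abstract, and all function names are hypothetical. The filter ranks each feature once without training anything, while the wrapper re-evaluates the learner for every candidate subset, which is where its extra computational cost comes from.

```python
import random
from statistics import mean, pvariance


def make_data(n, rng):
    """Synthetic two-class sample: features 0 and 1 are informative, 2-5 are noise."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(3.0 * label, 1.0), rng.gauss(-3.0 * label, 1.0)]
        row += [rng.gauss(0.0, 1.0) for _ in range(4)]
        X.append(row)
        y.append(label)
    return X, y


def fisher_score(column, labels):
    """Two-class Fisher score: squared mean gap over summed within-class variance."""
    a = [x for x, lab in zip(column, labels) if lab == 0]
    b = [x for x, lab in zip(column, labels) if lab == 1]
    return (mean(a) - mean(b)) ** 2 / (pvariance(a) + pvariance(b) + 1e-12)


def filter_select(X, y, k):
    """Filter: score every feature once, keep the top k; no learner is trained."""
    n_features = len(X[0])
    scores = [fisher_score([row[j] for row in X], y) for j in range(n_features)]
    return sorted(range(n_features), key=lambda j: -scores[j])[:k]


def centroid_accuracy(X_train, y_train, X_val, y_val, features):
    """Stand-in learner: nearest-centroid classifier restricted to `features`."""
    centroids = {}
    for c in (0, 1):
        rows = [r for r, lab in zip(X_train, y_train) if lab == c]
        centroids[c] = [mean(r[j] for r in rows) for j in features]
    hits = 0
    for r, lab in zip(X_val, y_val):
        dist = {
            c: sum((r[j] - m) ** 2 for j, m in zip(features, centroids[c]))
            for c in centroids
        }
        hits += min(dist, key=dist.get) == lab
    return hits / len(y_val)


def wrapper_select(X_train, y_train, X_val, y_val, k):
    """Wrapper: greedy forward selection, re-running the learner per candidate."""
    chosen, evaluations = [], 0
    while len(chosen) < k:
        best_j, best_acc = None, -1.0
        for j in range(len(X_train[0])):
            if j in chosen:
                continue
            acc = centroid_accuracy(X_train, y_train, X_val, y_val, chosen + [j])
            evaluations += 1
            if acc > best_acc:
                best_j, best_acc = j, acc
        chosen.append(best_j)
    return chosen, evaluations


rng = random.Random(0)
X_train, y_train = make_data(200, rng)
X_val, y_val = make_data(200, rng)

filter_features = filter_select(X_train, y_train, 2)
wrapper_features, n_fits = wrapper_select(X_train, y_train, X_val, y_val, 2)

print("filter picks:", filter_features)
print("wrapper picks:", wrapper_features, "after", n_fits, "learner evaluations")
```

With 6 features and 2 to select, the wrapper here needs 6 + 5 = 11 learner evaluations against a single scoring pass for the filter; that gap widens quickly as the number of features grows.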