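Principal component analysis of interval data based on center and log-radius
Cite this article: ZHAO Qing, WANG Huiwen, WANG Shanshan. Principal component analysis of interval data based on center and log-radius[J]. Journal of Beijing University of Aeronautics and Astronautics, 2021, 47(7): 1414-1421.
Authors: ZHAO Qing, WANG Huiwen, WANG Shanshan
Affiliation: 1. School of Economics and Management, Beihang University, Beijing 100083, China
Funding: National Natural Science Foundation of China (71420107025); National Natural Science Foundation of China (11701023)
Abstract: To study dimension reduction and visualization of multivariate interval data, each interval is represented as a two-dimensional array consisting of its center and the logarithm of its radius (half-length). Algebraic operation rules for interval data are established on this representation, and a new principal component analysis (PCA) method for interval data is proposed on that basis. Taking the logarithm of the radius guarantees that the radii of the resulting interval principal components are non-negative. The computation is simple and of low complexity, and the relative positions of the sample points change as little as possible under dimension reduction. Reducing the number of variables in the high-dimensional space allows a variety of classical statistical methods to be applied, and the sample points of the original high-dimensional space can be plotted in a low-dimensional space, which makes visualization of multivariate interval data possible. Simulation experiments demonstrate the effectiveness of the proposed method.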

Keywords: interval data; principal component analysis (PCA); center and log-radius; dimension reduction; covariance matrix
Received: 2020-05-29

A principal component analysis of interval data based on center and log-radius
ZHAO Qing, WANG Huiwen, WANG Shanshan. A principal component analysis of interval data based on center and log-radius[J]. Journal of Beijing University of Aeronautics and Astronautics, 2021, 47(7): 1414-1421.
Authors: ZHAO Qing, WANG Huiwen, WANG Shanshan
Institution: 1. School of Economics and Management, Beihang University, Beijing 100083, China; 2. Beijing Key Laboratory of Emergency Support Simulation Technologies for City Operations, Beijing 100083, China; 3. Beijing Advanced Innovation Center for Big Data and Brain Computing, Beihang University, Beijing 100083, China
Abstract: To study the dimension reduction and visualization of multivariate interval data, a two-dimensional array consisting of the center and the log-radius is used to represent each interval. Algebraic operation rules for interval data are then defined, and a new Principal Component Analysis (PCA) method for interval data is proposed on this basis. Taking the logarithm of the interval radius ensures that the radii of the resulting interval principal components are non-negative. The new method is simple to compute and has low complexity. Furthermore, the relative positions of the points in the sample set change as little as possible under the dimension reduction. Reducing the number of variables in the high-dimensional space allows various classical statistical analysis methods to be applied. In addition, the sample points of the original high-dimensional space can be depicted in a low-dimensional space, which makes it possible to visualize multivariate interval data. The results of simulation experiments verify the effectiveness of the proposed method.
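Keywords: interval data; principal component analysis (PCA); center and log-radius; dimension reduction; covariance matrix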
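
The abstract gives the idea of the method but not its formulas, so the following is only a minimal sketch of the general approach, not the authors' algorithm. It assumes that each interval [lower, upper] is mapped to (center, log of radius), that a single covariance matrix is formed by adding the covariance of the centers to the covariance of the log-radii, and that the same loadings are applied to both parts. The function names `to_center_logradius` and `interval_pca` are hypothetical. Exponentiating the projected log-radii is what keeps the radius of every interval principal component positive.

```python
import numpy as np


def to_center_logradius(lower, upper):
    """Map interval bounds to (center, log-radius). Requires upper > lower."""
    lower = np.asarray(lower, dtype=float)
    upper = np.asarray(upper, dtype=float)
    center = (lower + upper) / 2.0
    log_radius = np.log((upper - lower) / 2.0)
    return center, log_radius


def interval_pca(lower, upper, n_components=2):
    """Illustrative PCA on (center, log-radius) data.

    Assumption (not taken from the paper): the covariance matrix is the
    sum of the center covariance and the log-radius covariance.
    """
    center, log_radius = to_center_logradius(lower, upper)

    # Center both parts column-wise.
    c = center - center.mean(axis=0)
    l = log_radius - log_radius.mean(axis=0)
    n = c.shape[0]

    # Combined p x p covariance matrix (symmetric, positive semi-definite).
    cov = (c.T @ c + l.T @ l) / (n - 1)

    # eigh returns eigenvalues in ascending order; keep the largest ones.
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    loadings = eigvecs[:, order]

    # Project centers and log-radii with the same loadings.
    pc_center = c @ loadings
    pc_radius = np.exp(l @ loadings)  # exp() keeps every radius positive

    return pc_center - pc_radius, pc_center + pc_radius, eigvals[order]


# Usage on synthetic intervals: 50 samples, 4 interval-valued variables.
rng = np.random.default_rng(0)
centers = rng.normal(size=(50, 4))
radii = rng.uniform(0.1, 1.0, size=(50, 4))
pc_lower, pc_upper, variances = interval_pca(centers - radii, centers + radii)
```

Each row of `pc_lower` and `pc_upper` describes a rectangle in the plane of the first two components, which is one way the low-dimensional visualization mentioned in the abstract could be drawn.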
This article is indexed in CNKI, Wanfang Data, and other databases.