Method of principal component analysis based on intra-class distance and inter-class distance

Original title: 基于类内和类间距离的主成分分析算法. Cited by: 15
Abstract: To improve the dimension-reduction results for high-dimensional data and the discriminative ability of its low-dimensional representation, this paper studies intra-class and inter-class distances and proposes a principal component analysis data dimension-reduction algorithm based on intra-class and inter-class distance (IOPCA). The information entropy of each attribute is computed and compared with an entropy threshold to screen the features of the data matrix. PCA is then modified to combine maximization of the inter-class distance with minimization of the intra-class distance, and the improved algorithm is used to reduce the dimension of the data. The reduced data are classified with the KNN and SVM algorithms. Simulation results, compared against the PCA, E-PCA, and LDA algorithms, show that the proposed algorithm improves the dimension-reduction result and also effectively improves the discriminative performance of the resulting low-dimensional data.
Authors: ZHANG Su-zhi (张素智); CHEN Xiao-ni (陈小妮); YANG Rui (杨芮); LI Peng-hui (李鹏辉); CAI Qiang (蔡强). Affiliations: School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China; Beijing Key Laboratory of Big Data Technology for Food Safety, Beijing Technology and Business University, Beijing 100048, China.
Source: Computer Engineering and Design (《计算机工程与设计》), Peking University Core Journal (北大核心), 2020, No. 8, pp. 2177-2183 (7 pages).
Funding: Beijing Key Laboratory Open Project Fund (BKBD-2017KF08); National Natural Science Foundation of China (61802353).
Keywords: information entropy; intra-class distance; inter-class distance; principal component analysis; data dimension reduction
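About the authors: ZHANG Su-zhi (b. 1965), male, from Jiaozuo, Henan; Ph.D., professor, CCF member; research interests: Web databases, distributed computing, and heterogeneous system integration. CHEN Xiao-ni (b. 1993), female, from Handan, Hebei; master's student, CCF student member; research interests: big data mining and analysis. YANG Rui (b. 1994), female, from Zhumadian, Henan; master's student, CCF student member; research interests: big data analysis and mining. LI Peng-hui (b. 1993), male, from Sanmenxia, Henan; master's student; research interest: big data mining. CAI Qiang (b. 1969), male, from Chongqing; Ph.D., professor; research interests: computer graphics, scientific visualization, intelligent information processing, and food safety information technology. E-mail: zhsuzhi@zzuli.edu.cn.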
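
The abstract names two steps, entropy-based feature screening and a PCA variant that favors large inter-class and small intra-class distance, but gives no formulas. The following is a minimal sketch of one way those steps might look, not the authors' IOPCA implementation. Everything specific here is an assumption: the equal-width binning used to estimate entropy, the choice to keep attributes whose entropy is at or above the threshold, the scatter-difference criterion (top eigenvectors of S_b - S_w), and all function names and the toy data.

```python
import numpy as np


def attribute_entropy(column, bins=10):
    """Shannon entropy (bits) of one attribute after equal-width binning (assumed estimator)."""
    counts, _ = np.histogram(column, bins=bins)
    p = counts[counts > 0] / counts.sum()
    return float(-(p * np.log2(p)).sum())


def screen_features(X, threshold, bins=10):
    """Keep columns whose entropy is >= threshold (the comparison direction is an assumption)."""
    ent = np.array([attribute_entropy(X[:, j], bins) for j in range(X.shape[1])])
    keep = np.flatnonzero(ent >= threshold)
    return X[:, keep], keep


def scatter_matrices(X, y):
    """Within-class scatter S_w and between-class scatter S_b."""
    mean_all = X.mean(axis=0)
    d = X.shape[1]
    Sw = np.zeros((d, d))
    Sb = np.zeros((d, d))
    for c in np.unique(y):
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        diff = Xc - mc
        Sw += diff.T @ diff
        gap = (mc - mean_all)[:, None]
        Sb += Xc.shape[0] * (gap @ gap.T)
    return Sw, Sb


def discriminative_projection(X, y, k):
    """Project onto the k leading eigenvectors of S_b - S_w (assumed criterion)."""
    Sw, Sb = scatter_matrices(X, y)
    vals, vecs = np.linalg.eigh(Sb - Sw)  # symmetric matrix, eigenvalues ascending
    W = vecs[:, np.argsort(vals)[::-1][:k]]
    return (X - X.mean(axis=0)) @ W, W


# Toy data: two classes separated along feature 0, noise in feature 1,
# and a constant feature 2 that carries no information.
rng = np.random.default_rng(0)
n = 50
X0 = np.column_stack([rng.normal(0, 1, n), rng.normal(0, 1, n), np.zeros(n)])
X1 = np.column_stack([rng.normal(6, 1, n), rng.normal(0, 1, n), np.zeros(n)])
X = np.vstack([X0, X1])
y = np.repeat([0, 1], n)

X_sel, kept = screen_features(X, threshold=0.5)
Z, W = discriminative_projection(X_sel, y, k=1)
print("kept columns:", kept.tolist())
print("projected shape:", Z.shape)
```

In this toy setup the constant column has zero entropy and is screened out, and the single retained direction aligns mostly with the feature that separates the classes. The reduced data `Z` could then be passed to any KNN or SVM classifier, as the abstract describes.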
