To overcome the too fine-grained granularity problem of multivariate grey incidence analysis and to explore the comprehensive incidence analysis model, three multivariate grey incidences degree models based on princip...To overcome the too fine-grained granularity problem of multivariate grey incidence analysis and to explore the comprehensive incidence analysis model, three multivariate grey incidences degree models based on principal component analysis (PCA) are proposed. Firstly, the PCA method is introduced to extract the feature sequences of a behavioral matrix. Then, the grey incidence analysis between two behavioral matrices is transformed into the similarity and nearness measure between their feature sequences. Based on the classic grey incidence analysis theory, absolute and relative incidence degree models for feature sequences are constructed, and a comprehensive grey incidence model is proposed. Furthermore, the properties of models are researched. It proves that the proposed models satisfy the properties of translation invariance, multiple transformation invariance, and axioms of the grey incidence analysis, respectively. Finally, a case is studied. The results illustrate that the model is effective than other multivariate grey incidence analysis models.展开更多
Oil–water two-phase flow patterns in a horizontal pipe are analyzed with a 16-electrode electrical resistance tomography(ERT) system. The measurement data of the ERT are treated as a multivariate time-series, thus th...Oil–water two-phase flow patterns in a horizontal pipe are analyzed with a 16-electrode electrical resistance tomography(ERT) system. The measurement data of the ERT are treated as a multivariate time-series, thus the information extracted from each electrode represents the local phase distribution and fraction change at that location. The multivariate maximum Lyapunov exponent(MMLE) is extracted from the 16-dimension time-series to demonstrate the change of flow pattern versus the superficial velocity ratio of oil to water. The correlation dimension of the multivariate time-series is further introduced to jointly characterize and finally separate the flow patterns with MMLE. The change of flow patterns with superficial oil velocity at different water superficial velocities is studied with MMLE and correlation dimension, respectively, and the flow pattern transition can also be characterized with these two features. The proposed MMLE and correlation dimension map could effectively separate the flow patterns, thus is an effective tool for flow pattern identification and transition analysis.展开更多
In this article, authors introduce a method to assess local influence of obser- vations on the parameter estimates and prediction in multivariate regression model. The diagnostics under the perturbations of error vari...In this article, authors introduce a method to assess local influence of obser- vations on the parameter estimates and prediction in multivariate regression model. The diagnostics under the perturbations of error variance, response variables and explanatory variables are derived, and the results are compared with those of case- deletion. Two examples are analyzed for illustration.展开更多
In order to solve the problem that existing multivariate grey incidence models cannot be applied to time series on different scales, a new model is proposed based on spatial pyramid pooling.Firstly, local features of ...In order to solve the problem that existing multivariate grey incidence models cannot be applied to time series on different scales, a new model is proposed based on spatial pyramid pooling.Firstly, local features of multivariate time series on different scales are pooled and aggregated by spatial pyramid pooling to construct n levels feature pooling matrices on the same scale. Secondly,Deng's multivariate grey incidence model is introduced to measure the degree of incidence between feature pooling matrices at each level. Thirdly, grey incidence degrees at each level are integrated into a global incidence degree. Finally, the performance of the proposed model is verified on two data sets compared with a variety of algorithms. The results illustrate that the proposed model is more effective and efficient than other similarity measure algorithms.展开更多
Multivariate statistical techniques,such as cluster analysis(CA),discriminant analysis(DA),principal component analysis(PCA) and factor analysis(FA),were applied to evaluate and interpret the surface water quality dat...Multivariate statistical techniques,such as cluster analysis(CA),discriminant analysis(DA),principal component analysis(PCA) and factor analysis(FA),were applied to evaluate and interpret the surface water quality data sets of the Second Songhua River(SSHR) basin in China,obtained during two years(2012-2013) of monitoring of 10 physicochemical parameters at 15 different sites.The results showed that most of physicochemical parameters varied significantly among the sampling sites.Three significant groups,highly polluted(HP),moderately polluted(MP) and less polluted(LP),of sampling sites were obtained through Hierarchical agglomerative CA on the basis of similarity of water quality characteristics.DA identified p H,F,DO,NH3-N,COD and VPhs were the most important parameters contributing to spatial variations of surface water quality.However,DA did not give a considerable data reduction(40% reduction).PCA/FA resulted in three,three and four latent factors explaining 70%,62% and 71% of the total variance in water quality data sets of HP,MP and LP regions,respectively.FA revealed that the SSHR water chemistry was strongly affected by anthropogenic activities(point sources:industrial effluents and wastewater treatment plants;non-point sources:domestic sewage,livestock operations and agricultural activities) and natural processes(seasonal effect,and natural inputs).PCA/FA in the whole basin showed the best results for data reduction because it used only two parameters(about 80% reduction) as the most important parameters to explain 72% of the data variation.Thus,this work illustrated the utility of multivariate statistical techniques for analysis and interpretation of datasets and,in water quality assessment,identification of pollution sources/factors and understanding spatial variations in water quality for effective stream water quality management.展开更多
In order to effectively analyse the multivariate time series data of complex process,a generic reconstruction technology based on reduction theory of rough sets was proposed,Firstly,the phase space of multivariate tim...In order to effectively analyse the multivariate time series data of complex process,a generic reconstruction technology based on reduction theory of rough sets was proposed,Firstly,the phase space of multivariate time series was originally reconstructed by a classical reconstruction technology.Then,the original decision-table of rough set theory was set up according to the embedding dimensions and time-delays of the original reconstruction phase space,and the rough set reduction was used to delete the redundant dimensions and irrelevant variables and to reconstruct the generic phase space,Finally,the input vectors for the prediction of multivariate time series were extracted according to generic reconstruction results to identify the parameters of prediction model.Verification results show that the developed reconstruction method leads to better generalization ability for the prediction model and it is feasible and worthwhile for application.展开更多
Background:Manga nese(Mn)is an essential microelement in cotton seeds,which is usually determined by the techniques relied on hazardous reagents and complex pretreatment procedures.Therefore a rapid,low-cost,and reage...Background:Manga nese(Mn)is an essential microelement in cotton seeds,which is usually determined by the techniques relied on hazardous reagents and complex pretreatment procedures.Therefore a rapid,low-cost,and reagent-free analytical way is demanded to substitute the traditional analytical method.Results:The Mn content in cottonseed meal was investigated by near-infrared spectroscopy(NIRS)and chemometrics techniques.Standard normal variate(SNV)combined with first derivatives(FD)was the optimal spectra pre-treatment method.Monte Carlo uninformative variable elimination(MCUVE)and successive projections algorithm method(SPA)were employed to extract the informative variables from the full NIR spectra.The lin ear and non linear calibration models for cott on seed Mn content were developed.Finally,the optimal model for cottonseed Mn content was obtained by MCUVE-SPA-LSSVM,with root mean squares error of prediction(RMSEP)of 1.994 6,coefficient of determination(R^2)of 0.949 3,and the residual predictive deviation(RPD)of 4.370 5,respectively.Conclusions:The MCUVE-SPA-LSSVM model is accuracy enough to measure the Mn content in cottonseed meal,which can be used as an alter native way to substitute for traditional analytical method.展开更多
为有效解决多维时间序列(multivariate time series, MTS)无监督异常检测模型中自编码器模块容易拟合异常样本、正常MTS样本对应的隐空间特征可能被重构为异常MTS的问题,设计一种具有三重生成对抗的MTS异常检测模型。以LSTM自编码器为...为有效解决多维时间序列(multivariate time series, MTS)无监督异常检测模型中自编码器模块容易拟合异常样本、正常MTS样本对应的隐空间特征可能被重构为异常MTS的问题,设计一种具有三重生成对抗的MTS异常检测模型。以LSTM自编码器为生成器,基于重构误差生成伪标签,由判别器区分经伪标签过滤后的重构MTS和原始MTS;采用两次对抗训练将LSTM自编码器的隐空间约束为均匀分布,减少LSTM自编码器隐空间特征重构出异常MTS的可能性。多个公开MTS数据集上的实验结果表明,T-GAN能在带有污染数据的训练集上更好学习正常MTS分布,取得较高的异常检测效果。展开更多
基金supported by the National Natural Science Foundation of China(71401052)the Key Project of National Social Science Fund of China(12AZD108)+2 种基金the Doctoral Fund of Ministry of Education(20120094120024)the Philosophy and Social Science Fund of Jiangsu Province Universities(2013SJD630073)the Central University Basic Service Project Fee of Hohai University(2011B09914)
文摘To overcome the too fine-grained granularity problem of multivariate grey incidence analysis and to explore the comprehensive incidence analysis model, three multivariate grey incidences degree models based on principal component analysis (PCA) are proposed. Firstly, the PCA method is introduced to extract the feature sequences of a behavioral matrix. Then, the grey incidence analysis between two behavioral matrices is transformed into the similarity and nearness measure between their feature sequences. Based on the classic grey incidence analysis theory, absolute and relative incidence degree models for feature sequences are constructed, and a comprehensive grey incidence model is proposed. Furthermore, the properties of models are researched. It proves that the proposed models satisfy the properties of translation invariance, multiple transformation invariance, and axioms of the grey incidence analysis, respectively. Finally, a case is studied. The results illustrate that the model is effective than other multivariate grey incidence analysis models.
基金Projects(61227006,61473206) supported by the National Natural Science Foundation of ChinaProject(13TXSYJC40200) supported by Science and Technology Innovation of Tianjin,China
文摘Oil–water two-phase flow patterns in a horizontal pipe are analyzed with a 16-electrode electrical resistance tomography(ERT) system. The measurement data of the ERT are treated as a multivariate time-series, thus the information extracted from each electrode represents the local phase distribution and fraction change at that location. The multivariate maximum Lyapunov exponent(MMLE) is extracted from the 16-dimension time-series to demonstrate the change of flow pattern versus the superficial velocity ratio of oil to water. The correlation dimension of the multivariate time-series is further introduced to jointly characterize and finally separate the flow patterns with MMLE. The change of flow patterns with superficial oil velocity at different water superficial velocities is studied with MMLE and correlation dimension, respectively, and the flow pattern transition can also be characterized with these two features. The proposed MMLE and correlation dimension map could effectively separate the flow patterns, thus is an effective tool for flow pattern identification and transition analysis.
文摘In this article, authors introduce a method to assess local influence of obser- vations on the parameter estimates and prediction in multivariate regression model. The diagnostics under the perturbations of error variance, response variables and explanatory variables are derived, and the results are compared with those of case- deletion. Two examples are analyzed for illustration.
基金supported by the National Natural Science Foundation of China(71401052)the Fundamental Research Funds for the Central Universities(2019B19514)。
文摘In order to solve the problem that existing multivariate grey incidence models cannot be applied to time series on different scales, a new model is proposed based on spatial pyramid pooling.Firstly, local features of multivariate time series on different scales are pooled and aggregated by spatial pyramid pooling to construct n levels feature pooling matrices on the same scale. Secondly,Deng's multivariate grey incidence model is introduced to measure the degree of incidence between feature pooling matrices at each level. Thirdly, grey incidence degrees at each level are integrated into a global incidence degree. Finally, the performance of the proposed model is verified on two data sets compared with a variety of algorithms. The results illustrate that the proposed model is more effective and efficient than other similarity measure algorithms.
基金Project (2012ZX07501002-001) supported by the Ministry of Science and Technology of China
文摘Multivariate statistical techniques,such as cluster analysis(CA),discriminant analysis(DA),principal component analysis(PCA) and factor analysis(FA),were applied to evaluate and interpret the surface water quality data sets of the Second Songhua River(SSHR) basin in China,obtained during two years(2012-2013) of monitoring of 10 physicochemical parameters at 15 different sites.The results showed that most of physicochemical parameters varied significantly among the sampling sites.Three significant groups,highly polluted(HP),moderately polluted(MP) and less polluted(LP),of sampling sites were obtained through Hierarchical agglomerative CA on the basis of similarity of water quality characteristics.DA identified p H,F,DO,NH3-N,COD and VPhs were the most important parameters contributing to spatial variations of surface water quality.However,DA did not give a considerable data reduction(40% reduction).PCA/FA resulted in three,three and four latent factors explaining 70%,62% and 71% of the total variance in water quality data sets of HP,MP and LP regions,respectively.FA revealed that the SSHR water chemistry was strongly affected by anthropogenic activities(point sources:industrial effluents and wastewater treatment plants;non-point sources:domestic sewage,livestock operations and agricultural activities) and natural processes(seasonal effect,and natural inputs).PCA/FA in the whole basin showed the best results for data reduction because it used only two parameters(about 80% reduction) as the most important parameters to explain 72% of the data variation.Thus,this work illustrated the utility of multivariate statistical techniques for analysis and interpretation of datasets and,in water quality assessment,identification of pollution sources/factors and understanding spatial variations in water quality for effective stream water quality management.
基金Project(61025015) supported by the National Natural Science Funds for Distinguished Young Scholars of ChinaProject(21106036) supported by the National Natural Science Foundation of China+2 种基金Project(200805331103) supported by Research Fund for the Doctoral Program of Higher Education of ChinaProject(NCET-08-0576) supported by Program for New Century Excellent Talents in Universities of ChinaProject(11B038) supported by Scientific Research Fund for the Excellent Youth Scholars of Hunan Provincial Education Department,China
文摘In order to effectively analyse the multivariate time series data of complex process,a generic reconstruction technology based on reduction theory of rough sets was proposed,Firstly,the phase space of multivariate time series was originally reconstructed by a classical reconstruction technology.Then,the original decision-table of rough set theory was set up according to the embedding dimensions and time-delays of the original reconstruction phase space,and the rough set reduction was used to delete the redundant dimensions and irrelevant variables and to reconstruct the generic phase space,Finally,the input vectors for the prediction of multivariate time series were extracted according to generic reconstruction results to identify the parameters of prediction model.Verification results show that the developed reconstruction method leads to better generalization ability for the prediction model and it is feasible and worthwhile for application.
基金funded by The National Key Technology R&D program of China(2016YFD0101404)China Agriculture Research System(CARS-18-25)Jiangsu Collaborative Innovation Center for Modern Crop Production
文摘Background:Manga nese(Mn)is an essential microelement in cotton seeds,which is usually determined by the techniques relied on hazardous reagents and complex pretreatment procedures.Therefore a rapid,low-cost,and reagent-free analytical way is demanded to substitute the traditional analytical method.Results:The Mn content in cottonseed meal was investigated by near-infrared spectroscopy(NIRS)and chemometrics techniques.Standard normal variate(SNV)combined with first derivatives(FD)was the optimal spectra pre-treatment method.Monte Carlo uninformative variable elimination(MCUVE)and successive projections algorithm method(SPA)were employed to extract the informative variables from the full NIR spectra.The lin ear and non linear calibration models for cott on seed Mn content were developed.Finally,the optimal model for cottonseed Mn content was obtained by MCUVE-SPA-LSSVM,with root mean squares error of prediction(RMSEP)of 1.994 6,coefficient of determination(R^2)of 0.949 3,and the residual predictive deviation(RPD)of 4.370 5,respectively.Conclusions:The MCUVE-SPA-LSSVM model is accuracy enough to measure the Mn content in cottonseed meal,which can be used as an alter native way to substitute for traditional analytical method.
文摘为有效解决多维时间序列(multivariate time series, MTS)无监督异常检测模型中自编码器模块容易拟合异常样本、正常MTS样本对应的隐空间特征可能被重构为异常MTS的问题,设计一种具有三重生成对抗的MTS异常检测模型。以LSTM自编码器为生成器,基于重构误差生成伪标签,由判别器区分经伪标签过滤后的重构MTS和原始MTS;采用两次对抗训练将LSTM自编码器的隐空间约束为均匀分布,减少LSTM自编码器隐空间特征重构出异常MTS的可能性。多个公开MTS数据集上的实验结果表明,T-GAN能在带有污染数据的训练集上更好学习正常MTS分布,取得较高的异常检测效果。