The support vector machine(SVM)is a classical machine learning method.Both the hinge loss and least absolute shrinkage and selection operator(LASSO)penalty are usually used in traditional SVMs.However,the hinge loss i...The support vector machine(SVM)is a classical machine learning method.Both the hinge loss and least absolute shrinkage and selection operator(LASSO)penalty are usually used in traditional SVMs.However,the hinge loss is not differentiable,and the LASSO penalty does not have the Oracle property.In this paper,the huberized loss is combined with non-convex penalties to obtain a model that has the advantages of both the computational simplicity and the Oracle property,contributing to higher accuracy than traditional SVMs.It is experimentally demonstrated that the two non-convex huberized-SVM methods,smoothly clipped absolute deviation huberized-SVM(SCAD-HSVM)and minimax concave penalty huberized-SVM(MCP-HSVM),outperform the traditional SVM method in terms of the prediction accuracy and classifier performance.They are also superior in terms of variable selection,especially when there is a high linear correlation between the variables.When they are applied to the prediction of listed companies,the variables that can affect and predict financial distress are accurately filtered out.Among all the indicators,the indicators per share have the greatest influence while those of solvency have the weakest influence.Listed companies can assess the financial situation with the indicators screened by our algorithm and make an early warning of their possible financial distress in advance with higher precision.展开更多
The relationship among Mercer kernel, reproducing kernel and positive definite kernel in support vector machine (SVM) is proved and their roles in SVM are discussed. The quadratic form of the kernel matrix is used t...The relationship among Mercer kernel, reproducing kernel and positive definite kernel in support vector machine (SVM) is proved and their roles in SVM are discussed. The quadratic form of the kernel matrix is used to confirm the positive definiteness and their construction. Based on the Bochner theorem, some translation invariant kernels are checked in their Fourier domain. Some rotation invariant radial kernels are inspected according to the Schoenberg theorem. Finally, the construction of discrete scaling and wavelet kernels, the kernel selection and the kernel parameter learning are discussed.展开更多
Laser-induced breakdown spectroscopy(LIBS) is a versatile tool for both qualitative and quantitative analysis.In this paper,LIBS combined with principal component analysis(PCA) and support vector machine(SVM) is...Laser-induced breakdown spectroscopy(LIBS) is a versatile tool for both qualitative and quantitative analysis.In this paper,LIBS combined with principal component analysis(PCA) and support vector machine(SVM) is applied to rock analysis.Fourteen emission lines including Fe,Mg,Ca,Al,Si,and Ti are selected as analysis lines.A good accuracy(91.38% for the real rock) is achieved by using SVM to analyze the spectroscopic peak area data which are processed by PCA.It can not only reduce the noise and dimensionality which contributes to improving the efficiency of the program,but also solve the problem of linear inseparability by combining PCA and SVM.By this method,the ability of LIBS to classify rock is validated.展开更多
Indoor localization has gained much attention over several decades due to enormous applications. However, the accuracy of indoor localization is hard to improve because the signal propagation has small scale effects w...Indoor localization has gained much attention over several decades due to enormous applications. However, the accuracy of indoor localization is hard to improve because the signal propagation has small scale effects which leads to inaccurate measurements. In this paper, we propose an efficient learning approach that combines grid search based kernel support vector machine and principle component analysis. The proposed approach applies principle component analysis to reduce high dimensional measurements. Then we design a grid search algorithm to optimize the parameters of kernel support vector machine in order to improve the localization accuracy. Experimental results indicate that the proposed approach reduces the localization error and improves the computational efficiency comparing with K-nearest neighbor, Back Propagation Neural Network and Support Vector Machine based methods.展开更多
A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speec...A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.展开更多
The discrimination of neutrons from gamma rays in a mixed radiation field is crucial in neutron detection tasks.Several approaches have been proposed to enhance the performance and accuracy of neutron-gamma discrimina...The discrimination of neutrons from gamma rays in a mixed radiation field is crucial in neutron detection tasks.Several approaches have been proposed to enhance the performance and accuracy of neutron-gamma discrimination.However,their performances are often associated with certain factors,such as experimental requirements and resulting mixed signals.The main purpose of this study is to achieve fast and accurate neutron-gamma discrimination without a priori information on the signal to be analyzed,as well as the experimental setup.Here,a novel method is proposed based on two concepts.The first method exploits the power of nonnegative tensor factorization(NTF)as a blind source separation method to extract the original components from the mixture signals recorded at the output of the stilbene scintillator detector.The second one is based on the principles of support vector machine(SVM)to identify and discriminate these components.In addition to these two main methods,we adopted the Mexican-hat function as a continuous wavelet transform to characterize the components extracted using the NTF model.The resulting scalograms are processed as colored images,which are segmented into two distinct classes using the Otsu thresholding method to extract the features of interest of the neutrons and gamma-ray components from the background noise.We subsequently used principal component analysis to select the most significant of these features wich are used in the training and testing datasets for SVM.Bias-variance analysis is used to optimize the SVM model by finding the optimal level of model complexity with the highest possible generalization performance.In this framework,the obtained results have verified a suitable bias–variance trade-off value.We achieved an operational SVM prediction model for neutron-gamma classification with a high true-positive rate.The accuracy and performance of the SVM based on the NTF was evaluated and validated by comparing it to the charge comparison method via figure of merit.The results indicate that the proposed approach has a superior discrimination quality(figure of merit of 2.20).展开更多
Filament-induced breakdown spectroscopy(FIBS)combined with machine learning algorithms was used to identify five aluminum alloys.To study the effect of the distance between focusing lens and target surface on the iden...Filament-induced breakdown spectroscopy(FIBS)combined with machine learning algorithms was used to identify five aluminum alloys.To study the effect of the distance between focusing lens and target surface on the identification accuracy of aluminum alloys,principal component analysis(PCA)combined with support vector machine(SVM)and Knearest neighbor(KNN)was used.The intensity and intensity ratio of fifteen lines of six elements(Fe,Si,Mg,Cu,Zn,and Mn)in the FIBS spectrum were selected.The distances between the focusing lens and the target surface in the pre-filament,filament,and post-filament were 958 mm,976 mm,and 1000 mm,respectively.The source data set was fifteen spectral line intensity ratios,and the cumulative interpretation rates of PC1,PC2,and PC3 were 97.22%,98.17%,and 95.31%,respectively.The first three PCs obtained by PCA were the input variables of SVM and KNN.The identification accuracy of the different positions of focusing lens and target surface was obtained,and the identification accuracy of SVM and KNN in the filament was 100%and 90%,respectively.The source data set of the filament was obtained by PCA for the first three PCs,which were randomly selected as the training set and test set of SVM and KNN in 3:2.The identification accuracy of SVM and KNN was 97.5%and 92.5%,respectively.The research results can provide a reference for the identification of aluminum alloys by FIBS.展开更多
文摘The support vector machine(SVM)is a classical machine learning method.Both the hinge loss and least absolute shrinkage and selection operator(LASSO)penalty are usually used in traditional SVMs.However,the hinge loss is not differentiable,and the LASSO penalty does not have the Oracle property.In this paper,the huberized loss is combined with non-convex penalties to obtain a model that has the advantages of both the computational simplicity and the Oracle property,contributing to higher accuracy than traditional SVMs.It is experimentally demonstrated that the two non-convex huberized-SVM methods,smoothly clipped absolute deviation huberized-SVM(SCAD-HSVM)and minimax concave penalty huberized-SVM(MCP-HSVM),outperform the traditional SVM method in terms of the prediction accuracy and classifier performance.They are also superior in terms of variable selection,especially when there is a high linear correlation between the variables.When they are applied to the prediction of listed companies,the variables that can affect and predict financial distress are accurately filtered out.Among all the indicators,the indicators per share have the greatest influence while those of solvency have the weakest influence.Listed companies can assess the financial situation with the indicators screened by our algorithm and make an early warning of their possible financial distress in advance with higher precision.
基金Supported by the National Natural Science Foundation of China(60473035)~~
文摘The relationship among Mercer kernel, reproducing kernel and positive definite kernel in support vector machine (SVM) is proved and their roles in SVM are discussed. The quadratic form of the kernel matrix is used to confirm the positive definiteness and their construction. Based on the Bochner theorem, some translation invariant kernels are checked in their Fourier domain. Some rotation invariant radial kernels are inspected according to the Schoenberg theorem. Finally, the construction of discrete scaling and wavelet kernels, the kernel selection and the kernel parameter learning are discussed.
基金Project supported by the National Natural Science Foundation of China(Grant No.11075184)the Knowledge Innovation Program of the Chinese Academy of Sciences(CAS)(Grant No.Y03RC21124)the CAS President’s International Fellowship Initiative Foundation(Grant No.2015VMA007)
文摘Laser-induced breakdown spectroscopy(LIBS) is a versatile tool for both qualitative and quantitative analysis.In this paper,LIBS combined with principal component analysis(PCA) and support vector machine(SVM) is applied to rock analysis.Fourteen emission lines including Fe,Mg,Ca,Al,Si,and Ti are selected as analysis lines.A good accuracy(91.38% for the real rock) is achieved by using SVM to analyze the spectroscopic peak area data which are processed by PCA.It can not only reduce the noise and dimensionality which contributes to improving the efficiency of the program,but also solve the problem of linear inseparability by combining PCA and SVM.By this method,the ability of LIBS to classify rock is validated.
基金supported by“the Fundamental Research Funds for the Central Universities No. 2017JBM016”
文摘Indoor localization has gained much attention over several decades due to enormous applications. However, the accuracy of indoor localization is hard to improve because the signal propagation has small scale effects which leads to inaccurate measurements. In this paper, we propose an efficient learning approach that combines grid search based kernel support vector machine and principle component analysis. The proposed approach applies principle component analysis to reduce high dimensional measurements. Then we design a grid search algorithm to optimize the parameters of kernel support vector machine in order to improve the localization accuracy. Experimental results indicate that the proposed approach reduces the localization error and improves the computational efficiency comparing with K-nearest neighbor, Back Propagation Neural Network and Support Vector Machine based methods.
文摘A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.
基金L’Ore´al-UNESCO for the Women in Science Maghreb Program Grant Agreement No.4500410340.
文摘The discrimination of neutrons from gamma rays in a mixed radiation field is crucial in neutron detection tasks.Several approaches have been proposed to enhance the performance and accuracy of neutron-gamma discrimination.However,their performances are often associated with certain factors,such as experimental requirements and resulting mixed signals.The main purpose of this study is to achieve fast and accurate neutron-gamma discrimination without a priori information on the signal to be analyzed,as well as the experimental setup.Here,a novel method is proposed based on two concepts.The first method exploits the power of nonnegative tensor factorization(NTF)as a blind source separation method to extract the original components from the mixture signals recorded at the output of the stilbene scintillator detector.The second one is based on the principles of support vector machine(SVM)to identify and discriminate these components.In addition to these two main methods,we adopted the Mexican-hat function as a continuous wavelet transform to characterize the components extracted using the NTF model.The resulting scalograms are processed as colored images,which are segmented into two distinct classes using the Otsu thresholding method to extract the features of interest of the neutrons and gamma-ray components from the background noise.We subsequently used principal component analysis to select the most significant of these features wich are used in the training and testing datasets for SVM.Bias-variance analysis is used to optimize the SVM model by finding the optimal level of model complexity with the highest possible generalization performance.In this framework,the obtained results have verified a suitable bias–variance trade-off value.We achieved an operational SVM prediction model for neutron-gamma classification with a high true-positive rate.The accuracy and performance of the SVM based on the NTF was evaluated and validated by comparing it to the charge comparison method via figure of merit.The results indicate that the proposed approach has a superior discrimination quality(figure of merit of 2.20).
基金Project supported by the Natural Science Foundation of Jilin Province,China(Grant No.2020122348JC)。
文摘Filament-induced breakdown spectroscopy(FIBS)combined with machine learning algorithms was used to identify five aluminum alloys.To study the effect of the distance between focusing lens and target surface on the identification accuracy of aluminum alloys,principal component analysis(PCA)combined with support vector machine(SVM)and Knearest neighbor(KNN)was used.The intensity and intensity ratio of fifteen lines of six elements(Fe,Si,Mg,Cu,Zn,and Mn)in the FIBS spectrum were selected.The distances between the focusing lens and the target surface in the pre-filament,filament,and post-filament were 958 mm,976 mm,and 1000 mm,respectively.The source data set was fifteen spectral line intensity ratios,and the cumulative interpretation rates of PC1,PC2,and PC3 were 97.22%,98.17%,and 95.31%,respectively.The first three PCs obtained by PCA were the input variables of SVM and KNN.The identification accuracy of the different positions of focusing lens and target surface was obtained,and the identification accuracy of SVM and KNN in the filament was 100%and 90%,respectively.The source data set of the filament was obtained by PCA for the first three PCs,which were randomly selected as the training set and test set of SVM and KNN in 3:2.The identification accuracy of SVM and KNN was 97.5%and 92.5%,respectively.The research results can provide a reference for the identification of aluminum alloys by FIBS.