Lithofacies identification is a crucial work in reservoir characterization and modeling.The vast inter-well area can be supplemented by facies identification of seismic data.However,the relationship between lithofacie...Lithofacies identification is a crucial work in reservoir characterization and modeling.The vast inter-well area can be supplemented by facies identification of seismic data.However,the relationship between lithofacies and seismic information that is affected by many factors is complicated.Machine learning has received extensive attention in recent years,among which support vector machine(SVM) is a potential method for lithofacies classification.Lithofacies classification involves identifying various types of lithofacies and is generally a nonlinear problem,which needs to be solved by means of the kernel function.Multi-kernel learning SVM is one of the main tools for solving the nonlinear problem about multi-classification.However,it is very difficult to determine the kernel function and the parameters,which is restricted by human factors.Besides,its computational efficiency is low.A lithofacies classification method based on local deep multi-kernel learning support vector machine(LDMKL-SVM) that can consider low-dimensional global features and high-dimensional local features is developed.The method can automatically learn parameters of kernel function and SVM to build a relationship between lithofacies and seismic elastic information.The calculation speed will be expedited at no cost with respect to discriminant accuracy for multi-class lithofacies identification.Both the model data test results and the field data application results certify advantages of the method.This contribution offers an effective method for lithofacies recognition and reservoir prediction by using SVM.展开更多
The relationship among Mercer kernel, reproducing kernel and positive definite kernel in support vector machine (SVM) is proved and their roles in SVM are discussed. The quadratic form of the kernel matrix is used t...The relationship among Mercer kernel, reproducing kernel and positive definite kernel in support vector machine (SVM) is proved and their roles in SVM are discussed. The quadratic form of the kernel matrix is used to confirm the positive definiteness and their construction. Based on the Bochner theorem, some translation invariant kernels are checked in their Fourier domain. Some rotation invariant radial kernels are inspected according to the Schoenberg theorem. Finally, the construction of discrete scaling and wavelet kernels, the kernel selection and the kernel parameter learning are discussed.展开更多
Support vector machines (SVM) have been widely used in chaotic time series predictions in recent years. In order to enhance the prediction efficiency of this method and implement it in hardware, the sigmoid kernel i...Support vector machines (SVM) have been widely used in chaotic time series predictions in recent years. In order to enhance the prediction efficiency of this method and implement it in hardware, the sigmoid kernel in SVM is drawn in a more natural way by using the fuzzy logic method proposed in this paper. This method provides easy hardware implementation and straightforward interpretability. Experiments on two typical chaotic time series predictions have been carried out and the obtained results show that the average CPU time can be reduced significantly at the cost of a small decrease in prediction accuracy, which is favourable for the hardware implementation for chaotic time series prediction.展开更多
Extreme learning machine(ELM) has attracted much attention in recent years due to its fast convergence and good performance.Merging both ELM and support vector machine is an important trend,thus yielding an ELM kernel...Extreme learning machine(ELM) has attracted much attention in recent years due to its fast convergence and good performance.Merging both ELM and support vector machine is an important trend,thus yielding an ELM kernel.ELM kernel based methods are able to solve the nonlinear problems by inducing an explicit mapping compared with the commonly-used kernels such as Gaussian kernel.In this paper,the ELM kernel is extended to the least squares support vector regression(LSSVR),so ELM-LSSVR was proposed.ELM-LSSVR can be used to reduce the training and test time simultaneously without extra techniques such as sequential minimal optimization and pruning mechanism.Moreover,the memory space for the training and test was relieved.To confirm the efficacy and feasibility of the proposed ELM-LSSVR,the experiments are reported to demonstrate that ELM-LSSVR takes the advantage of training and test time with comparable accuracy to other algorithms.展开更多
The application of carbon dioxide(CO_(2)) in enhanced oil recovery(EOR) has increased significantly, in which CO_(2) solubility in oil is a key parameter in predicting CO_(2) flooding performance. Hydrocarbons are the...The application of carbon dioxide(CO_(2)) in enhanced oil recovery(EOR) has increased significantly, in which CO_(2) solubility in oil is a key parameter in predicting CO_(2) flooding performance. Hydrocarbons are the major constituents of oil, thus the focus of this work lies in investigating the solubility of CO_(2) in hydrocarbons. However, current experimental measurements are time-consuming, and equations of state can be computationally complex. To address these challenges, we developed an artificial intelligence-based model to predict the solubility of CO_(2) in hydrocarbons under varying conditions of temperature, pressure, molecular weight, and density. Using experimental data from previous studies,we trained and predicted the solubility using four machine learning models: support vector regression(SVR), extreme gradient boosting(XGBoost), random forest(RF), and multilayer perceptron(MLP).Among four models, the XGBoost model has the best predictive performance, with an R^(2) of 0.9838.Additionally, sensitivity analysis and evaluation of the relative impacts of each input parameter indicate that the prediction of CO_(2) solubility in hydrocarbons is most sensitive to pressure. Furthermore, our trained model was compared with existing models, demonstrating higher accuracy and applicability of our model. The developed machine learning-based model provides a more efficient and accurate approach for predicting CO_(2) solubility in hydrocarbons, which may contribute to the advancement of CO_(2)-related applications in the petroleum industry.展开更多
The support vector machine(SVM)is a classical machine learning method.Both the hinge loss and least absolute shrinkage and selection operator(LASSO)penalty are usually used in traditional SVMs.However,the hinge loss i...The support vector machine(SVM)is a classical machine learning method.Both the hinge loss and least absolute shrinkage and selection operator(LASSO)penalty are usually used in traditional SVMs.However,the hinge loss is not differentiable,and the LASSO penalty does not have the Oracle property.In this paper,the huberized loss is combined with non-convex penalties to obtain a model that has the advantages of both the computational simplicity and the Oracle property,contributing to higher accuracy than traditional SVMs.It is experimentally demonstrated that the two non-convex huberized-SVM methods,smoothly clipped absolute deviation huberized-SVM(SCAD-HSVM)and minimax concave penalty huberized-SVM(MCP-HSVM),outperform the traditional SVM method in terms of the prediction accuracy and classifier performance.They are also superior in terms of variable selection,especially when there is a high linear correlation between the variables.When they are applied to the prediction of listed companies,the variables that can affect and predict financial distress are accurately filtered out.Among all the indicators,the indicators per share have the greatest influence while those of solvency have the weakest influence.Listed companies can assess the financial situation with the indicators screened by our algorithm and make an early warning of their possible financial distress in advance with higher precision.展开更多
A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speec...A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.展开更多
The basic principles of the Support Vector Machine (SVM) are introduced in this paper. A specific process to establish an SVM prediction model is given. To improve the precision of coal reserve estimation, a support v...The basic principles of the Support Vector Machine (SVM) are introduced in this paper. A specific process to establish an SVM prediction model is given. To improve the precision of coal reserve estimation, a support vector machine method, based on statistical learning theory, is put forward. The SVM model was trained and tested by using the existing exploration and exploitation data of Chencun mine of Yima bureau’s as the input data. Then coal reserves within a particular region were calculated. These calculated results and the actual results of the exploration block were compared. The maximum relative error was 10.85%, within the scope of acceptable error limits. The results show that the SVM coal reserve calculation method is reliable. This method is simple, practical and valuable.展开更多
One of the most serious mining disasters in underground mines is rockburst phenomena.They can lead to injuries and even fatalities as well as damage to underground openings and mining equipment.This has forced many re...One of the most serious mining disasters in underground mines is rockburst phenomena.They can lead to injuries and even fatalities as well as damage to underground openings and mining equipment.This has forced many researchers to investigate alternative methods to predict the potential for rockburst occurrence.However,due to the highly complex relation between geological,mechanical and geometric parameters of the mining environment,the traditional mechanics-based prediction methods do not always yield precise results.With the emergence of machine learning methods,a breakthrough in the prediction of rockburst occurrence has become possible in recent years.This paper presents a state-ofthe-art review of various applications of machine learning methods for the prediction of rockburst potential.First,existing rockburst prediction methods are introduced,and the limitations of such methods are highlighted.A brief overview of typical machine learning methods and their main features as predictive tools is then presented.The current applications of machine learning models in rockburst prediction are surveyed,with related mechanisms,technical details and performance analysis.展开更多
The use of magnetic nanoparticle(MNP)-labeled immunochromatography test strips(ICTSs) is very important for point-ofcare testing(POCT). However, common diagnostic methods cannot accurately analyze the weak magnetic si...The use of magnetic nanoparticle(MNP)-labeled immunochromatography test strips(ICTSs) is very important for point-ofcare testing(POCT). However, common diagnostic methods cannot accurately analyze the weak magnetic signal from ICTSs, limiting the applications of POCT. In this study, an ultrasensitive multiplex biosensor was designed to overcome the limitations of capturing and normalization of the weak magnetic signal from MNPs on ICTSs. A machine learning model for sandwich assays was constructed and used to classify weakly positive and negative samples, which significantly enhanced the specificity and sensitivity. The potential clinical application was evaluated by detecting 50 human chorionic gonadotropin(HCG) samples and 59 myocardial infarction serum samples. The quantitative range for HCG was 1–1000 mIU mL^(-1) and the ideal detection limit was 0.014 mIU mL^(-1), which was well below the clinical threshold. Quantitative detection results of multiplex cardiac markers showed good linear correlations with standard values. The proposed multiplex assay can be readily adapted for identifying other biomolecules and also be used in other applications such as environmental monitoring, food analysis, and national security.展开更多
This paper considers a robust kernel regularized classification algorithm with a non-convex loss function which is proposed to alleviate the performance deterioration caused by the outliers.A comparison relationship b...This paper considers a robust kernel regularized classification algorithm with a non-convex loss function which is proposed to alleviate the performance deterioration caused by the outliers.A comparison relationship between the excess misclassification error and the excess generalization error is provided;from this,along with the convex analysis theory,a kind of learning rate is derived.The results show that the performance of the classifier is effected by the outliers,and the extent of impact can be controlled by choosing the homotopy parameters properly.展开更多
Conventional machine learning(CML)methods have been successfully applied for gas reservoir prediction.Their prediction accuracy largely depends on the quality of the sample data;therefore,feature optimization of the i...Conventional machine learning(CML)methods have been successfully applied for gas reservoir prediction.Their prediction accuracy largely depends on the quality of the sample data;therefore,feature optimization of the input samples is particularly important.Commonly used feature optimization methods increase the interpretability of gas reservoirs;however,their steps are cumbersome,and the selected features cannot sufficiently guide CML models to mine the intrinsic features of sample data efficiently.In contrast to CML methods,deep learning(DL)methods can directly extract the important features of targets from raw data.Therefore,this study proposes a feature optimization and gas-bearing prediction method based on a hybrid fusion model that combines a convolutional neural network(CNN)and an adaptive particle swarm optimization-least squares support vector machine(APSO-LSSVM).This model adopts an end-to-end algorithm structure to directly extract features from sensitive multicomponent seismic attributes,considerably simplifying the feature optimization.A CNN was used for feature optimization to highlight sensitive gas reservoir information.APSO-LSSVM was used to fully learn the relationship between the features extracted by the CNN to obtain the prediction results.The constructed hybrid fusion model improves gas-bearing prediction accuracy through two processes of feature optimization and intelligent prediction,giving full play to the advantages of DL and CML methods.The prediction results obtained are better than those of a single CNN model or APSO-LSSVM model.In the feature optimization process of multicomponent seismic attribute data,CNN has demonstrated better gas reservoir feature extraction capabilities than commonly used attribute optimization methods.In the prediction process,the APSO-LSSVM model can learn the gas reservoir characteristics better than the LSSVM model and has a higher prediction accuracy.The constructed CNN-APSO-LSSVM model had lower errors and a better fit on the test dataset than the other individual models.This method proves the effectiveness of DL technology for the feature extraction of gas reservoirs and provides a feasible way to combine DL and CML technologies to predict gas reservoirs.展开更多
基金financially supported by the National Natural Science Foundation of China (41774129, 41904116)the Foundation Research Project of Shaanxi Provincial Key Laboratory of Geological Support for Coal Green Exploitation (MTy2019-20)。
文摘Lithofacies identification is a crucial work in reservoir characterization and modeling.The vast inter-well area can be supplemented by facies identification of seismic data.However,the relationship between lithofacies and seismic information that is affected by many factors is complicated.Machine learning has received extensive attention in recent years,among which support vector machine(SVM) is a potential method for lithofacies classification.Lithofacies classification involves identifying various types of lithofacies and is generally a nonlinear problem,which needs to be solved by means of the kernel function.Multi-kernel learning SVM is one of the main tools for solving the nonlinear problem about multi-classification.However,it is very difficult to determine the kernel function and the parameters,which is restricted by human factors.Besides,its computational efficiency is low.A lithofacies classification method based on local deep multi-kernel learning support vector machine(LDMKL-SVM) that can consider low-dimensional global features and high-dimensional local features is developed.The method can automatically learn parameters of kernel function and SVM to build a relationship between lithofacies and seismic elastic information.The calculation speed will be expedited at no cost with respect to discriminant accuracy for multi-class lithofacies identification.Both the model data test results and the field data application results certify advantages of the method.This contribution offers an effective method for lithofacies recognition and reservoir prediction by using SVM.
基金Supported by the National Natural Science Foundation of China(60473035)~~
文摘The relationship among Mercer kernel, reproducing kernel and positive definite kernel in support vector machine (SVM) is proved and their roles in SVM are discussed. The quadratic form of the kernel matrix is used to confirm the positive definiteness and their construction. Based on the Bochner theorem, some translation invariant kernels are checked in their Fourier domain. Some rotation invariant radial kernels are inspected according to the Schoenberg theorem. Finally, the construction of discrete scaling and wavelet kernels, the kernel selection and the kernel parameter learning are discussed.
文摘Support vector machines (SVM) have been widely used in chaotic time series predictions in recent years. In order to enhance the prediction efficiency of this method and implement it in hardware, the sigmoid kernel in SVM is drawn in a more natural way by using the fuzzy logic method proposed in this paper. This method provides easy hardware implementation and straightforward interpretability. Experiments on two typical chaotic time series predictions have been carried out and the obtained results show that the average CPU time can be reduced significantly at the cost of a small decrease in prediction accuracy, which is favourable for the hardware implementation for chaotic time series prediction.
基金Sponsored by the National Natural Science Foundation of China(51006052)
文摘Extreme learning machine(ELM) has attracted much attention in recent years due to its fast convergence and good performance.Merging both ELM and support vector machine is an important trend,thus yielding an ELM kernel.ELM kernel based methods are able to solve the nonlinear problems by inducing an explicit mapping compared with the commonly-used kernels such as Gaussian kernel.In this paper,the ELM kernel is extended to the least squares support vector regression(LSSVR),so ELM-LSSVR was proposed.ELM-LSSVR can be used to reduce the training and test time simultaneously without extra techniques such as sequential minimal optimization and pruning mechanism.Moreover,the memory space for the training and test was relieved.To confirm the efficacy and feasibility of the proposed ELM-LSSVR,the experiments are reported to demonstrate that ELM-LSSVR takes the advantage of training and test time with comparable accuracy to other algorithms.
基金supported by the Fundamental Research Funds for the National Major Science and Technology Projects of China (No. 2017ZX05009-005)。
文摘The application of carbon dioxide(CO_(2)) in enhanced oil recovery(EOR) has increased significantly, in which CO_(2) solubility in oil is a key parameter in predicting CO_(2) flooding performance. Hydrocarbons are the major constituents of oil, thus the focus of this work lies in investigating the solubility of CO_(2) in hydrocarbons. However, current experimental measurements are time-consuming, and equations of state can be computationally complex. To address these challenges, we developed an artificial intelligence-based model to predict the solubility of CO_(2) in hydrocarbons under varying conditions of temperature, pressure, molecular weight, and density. Using experimental data from previous studies,we trained and predicted the solubility using four machine learning models: support vector regression(SVR), extreme gradient boosting(XGBoost), random forest(RF), and multilayer perceptron(MLP).Among four models, the XGBoost model has the best predictive performance, with an R^(2) of 0.9838.Additionally, sensitivity analysis and evaluation of the relative impacts of each input parameter indicate that the prediction of CO_(2) solubility in hydrocarbons is most sensitive to pressure. Furthermore, our trained model was compared with existing models, demonstrating higher accuracy and applicability of our model. The developed machine learning-based model provides a more efficient and accurate approach for predicting CO_(2) solubility in hydrocarbons, which may contribute to the advancement of CO_(2)-related applications in the petroleum industry.
文摘The support vector machine(SVM)is a classical machine learning method.Both the hinge loss and least absolute shrinkage and selection operator(LASSO)penalty are usually used in traditional SVMs.However,the hinge loss is not differentiable,and the LASSO penalty does not have the Oracle property.In this paper,the huberized loss is combined with non-convex penalties to obtain a model that has the advantages of both the computational simplicity and the Oracle property,contributing to higher accuracy than traditional SVMs.It is experimentally demonstrated that the two non-convex huberized-SVM methods,smoothly clipped absolute deviation huberized-SVM(SCAD-HSVM)and minimax concave penalty huberized-SVM(MCP-HSVM),outperform the traditional SVM method in terms of the prediction accuracy and classifier performance.They are also superior in terms of variable selection,especially when there is a high linear correlation between the variables.When they are applied to the prediction of listed companies,the variables that can affect and predict financial distress are accurately filtered out.Among all the indicators,the indicators per share have the greatest influence while those of solvency have the weakest influence.Listed companies can assess the financial situation with the indicators screened by our algorithm and make an early warning of their possible financial distress in advance with higher precision.
文摘A method of applying support vector machine (SVM) in speech recognition was proposed, and a speech recognition system for mandarin digits was built up by SVMs. In the system, vectors were linearly extracted from speech feature sequence to make up time-aligned input patterns for SVM, and the decisions of several 2-class SVM classifiers were employed for constructing an N-class classifier. Four kinds of SVM kernel functions were compared in the experiments of speaker-independent speech recognition of mandarin digits. And the kernel of radial basis function has the highest accurate rate of 99.33%, which is better than that of the baseline system based on hidden Markov models (HMM) (97.08%). And the experiments also show that SVM can outperform HMM especially when the samples for learning were very limited.
基金Project 072400430420 supported by the Natural Science Foundation of Henan Province
文摘The basic principles of the Support Vector Machine (SVM) are introduced in this paper. A specific process to establish an SVM prediction model is given. To improve the precision of coal reserve estimation, a support vector machine method, based on statistical learning theory, is put forward. The SVM model was trained and tested by using the existing exploration and exploitation data of Chencun mine of Yima bureau’s as the input data. Then coal reserves within a particular region were calculated. These calculated results and the actual results of the exploration block were compared. The maximum relative error was 10.85%, within the scope of acceptable error limits. The results show that the SVM coal reserve calculation method is reliable. This method is simple, practical and valuable.
文摘One of the most serious mining disasters in underground mines is rockburst phenomena.They can lead to injuries and even fatalities as well as damage to underground openings and mining equipment.This has forced many researchers to investigate alternative methods to predict the potential for rockburst occurrence.However,due to the highly complex relation between geological,mechanical and geometric parameters of the mining environment,the traditional mechanics-based prediction methods do not always yield precise results.With the emergence of machine learning methods,a breakthrough in the prediction of rockburst occurrence has become possible in recent years.This paper presents a state-ofthe-art review of various applications of machine learning methods for the prediction of rockburst potential.First,existing rockburst prediction methods are introduced,and the limitations of such methods are highlighted.A brief overview of typical machine learning methods and their main features as predictive tools is then presented.The current applications of machine learning models in rockburst prediction are surveyed,with related mechanisms,technical details and performance analysis.
基金support by the National Key Research and Development Program of China (Grant Nos. 2017FYA0205301, and 2017FYA0205303)the National Natural Science Foundation of China (Grant Nos. 81571835 and 81672247)+3 种基金National Key Research and Development Program of China (No. 2017YFA0205303)National Key Basic Research Program (973 Project) (No. 2015CB931802)"13th Five-Year Plan" Science and Technology Project of Jilin Province Education Department (No. JJKH20170410K)Shanghai Science and Technology Fund (No. 15DZ2252000)
文摘The use of magnetic nanoparticle(MNP)-labeled immunochromatography test strips(ICTSs) is very important for point-ofcare testing(POCT). However, common diagnostic methods cannot accurately analyze the weak magnetic signal from ICTSs, limiting the applications of POCT. In this study, an ultrasensitive multiplex biosensor was designed to overcome the limitations of capturing and normalization of the weak magnetic signal from MNPs on ICTSs. A machine learning model for sandwich assays was constructed and used to classify weakly positive and negative samples, which significantly enhanced the specificity and sensitivity. The potential clinical application was evaluated by detecting 50 human chorionic gonadotropin(HCG) samples and 59 myocardial infarction serum samples. The quantitative range for HCG was 1–1000 mIU mL^(-1) and the ideal detection limit was 0.014 mIU mL^(-1), which was well below the clinical threshold. Quantitative detection results of multiplex cardiac markers showed good linear correlations with standard values. The proposed multiplex assay can be readily adapted for identifying other biomolecules and also be used in other applications such as environmental monitoring, food analysis, and national security.
基金supported by the NSF(61877039)the NSFC/RGC Joint Research Scheme(12061160462 and N City U 102/20)of China+2 种基金the NSF(LY19F020013)of Zhejiang Provincethe Special Project for Scientific and Technological Cooperation(20212BDH80021)of Jiangxi Provincethe Science and Technology Project in Jiangxi Province Department of Education(GJJ211334)。
文摘This paper considers a robust kernel regularized classification algorithm with a non-convex loss function which is proposed to alleviate the performance deterioration caused by the outliers.A comparison relationship between the excess misclassification error and the excess generalization error is provided;from this,along with the convex analysis theory,a kind of learning rate is derived.The results show that the performance of the classifier is effected by the outliers,and the extent of impact can be controlled by choosing the homotopy parameters properly.
基金funded by the Natural Science Foundation of Shandong Province (ZR2021MD061ZR2023QD025)+3 种基金China Postdoctoral Science Foundation (2022M721972)National Natural Science Foundation of China (41174098)Young Talents Foundation of Inner Mongolia University (10000-23112101/055)Qingdao Postdoctoral Science Foundation (QDBSH20230102094)。
文摘Conventional machine learning(CML)methods have been successfully applied for gas reservoir prediction.Their prediction accuracy largely depends on the quality of the sample data;therefore,feature optimization of the input samples is particularly important.Commonly used feature optimization methods increase the interpretability of gas reservoirs;however,their steps are cumbersome,and the selected features cannot sufficiently guide CML models to mine the intrinsic features of sample data efficiently.In contrast to CML methods,deep learning(DL)methods can directly extract the important features of targets from raw data.Therefore,this study proposes a feature optimization and gas-bearing prediction method based on a hybrid fusion model that combines a convolutional neural network(CNN)and an adaptive particle swarm optimization-least squares support vector machine(APSO-LSSVM).This model adopts an end-to-end algorithm structure to directly extract features from sensitive multicomponent seismic attributes,considerably simplifying the feature optimization.A CNN was used for feature optimization to highlight sensitive gas reservoir information.APSO-LSSVM was used to fully learn the relationship between the features extracted by the CNN to obtain the prediction results.The constructed hybrid fusion model improves gas-bearing prediction accuracy through two processes of feature optimization and intelligent prediction,giving full play to the advantages of DL and CML methods.The prediction results obtained are better than those of a single CNN model or APSO-LSSVM model.In the feature optimization process of multicomponent seismic attribute data,CNN has demonstrated better gas reservoir feature extraction capabilities than commonly used attribute optimization methods.In the prediction process,the APSO-LSSVM model can learn the gas reservoir characteristics better than the LSSVM model and has a higher prediction accuracy.The constructed CNN-APSO-LSSVM model had lower errors and a better fit on the test dataset than the other individual models.This method proves the effectiveness of DL technology for the feature extraction of gas reservoirs and provides a feasible way to combine DL and CML technologies to predict gas reservoirs.