期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
Classification of aviation incident causes using LGBM with improved cross-validation 被引量:1
1
作者 NI Xiaomei WANG Huawei +1 位作者 CHEN Lingzi LIN Ruiguan 《Journal of Systems Engineering and Electronics》 SCIE CSCD 2024年第2期396-405,共10页
Aviation accidents are currently one of the leading causes of significant injuries and deaths worldwide. This entices researchers to investigate aircraft safety using data analysis approaches based on an advanced mach... Aviation accidents are currently one of the leading causes of significant injuries and deaths worldwide. This entices researchers to investigate aircraft safety using data analysis approaches based on an advanced machine learning algorithm.To assess aviation safety and identify the causes of incidents, a classification model with light gradient boosting machine (LGBM)based on the aviation safety reporting system (ASRS) has been developed. It is improved by k-fold cross-validation with hybrid sampling model (HSCV), which may boost classification performance and maintain data balance. The results show that employing the LGBM-HSCV model can significantly improve accuracy while alleviating data imbalance. Vertical comparison with other cross-validation (CV) methods and lateral comparison with different fold times comprise the comparative approach. Aside from the comparison, two further CV approaches based on the improved method in this study are discussed:one with a different sampling and folding order, and the other with more CV. According to the assessment indices with different methods, the LGBMHSCV model proposed here is effective at detecting incident causes. The improved model for imbalanced data categorization proposed may serve as a point of reference for similar data processing, and the model’s accurate identification of civil aviation incident causes can assist to improve civil aviation safety. 展开更多
关键词 aviation safety imbalance data light gradient boosting machine(LGBM) cross-validation(CV)
在线阅读 下载PDF
基于Cross-Validation的小波自适应去噪方法 被引量:5
2
作者 黄文清 戴瑜兴 李加升 《湖南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2008年第11期40-43,共4页
小波去噪算法中,阈值的选择非常关键.提出一种自适应阈值选择算法.该算法先通过Cross-Validation方法将噪声干扰信号分成两个子信号,一个用于阈值处理,一个用作参考信号;再采用最深梯度法来寻求一个最优去噪阈值.仿真和实验结果表明:在... 小波去噪算法中,阈值的选择非常关键.提出一种自适应阈值选择算法.该算法先通过Cross-Validation方法将噪声干扰信号分成两个子信号,一个用于阈值处理,一个用作参考信号;再采用最深梯度法来寻求一个最优去噪阈值.仿真和实验结果表明:在均方误差意义上,所提算法去噪效果优于Donoho等提出的VisuShrink和SureShrink两种去噪算法,且不需要带噪信号的任何'先验信息',适应于实际信号去噪处理. 展开更多
关键词 小波变换 cross-validation 自适应滤波 阈值
在线阅读 下载PDF
Using Multiple Risk Factors and Generalized Linear Mixed Models with 5-Fold Cross-Validation Strategy for Optimal Carotid Plaque Progression Prediction
3
作者 Qingyu Wang Dalin Tang +5 位作者 Liang Wang Gador Canton Zheyang Wu Thomas SHatsukami Kristen L Billiar Chun Yuan 《医用生物力学》 EI CAS CSCD 北大核心 2019年第A01期74-75,共2页
Background Cardiovascular diseases are closely linked to atherosclerotic plaque development and rupture.Plaque progression prediction is of fundamental significance to cardiovascular research and disease diagnosis,pre... Background Cardiovascular diseases are closely linked to atherosclerotic plaque development and rupture.Plaque progression prediction is of fundamental significance to cardiovascular research and disease diagnosis,prevention,and treatment.Generalized linear mixed models(GLMM)is an extension of linear model for categorical responses while considering the correlation among observations.Methods Magnetic resonance image(MRI)data of carotid atheroscleroticplaques were acquired from 20 patients with consent obtained and 3D thin-layer models were constructed to calculate plaque stress and strain for plaque progression prediction.Data for ten morphological and biomechanical risk factors included wall thickness(WT),lipid percent(LP),minimum cap thickness(MinCT),plaque area(PA),plaque burden(PB),lumen area(LA),maximum plaque wall stress(MPWS),maximum plaque wall strain(MPWSn),average plaque wall stress(APWS),and average plaque wall strain(APWSn)were extracted from all slices for analysis.Wall thickness increase(WTI),plaque burden increase(PBI)and plaque area increase(PAI) were chosen as three measures for plaque progression.Generalized linear mixed models(GLMM)with 5-fold cross-validation strategy were used to calculate prediction accuracy for each predictor and identify optimal predictor with the highest prediction accuracy defined as sum of sensitivity and specificity.All 201 MRI slices were randomly divided into 4 training subgroups and 1 verification subgroup.The training subgroups were used for model fitting,and the verification subgroup was used to estimate the model.All combinations(total1023)of 10 risk factors were feed to GLMM and the prediction accuracy of each predictor were selected from the point on the ROC(receiver operating characteristic)curve with the highest sum of specificity and sensitivity.Results LA was the best single predictor for PBI with the highest prediction accuracy(1.360 1),and the area under of the ROC curve(AUC)is0.654 0,followed by APWSn(1.336 3)with AUC=0.6342.The optimal predictor among all possible combinations for PBI was the combination of LA,PA,LP,WT,MPWS and MPWSn with prediction accuracy=1.414 6(AUC=0.715 8).LA was once again the best single predictor for PAI with the highest prediction accuracy(1.184 6)with AUC=0.606 4,followed by MPWSn(1. 183 2)with AUC=0.6084.The combination of PA,PB,WT,MPWS,MPWSn and APWSn gave the best prediction accuracy(1.302 5)for PAI,and the AUC value is 0.6657.PA was the best single predictor for WTI with highest prediction accuracy(1.288 7)with AUC=0.641 5,followed by WT(1.254 0),with AUC=0.6097.The combination of PA,PB,WT,LP,MinCT,MPWS and MPWS was the best predictor for WTI with prediction accuracy as 1.314 0,with AUC=0.6552.This indicated that PBI was a more predictable measure than WTI and PAI. The combinational predictors improved prediction accuracy by 9.95%,4.01%and 1.96%over the best single predictors for PAI,PBI and WTI(AUC values improved by9.78%,9.45%,and 2.14%),respectively.Conclusions The use of GLMM with 5-fold cross-validation strategy combining both morphological and biomechanical risk factors could potentially improve the accuracy of carotid plaque progression prediction.This study suggests that a linear combination of multiple predictors can provide potential improvement to existing plaque assessment schemes. 展开更多
关键词 Multiple Risk FACTORS GENERALIZED Linear 5-Fold cross-validation STRATEGY AUC
在线阅读 下载PDF
Risk assessment of rockburst using SMOTE oversampling and integration algorithms under GBDT framework 被引量:1
4
作者 WANG Jia-chuang DONG Long-jun 《Journal of Central South University》 SCIE EI CAS CSCD 2024年第8期2891-2915,共25页
Rockburst is a common geological disaster in underground engineering,which seriously threatens the safety of personnel,equipment and property.Utilizing machine learning models to evaluate risk of rockburst is graduall... Rockburst is a common geological disaster in underground engineering,which seriously threatens the safety of personnel,equipment and property.Utilizing machine learning models to evaluate risk of rockburst is gradually becoming a trend.In this study,the integrated algorithms under Gradient Boosting Decision Tree(GBDT)framework were used to evaluate and classify rockburst intensity.First,a total of 301 rock burst data samples were obtained from a case database,and the data were preprocessed using synthetic minority over-sampling technique(SMOTE).Then,the rockburst evaluation models including GBDT,eXtreme Gradient Boosting(XGBoost),Light Gradient Boosting Machine(LightGBM),and Categorical Features Gradient Boosting(CatBoost)were established,and the optimal hyperparameters of the models were obtained through random search grid and five-fold cross-validation.Afterwards,use the optimal hyperparameter configuration to fit the evaluation models,and analyze these models using test set.In order to evaluate the performance,metrics including accuracy,precision,recall,and F1-score were selected to analyze and compare with other machine learning models.Finally,the trained models were used to conduct rock burst risk assessment on rock samples from a mine in Shanxi Province,China,and providing theoretical guidance for the mine's safe production work.The models under the GBDT framework perform well in the evaluation of rockburst levels,and the proposed methods can provide a reliable reference for rockburst risk level analysis and safety management. 展开更多
关键词 rockburst evaluation SMOTE oversampling random search grid K-fold cross-validation confusion matrix
在线阅读 下载PDF
直方图理论与最优直方图制作 被引量:27
5
作者 张建方 王秀祥 《应用概率统计》 CSCD 北大核心 2009年第2期201-214,共14页
直方图是一种最为常见的密度估计和数据分析工具.在直方图理论和制作过程中,组距的选择和边界点的确定尤为重要.然而,许多学者对这两个参数的选择仍然采用经验的方法,甚至现在大多数统计软件在确定直方图分组数时也是默认采用粗略的计... 直方图是一种最为常见的密度估计和数据分析工具.在直方图理论和制作过程中,组距的选择和边界点的确定尤为重要.然而,许多学者对这两个参数的选择仍然采用经验的方法,甚至现在大多数统计软件在确定直方图分组数时也是默认采用粗略的计算公式.本文主要介绍直方图理论和最优直方图制作的最新研究成果,强调面向样本的最优直方图制作方法. 展开更多
关键词 直方图 Sturges公式 Scott公式 cross-validation Histogram-Kernel ERROR 误差平方和
在线阅读 下载PDF
基于ANFIS和Elman网络的信用评价研究 被引量:8
6
作者 梁樑 吴德胜 +2 位作者 王志强 熊立 王国华 《管理工程学报》 CSSCI 2005年第1期69-73,共5页
BP神经网络用作信用等级分类可取得较好的效果,但在过分要求输出信用分值时效果不佳。针对该缺陷,本文采用自适应神经网络(ANFIS)和Elman网络研究公司信用评分。文中提出了一套甄选方法准则,用于建立适合我国企业的信用评分指标体系;然... BP神经网络用作信用等级分类可取得较好的效果,但在过分要求输出信用分值时效果不佳。针对该缺陷,本文采用自适应神经网络(ANFIS)和Elman网络研究公司信用评分。文中提出了一套甄选方法准则,用于建立适合我国企业的信用评分指标体系;然后依据该指标体系建立了基于Elman网络和ANFIS的信用评估模型;采用V foldCross validation技巧,利用样本公司实际指标数据对该模型的评分效果进行了实证研究。 展开更多
关键词 信用评分 自适应神经模糊推理 ELMAN网络 V-fold cross-validation技巧 主成分分析
在线阅读 下载PDF
不同模型在信用评价中的比较研究 被引量:8
7
作者 吴德胜 梁樑 杨力 《预测》 CSSCI 2004年第2期73-76,69,共5页
比较了不同模型应用于企业信用评价问题的优劣,针对信用评分问题特点,采用Elman回归神经网络和BP网络建模。在建立了适合于我国企业的信用评分指标体系之后,运用以上两种方法进行实证研究并比较两种网络的诊断行为;为克服小样本建模的缺... 比较了不同模型应用于企业信用评价问题的优劣,针对信用评分问题特点,采用Elman回归神经网络和BP网络建模。在建立了适合于我国企业的信用评分指标体系之后,运用以上两种方法进行实证研究并比较两种网络的诊断行为;为克服小样本建模的缺点,引进V foldCross validation计算技巧。 展开更多
关键词 ELMAN神经网络 BP神经网络 V-fold cross-validation技巧 信用评分
在线阅读 下载PDF
基于支持向量机的机械故障特征选择方法研究 被引量:4
8
作者 王新峰 邱静 刘冠军 《机械科学与技术》 CSCD 北大核心 2005年第9期1122-1125,共4页
在机械故障诊断中,对机器状态信号进行处理可得到故障特征集。但是此特征集中通常含有冗余特征而影响诊断效果。特征选择可以去除原始特征中的冗余特征,提高诊断精度和诊断效率。本文提出采用支持向量机(SVM)作为决策分类器,研究了使用... 在机械故障诊断中,对机器状态信号进行处理可得到故障特征集。但是此特征集中通常含有冗余特征而影响诊断效果。特征选择可以去除原始特征中的冗余特征,提高诊断精度和诊断效率。本文提出采用支持向量机(SVM)作为决策分类器,研究了使用SVM的错误上界如半径-间距上界代替学习错误率作为特征性能评价,并且使用遗传算法对特征集进行寻优的特征选择方法。此方法由于只需要训练一次SVM,相比常用的分组轮换方法有较高的计算效率。数值仿真和减速器的轴承故障特征选择试验中,采用此方法对生成特征集进行选择,并与常用的分组轮换法进行了对比。结果显示此方法有较好的选择性能和选择效率。 展开更多
关键词 特征选择 分组轮换法(cross-validation) 支持向量机(SVM) 半径-间距上界 遗传算法
在线阅读 下载PDF
Real-time Prediction Model of Amount of Manure in Winter Pig Pen Based on Backpropagation Neural Network 被引量:1
9
作者 Hu Zhen-nan Sun Hong-min +3 位作者 Li Xiao-ming Dai Bai-sheng Gao Yue Wang Yu-han 《Journal of Northeast Agricultural University(English Edition)》 CAS 2022年第4期77-90,共14页
The automatic control of cleaning need to be based on the total amount of manure in the house. Therefore, this article established a prediction model for the total amount of manure in a pig house and took the number o... The automatic control of cleaning need to be based on the total amount of manure in the house. Therefore, this article established a prediction model for the total amount of manure in a pig house and took the number of pigs in the house, age, feed intake,feeding time, the time when the ammonia concentration increased the fastest and the daily fixed cleaning time as variable factors for modelling, so that the model could obtain the current manure output according to the real-time input of time. A Backpropagation(BP) neural network was used for training. The cross-validation method was used to select the best hyperparameters, and the genetic algorithm(GA), particle swarm optimization(PSO) algorithm and mind evolutionary algorithm(MEA) were selected to optimize the initial network weights. The results showed that the model could predict the amount of manure in real-time according to the model input. After the cross-validation method determined the hyperparameters, the GA, PSO and MEA were used to optimize the manure prediction model. The GA had the best average performance. 展开更多
关键词 manure amount BP neural network weight optimization algorithm cross-validation
在线阅读 下载PDF
软计算信用分析方法比较探讨 被引量:1
10
作者 梁 吴德胜 《系统工程理论方法应用》 北大核心 2005年第3期268-274,共7页
研究了信用分析中的软计算方法——BP网络与自适应神经模糊推理,并对软计算方法与DEA、TOPSIS的结合运用作了进一步探讨。采用V-foldCross-validation技巧,利用样本公司实际指标数据对该模型的评分效果进行了实证研究。
关键词 信用评分 自适应神经模糊推理 BP神经网络 V-fold cross-validation技巧 软计算 数据包络分析
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部