Traditional coal mine safety prediction methods are off-line and do not have dynamic prediction functions.The Support Vector Machine(SVM) is a new machine learning algorithm that has excellent properties.The least squ...Traditional coal mine safety prediction methods are off-line and do not have dynamic prediction functions.The Support Vector Machine(SVM) is a new machine learning algorithm that has excellent properties.The least squares support vector machine(LS-SVM) algorithm is an improved algorithm of SVM.But the common LS-SVM algorithm,used directly in safety predictions,has some problems.We have first studied gas prediction problems and the basic theory of LS-SVM.Given these problems,we have investigated the affect of the time factor about safety prediction and present an on-line prediction algorithm,based on LS-SVM.Finally,given our observed data,we used the on-line algorithm to predict gas emissions and used other related algorithm to compare its performance.The simulation results have verified the validity of the new algorithm.展开更多
With rising capacity demand in mobile networks, the infrastructure is also becoming increasingly denser and complex. This results in collection of larger amount of raw data(big data) that is generated at different lev...With rising capacity demand in mobile networks, the infrastructure is also becoming increasingly denser and complex. This results in collection of larger amount of raw data(big data) that is generated at different levels of network architecture and is typically underutilized. To unleash its full value, innovative machine learning algorithms need to be utilized in order to extract valuable insights which can be used for improving the overall network's performance. Additionally, a major challenge for network operators is to cope up with increasing number of complete(or partial) cell outages and to simultaneously reduce operational expenditure. This paper contributes towards the aforementioned problems by exploiting big data generated from the core network of 4 G LTE-A to detect network's anomalous behavior. We present a semi-supervised statistical-based anomaly detection technique to identify in time: first, unusually low user activity region depicting sleeping cell, which is a special case of cell outage; and second, unusually high user traffic area corresponding to a situation where special action such as additional resource allocation, fault avoidance solution etc. may be needed. Achieved results demonstrate that the proposed method can be used for timely and reliable anomaly detection in current and future cellular networks.展开更多
In the past two decades, software aging has been studied by both academic and industry communities. Many scholars focused on analytical methods or time series to model software aging process. While machine learning ha...In the past two decades, software aging has been studied by both academic and industry communities. Many scholars focused on analytical methods or time series to model software aging process. While machine learning has been shown as a very promising technique in application to forecast software state: normal or aging. In this paper, we proposed a method which can give practice guide to forecast software aging using machine learning algorithm. Firstly, we collected data from a running commercial web server and preprocessed these data. Secondly, feature selection algorithm was applied to find a subset of model parameters set. Thirdly, time series model was used to predict values of selected parameters in advance. Fourthly, some machine learning algorithms were used to model software aging process and to predict software aging. Fifthly, we used sensitivity analysis to analyze how heavily outcomes changed following input variables change. In the last, we applied our method to an IIS web server. Through analysis of the experiment results, we find that our proposed method can predict software aging in the early stage of system development life cycle.展开更多
Electromagnetic Radiation Source Identification(ERSI) is a key technology that is widely used in military and radiation management and in electromagnetic interference diagnostics.The discriminative capability of machi...Electromagnetic Radiation Source Identification(ERSI) is a key technology that is widely used in military and radiation management and in electromagnetic interference diagnostics.The discriminative capability of machine learning methods has recently been used for facilitating ERSI.This paper presents a new approach to improve ERSI by adopting support vector machines,which are proven to be effective tools in pattern classification and regression,on the basis of the spatial distribution of electromagnetic radiation sources.Spatial information is converted from 3D cubes to 1D vectors with subscripts as inputs in order to simplify the model.The model is trained with 187 500 data sets in order to enable it to identify the types of radiation source types with an accuracy of up to 99.9%.The influence of parameters(e.g.,penalty parameter,reflection and noise from the ambient environment,and the scaling method for the input data) are discussed.The proposed method has good performance in noisy and reverberant environment.It has an identification accuracy of 82.15% when the signal-to-noise ratio is 20 dB.The proposed method has better accuracy in a noisy environment than artificial neural networks.Given that each Electromagnetic(EM) source has unique spatial characteristics,this method can be used for EM source identification and EM interference diagnostics.展开更多
Ovarian cancer is one of the three most common gynecological cancers in the world,and is regarded as a priority in terms of women’s cancer.In the past few years,many researchers have attempted to develop and apply ar...Ovarian cancer is one of the three most common gynecological cancers in the world,and is regarded as a priority in terms of women’s cancer.In the past few years,many researchers have attempted to develop and apply artificial intelligence(AI)techniques to multiple clinical scenarios of ovarian cancer,especially in the field of medical imaging.AI-assisted imaging studies have involved computer tomography(CT),ultrasonography(US),and magnetic resonance imaging(MRI).In this review,we perform a literature search on the published studies that using AI techniques in the medical care of ovarian cancer,and bring up the advances in terms of four clinical aspects,including medical diagnosis,pathological classification,targeted biopsy guidance,and prognosis prediction.Meanwhile,current status and existing issues of the researches on AI application in ovarian cancer are discussed.展开更多
Diagnosis and treatment of breast cancer have been improved during the last decade; however, breast cancer is still a leading cause of death among women in the whole world. Early detection and accurate diagnosis of th...Diagnosis and treatment of breast cancer have been improved during the last decade; however, breast cancer is still a leading cause of death among women in the whole world. Early detection and accurate diagnosis of this disease has been demonstrated an approach to long survival of the patients. As an attempt to develop a reliable diagnosing method for breast cancer, we integrated support vector machine (SVM), k-nearest neighbor and probabilistic neural network into a complex machine learning approach to detect malignant breast tumour through a set of indicators consisting of age and ten cellular features of fine-needle aspiration of breast which were ranked according to signal-to-noise ratio to identify determinants distinguishing benign breast tumours from malignant ones. The method turned out to significantly improve the diagnosis, with a sensitivity of 94.04%, a specificity of 97.37%, and an overall accuracy up to 96.24% when SVM was adopted with the sigmoid kernel function under 5-fold cross validation. The results suggest that SVM is a promising methodology to be further developed into a practical adjunct implement to help discerning benign and malignant breast tumours and thus reduce the incidence of misdiagnosis.展开更多
Nowadays, machine learning is widely used in malware detection system as a core component. The machine learning algorithm is designed under the assumption that all datasets follow the same underlying data distribution...Nowadays, machine learning is widely used in malware detection system as a core component. The machine learning algorithm is designed under the assumption that all datasets follow the same underlying data distribution. But the real-world malware data distribution is not stable and changes with time. By exploiting the knowledge of the machine learning algorithm and malware data concept drift problem, we show a novel learning evasive botnet architecture and a stealthy and secure C&C mechanism. Based on the email communication channel, we construct a stealthy email-based P2 P-like botnet that exploit the excellent reputation of email servers and a huge amount of benign email communication in the same channel. The experiment results show horizontal correlation learning algorithm is difficult to separate malicious email traffic from normal email traffic based on the volume features and time-related features with enough confidence. We discuss the malware data concept drift and possible defense strategies.展开更多
In this paper, we present Real-Time Flow Filter (RTFF) -a system that adopts a middle ground between coarse-grained volume anomaly detection and deep packet inspection. RTFF was designed with the goal of scaling to hi...In this paper, we present Real-Time Flow Filter (RTFF) -a system that adopts a middle ground between coarse-grained volume anomaly detection and deep packet inspection. RTFF was designed with the goal of scaling to high volume data feeds that are common in large Tier-1 ISP networks and providing rich, timely information on observed attacks. It is a software solution that is designed to run on off-the-shelf hardware platforms and incorporates a scalable data processing architecture along with lightweight analysis algorithms that make it suitable for deployment in large networks. RTFF also makes use of state of the art machine learning algorithms to construct attack models that can be used to detect as well as predict attacks.展开更多
To prevent possible accidents,the study of data-driven analytics to predict hidden dangers in cloud service-based intelligent industrial production management has been the subject of increasing interest recently.A mac...To prevent possible accidents,the study of data-driven analytics to predict hidden dangers in cloud service-based intelligent industrial production management has been the subject of increasing interest recently.A machine learning algorithm that uses timeliness managing extreme learning machine is utilized in this article to achieve the above prediction.Compared with traditional learning algorithms,extreme learning machine(ELM) exhibits high performance because of its unique feature of a high generalization capability at a fast learning speed.Timeliness managing ELM is proposed by incorporating timeliness management scheme into ELM.When using the timeliness managing ELM scheme to predict hidden dangers,newly incremental data could be added prior to the historical data to maximize the contribution of the newly incremental training data,because the incremental data may be able to contribute reasonable weights to represent the current production situation according to practical analysis of accidents in some industrial productions.Experimental results from a coal mine show that the use of timeliness managing ELM can improve the prediction accuracy of hidden dangers with better stability compared with other similar machine learning methods.展开更多
文摘Traditional coal mine safety prediction methods are off-line and do not have dynamic prediction functions.The Support Vector Machine(SVM) is a new machine learning algorithm that has excellent properties.The least squares support vector machine(LS-SVM) algorithm is an improved algorithm of SVM.But the common LS-SVM algorithm,used directly in safety predictions,has some problems.We have first studied gas prediction problems and the basic theory of LS-SVM.Given these problems,we have investigated the affect of the time factor about safety prediction and present an on-line prediction algorithm,based on LS-SVM.Finally,given our observed data,we used the on-line algorithm to predict gas emissions and used other related algorithm to compare its performance.The simulation results have verified the validity of the new algorithm.
基金supported in part by the National Natural Science Foundation of China under the Grants No.61431011 and 61671371the National Science and Technology Major Project under Grant no.2016ZX03001016-005+1 种基金the Key Research and Development Program of Shaanxi Province under Grant No.2017ZDXM-G-Y-012the Fundamental Research Funds for the Central Universities
文摘With rising capacity demand in mobile networks, the infrastructure is also becoming increasingly denser and complex. This results in collection of larger amount of raw data(big data) that is generated at different levels of network architecture and is typically underutilized. To unleash its full value, innovative machine learning algorithms need to be utilized in order to extract valuable insights which can be used for improving the overall network's performance. Additionally, a major challenge for network operators is to cope up with increasing number of complete(or partial) cell outages and to simultaneously reduce operational expenditure. This paper contributes towards the aforementioned problems by exploiting big data generated from the core network of 4 G LTE-A to detect network's anomalous behavior. We present a semi-supervised statistical-based anomaly detection technique to identify in time: first, unusually low user activity region depicting sleeping cell, which is a special case of cell outage; and second, unusually high user traffic area corresponding to a situation where special action such as additional resource allocation, fault avoidance solution etc. may be needed. Achieved results demonstrate that the proposed method can be used for timely and reliable anomaly detection in current and future cellular networks.
基金supported by the grants from Natural Science Foundation of China(Project No.61375045)the joint astronomic fund of the national natural science foundation of China and Chinese Academic Sinica(Project No.U1531242)Beijing Natural Science Foundation(4142030)
文摘In the past two decades, software aging has been studied by both academic and industry communities. Many scholars focused on analytical methods or time series to model software aging process. While machine learning has been shown as a very promising technique in application to forecast software state: normal or aging. In this paper, we proposed a method which can give practice guide to forecast software aging using machine learning algorithm. Firstly, we collected data from a running commercial web server and preprocessed these data. Secondly, feature selection algorithm was applied to find a subset of model parameters set. Thirdly, time series model was used to predict values of selected parameters in advance. Fourthly, some machine learning algorithms were used to model software aging process and to predict software aging. Fifthly, we used sensitivity analysis to analyze how heavily outcomes changed following input variables change. In the last, we applied our method to an IIS web server. Through analysis of the experiment results, we find that our proposed method can predict software aging in the early stage of system development life cycle.
基金supported by the National Natural Science Foundation of China under Grant No.61201024
文摘Electromagnetic Radiation Source Identification(ERSI) is a key technology that is widely used in military and radiation management and in electromagnetic interference diagnostics.The discriminative capability of machine learning methods has recently been used for facilitating ERSI.This paper presents a new approach to improve ERSI by adopting support vector machines,which are proven to be effective tools in pattern classification and regression,on the basis of the spatial distribution of electromagnetic radiation sources.Spatial information is converted from 3D cubes to 1D vectors with subscripts as inputs in order to simplify the model.The model is trained with 187 500 data sets in order to enable it to identify the types of radiation source types with an accuracy of up to 99.9%.The influence of parameters(e.g.,penalty parameter,reflection and noise from the ambient environment,and the scaling method for the input data) are discussed.The proposed method has good performance in noisy and reverberant environment.It has an identification accuracy of 82.15% when the signal-to-noise ratio is 20 dB.The proposed method has better accuracy in a noisy environment than artificial neural networks.Given that each Electromagnetic(EM) source has unique spatial characteristics,this method can be used for EM source identification and EM interference diagnostics.
文摘Ovarian cancer is one of the three most common gynecological cancers in the world,and is regarded as a priority in terms of women’s cancer.In the past few years,many researchers have attempted to develop and apply artificial intelligence(AI)techniques to multiple clinical scenarios of ovarian cancer,especially in the field of medical imaging.AI-assisted imaging studies have involved computer tomography(CT),ultrasonography(US),and magnetic resonance imaging(MRI).In this review,we perform a literature search on the published studies that using AI techniques in the medical care of ovarian cancer,and bring up the advances in terms of four clinical aspects,including medical diagnosis,pathological classification,targeted biopsy guidance,and prognosis prediction.Meanwhile,current status and existing issues of the researches on AI application in ovarian cancer are discussed.
基金Joint Research Project Between Chongqing University and National University of Singapore (No. ARF-151-000-014-112)the Basic Research & Applied Basic Research Program of Chongqing University (No.71341103)Natural Science Foundation of Chongqing S & T Committee(No. CSTC,2006BB5240)
文摘Diagnosis and treatment of breast cancer have been improved during the last decade; however, breast cancer is still a leading cause of death among women in the whole world. Early detection and accurate diagnosis of this disease has been demonstrated an approach to long survival of the patients. As an attempt to develop a reliable diagnosing method for breast cancer, we integrated support vector machine (SVM), k-nearest neighbor and probabilistic neural network into a complex machine learning approach to detect malignant breast tumour through a set of indicators consisting of age and ten cellular features of fine-needle aspiration of breast which were ranked according to signal-to-noise ratio to identify determinants distinguishing benign breast tumours from malignant ones. The method turned out to significantly improve the diagnosis, with a sensitivity of 94.04%, a specificity of 97.37%, and an overall accuracy up to 96.24% when SVM was adopted with the sigmoid kernel function under 5-fold cross validation. The results suggest that SVM is a promising methodology to be further developed into a practical adjunct implement to help discerning benign and malignant breast tumours and thus reduce the incidence of misdiagnosis.
基金the National Key Basic Research Program of China (Grant: 2013CB834204)the National Natural Science Foundation of China (Grant: 61300242, 61772291)+1 种基金the Tianjin Research Program of Application Foundation and Advanced Technology (Grant: 15JCQNJC41500, 17JCZDJC30500)the Open Project Foundation of Information Security Evaluation Center of Civil Aviation, Civil Aviation University of China (Grant: CAAC-ISECCA- 201701, CAAC-ISECCA-201702)
文摘Nowadays, machine learning is widely used in malware detection system as a core component. The machine learning algorithm is designed under the assumption that all datasets follow the same underlying data distribution. But the real-world malware data distribution is not stable and changes with time. By exploiting the knowledge of the machine learning algorithm and malware data concept drift problem, we show a novel learning evasive botnet architecture and a stealthy and secure C&C mechanism. Based on the email communication channel, we construct a stealthy email-based P2 P-like botnet that exploit the excellent reputation of email servers and a huge amount of benign email communication in the same channel. The experiment results show horizontal correlation learning algorithm is difficult to separate malicious email traffic from normal email traffic based on the volume features and time-related features with enough confidence. We discuss the malware data concept drift and possible defense strategies.
文摘In this paper, we present Real-Time Flow Filter (RTFF) -a system that adopts a middle ground between coarse-grained volume anomaly detection and deep packet inspection. RTFF was designed with the goal of scaling to high volume data feeds that are common in large Tier-1 ISP networks and providing rich, timely information on observed attacks. It is a software solution that is designed to run on off-the-shelf hardware platforms and incorporates a scalable data processing architecture along with lightweight analysis algorithms that make it suitable for deployment in large networks. RTFF also makes use of state of the art machine learning algorithms to construct attack models that can be used to detect as well as predict attacks.
基金partially supported by the National Key Technologies R&D Program of China under Grant No.2015BAK38B01the National Natural Science Foundation of China under Grant Nos.61174103 and 61272357the Fundamental Research Funds for the Central Universities under Grant No.06500025
文摘To prevent possible accidents,the study of data-driven analytics to predict hidden dangers in cloud service-based intelligent industrial production management has been the subject of increasing interest recently.A machine learning algorithm that uses timeliness managing extreme learning machine is utilized in this article to achieve the above prediction.Compared with traditional learning algorithms,extreme learning machine(ELM) exhibits high performance because of its unique feature of a high generalization capability at a fast learning speed.Timeliness managing ELM is proposed by incorporating timeliness management scheme into ELM.When using the timeliness managing ELM scheme to predict hidden dangers,newly incremental data could be added prior to the historical data to maximize the contribution of the newly incremental training data,because the incremental data may be able to contribute reasonable weights to represent the current production situation according to practical analysis of accidents in some industrial productions.Experimental results from a coal mine show that the use of timeliness managing ELM can improve the prediction accuracy of hidden dangers with better stability compared with other similar machine learning methods.