The paper presents a cognitive science framework for the analysis of knowledge-based systems,including people, media. simulation and expert systems, resulting in a practical model for the procedures ofknowledge engine...The paper presents a cognitive science framework for the analysis of knowledge-based systems,including people, media. simulation and expert systems, resulting in a practical model for the procedures ofknowledge engineering. Starting with the construct of a social organization model driven by anticipationand thed differentiating this into pesonal scientists with diverse relations to people and their internal andexternal communication, it provides powerful and general model of society. people, and the roles of peoplein society. This model extends naturally ic the role of conventional media in the knowledge processes ofsociety and the new roles of computer-based simulation and expert systems. In particular it provides amodel of knowledge transfer that enables the processes of knowledge engineering to be analyzed andautomated.展开更多
In this paper, the structure characteristics of open complex giant systems are concretely analysed in depth, thus the view and its significance to support the meta synthesis engineering with manifold knowledge models...In this paper, the structure characteristics of open complex giant systems are concretely analysed in depth, thus the view and its significance to support the meta synthesis engineering with manifold knowledge models are clarified. Furthermore, the knowledge based multifaceted modeling methodology for open complex giant systems is emphatically studied. The major points are as follows: (1) nonlinear mechanism and general information partition law; (2) from the symmetry and similarity to the acquisition of construction knowledge; (3) structures for hierarchical and nonhierarchical organizations; (4) the integration of manifold knowledge models; (5) the methodology of knowledge based multifaceted modeling.展开更多
With the popularization of social media,public opi-nion information on emergencies spreads rapidly on the Internet,the impact of negative public opinions on an event has become more significant.Based on the organizati...With the popularization of social media,public opi-nion information on emergencies spreads rapidly on the Internet,the impact of negative public opinions on an event has become more significant.Based on the organizational form of public opinion information,the knowledge graph is used to construct the knowledge base of public opinion risk cases on the emer-gency network.The emotion recognition model of negative pub-lic opinion information based on the bi-directional long short-term memory(BiLSTM)network is studied in the model layer design,and a linear discriminant analysis(LDA)topic extraction method combined with association rules is proposed to extract and mine the semantics of negative public opinion topics to real-ize further in-depth analysis of information topics.Focusing on public health emergencies,knowledge acquisition and knowl-edge processing of public opinion information are conducted,and the experimental results show that the knowledge graph framework based on the construction can facilitate in-depth theme evolution analysis of public opinion events,thus demon-strating important research significance for reducing online pub-lic opinion risks.展开更多
Recently, high-precision trajectory prediction of ballistic missiles in the boost phase has become a research hotspot. This paper proposes a trajectory prediction algorithm driven by data and knowledge(DKTP) to solve ...Recently, high-precision trajectory prediction of ballistic missiles in the boost phase has become a research hotspot. This paper proposes a trajectory prediction algorithm driven by data and knowledge(DKTP) to solve this problem. Firstly, the complex dynamics characteristics of ballistic missile in the boost phase are analyzed in detail. Secondly, combining the missile dynamics model with the target gravity turning model, a knowledge-driven target three-dimensional turning(T3) model is derived. Then, the BP neural network is used to train the boost phase trajectory database in typical scenarios to obtain a datadriven state parameter mapping(SPM) model. On this basis, an online trajectory prediction framework driven by data and knowledge is established. Based on the SPM model, the three-dimensional turning coefficients of the target are predicted by using the current state of the target, and the state of the target at the next moment is obtained by combining the T3 model. Finally, simulation verification is carried out under various conditions. The simulation results show that the DKTP algorithm combines the advantages of data-driven and knowledge-driven, improves the interpretability of the algorithm, reduces the uncertainty, which can achieve high-precision trajectory prediction of ballistic missile in the boost phase.展开更多
Tracking and analyzing data from research projects is critical for understanding research trends and supporting the development of science and technology strategies.However,the data from these projects is often comple...Tracking and analyzing data from research projects is critical for understanding research trends and supporting the development of science and technology strategies.However,the data from these projects is often complex and inadequate,making it challenging for researchers to conduct in-depth data mining to improve policies or management.To address this problem,this paper adopts a top-down approach to construct a knowledge graph(KG)for research projects.Firstly,we construct an integrated ontology by referring to the metamodel of various architectures,which is called the meta-model integration conceptual reference model.Subsequently,we use the dependency parsing method to extract knowledge from unstructured textual data and use the entity alignment method based on weakly supervised learning to classify the extracted entities,completing the construction of the KG for the research projects.In addition,a knowledge inference model based on representation learning is employed to achieve knowledge completion and improve the KG.Finally,experiments are conducted on the KG for research projects and the results demonstrate the effectiveness of the proposed method in enriching incomplete data within the KG.展开更多
结合国内图书馆普遍开展的论文引证检索服务的实际需求,在大量工作实践的基础上,设计并实现了一款基于ISI Web of Knowledge平台检索结果引证检索统计报告的软件,能够根据不同的统计指标,对检索结果进行快速统计。实践证明,该软件提高...结合国内图书馆普遍开展的论文引证检索服务的实际需求,在大量工作实践的基础上,设计并实现了一款基于ISI Web of Knowledge平台检索结果引证检索统计报告的软件,能够根据不同的统计指标,对检索结果进行快速统计。实践证明,该软件提高了工作效率的同时,保证了正确率。展开更多
基于ISI Web of Knowledge平台,对8所纺织背景高校在2001—2011年间的科技论文进行了多角度的统计和分析,探讨了8所纺织背景高校近年来的学科发展现状和趋势,客观评价其学科研究特点和学术影响力,为纺织背景高校增强自身自然科学基础研...基于ISI Web of Knowledge平台,对8所纺织背景高校在2001—2011年间的科技论文进行了多角度的统计和分析,探讨了8所纺织背景高校近年来的学科发展现状和趋势,客观评价其学科研究特点和学术影响力,为纺织背景高校增强自身自然科学基础研究提供参考和帮助,并为其进一步发展提供可参考的定量依据.展开更多
随着学校对图书馆经费投入的不断增加,数字环境下图书馆的合理使用变得越来越重要,其表现在数字资源上要有较多的用户访问和下载.从成员馆对比、登录情况、检索情况和成本等角度统计分析了东华大学ISI Web of Knowledge数据库使用情况,...随着学校对图书馆经费投入的不断增加,数字环境下图书馆的合理使用变得越来越重要,其表现在数字资源上要有较多的用户访问和下载.从成员馆对比、登录情况、检索情况和成本等角度统计分析了东华大学ISI Web of Knowledge数据库使用情况,并分析讨论了读者群和多校区使用情况等,为图书馆电子资源订购提供有效依据.展开更多
The conceptions of the knowledge screen generated by S-rough sets are given: f- screen and - screen , and then puts forward - filter theorem, - filter theorem of knowledge. At last, the applications of knowledge separ...The conceptions of the knowledge screen generated by S-rough sets are given: f- screen and - screen , and then puts forward - filter theorem, - filter theorem of knowledge. At last, the applications of knowledge separation are given according to - screen and - screen.展开更多
Using S-rough sets, this paper gives the concepts off-heredity knowledge and its heredity coefficient, and f-variation coefficient of knowledge; presents the theorem of f-attribute dependence of variation coefficient ...Using S-rough sets, this paper gives the concepts off-heredity knowledge and its heredity coefficient, and f-variation coefficient of knowledge; presents the theorem of f-attribute dependence of variation coefficient and the relation theorem of heredity-variation. The attribute dependence of f-variation coefficient and the relation of heredity-variation are important characteristics of S-rough sets. From such discussion, this paper puts forward the heredity mining off-knowledge and the algorithm of heredity mining, also gives its relative application.展开更多
The concept of F-knowledge is presented by employing S-rough sets. By engrafting and penetrating between the F-knowledge generated by S-rough sets and the RSA algorithm, the security transmission and recognition of mu...The concept of F-knowledge is presented by employing S-rough sets. By engrafting and penetrating between the F-knowledge generated by S-rough sets and the RSA algorithm, the security transmission and recognition of multi-agent F-knowledge are proposed, which includes the security transmission of multi-agent F-knowledge with positive direction secret key and the security transmission of multi-agent F-knowledge with reverse direction secret key. Finally, the recognition criterion and the applications of F-knowledge are presented. The security of F-knowledge is a new application research direction of S-rough sets in information systems.展开更多
In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of th...In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of the space target inverse synthetic aperture radar(ISAR)image recognition model with ultra-lightweight and high accuracy.This method introduces the NAS method into the radar image recognition for the first time,which solves the time-consuming and labor-consuming problems in the artificial design of the space target ISAR image automatic recognition model(STIIARM).On this basis,the NAS model’s knowledge is transferred to the student model with lower computational complexity by the flow of the solution procedure(FSP)distillation method.Thus,the decline of recognition accuracy caused by the direct compression of model structural parameters can be effectively avoided,and the ultralightweight STIIARM can be obtained.In the method,the Inverted Linear Bottleneck(ILB)and Inverted Residual Block(IRB)are firstly taken as each block’s basic structure in CNN.And the expansion ratio,output filter size,number of IRBs,and convolution kernel size are set as the search parameters to construct a hierarchical decomposition search space.Then,the recognition accuracy and computational complexity are taken as the objective function and constraint conditions,respectively,and the global optimization model of the CNN architecture search is established.Next,the simulated annealing(SA)algorithm is used as the search strategy to search out the lightweight and high accuracy STIIARM directly.After that,based on the three principles of similar block structure,the same corresponding channel number,and the minimum computational complexity,the more lightweight student model is designed,and the FSP matrix pairing between the NAS model and student model is completed.Finally,by minimizing the loss between the FSP matrix pairs of the NAS model and student model,the student model’s weight adjustment is completed.Thus the ultra-lightweight and high accuracy STIIARM is obtained.The proposed method’s effectiveness is verified by the simulation experiments on the ISAR image dataset of five types of space targets.展开更多
In previous researches on a model-based diagnostic system, the components are assumed mutually independent. Howerver , the assumption is not always the case because the information about whether a component is faulty ...In previous researches on a model-based diagnostic system, the components are assumed mutually independent. Howerver , the assumption is not always the case because the information about whether a component is faulty or not usually influences our knowledge about other components. Some experts may draw such a conclusion that 'if component m 1 is faulty, then component m 2 may be faulty too'. How can we use this experts' knowledge to aid the diagnosis? Based on Kohlas's probabilistic assumption-based reasoning method, we use Bayes networks to solve this problem. We calculate the posterior fault probability of the components in the observation state. The result is reasonable and reflects the effectiveness of the experts' knowledge.展开更多
To detect high frequency (HF) first-order sea echo spectra contaminated with ships, ionosphere interference, and other, a new characteristic-knowledge-aided detection method is proposed. With 2-D image features in r...To detect high frequency (HF) first-order sea echo spectra contaminated with ships, ionosphere interference, and other, a new characteristic-knowledge-aided detection method is proposed. With 2-D image features in range-Doppler spectrum, the trend of first-order sea echoes is extracted as indicative information by a multi-scale filter. Detection rules for both single and splitting first-order sea echoes are given based on the characteristic knowledge combining the indicative information with the global characteristics such as amplitude, symmetry, continuity, etc. Compared with the classical algorithms, the proposed method can detect and locate the first-order sea echo in the HF band more accurately especially in the environment with targets/clutters smearing. Experiments with real data verify the validity of the algorithm.展开更多
Short-term forecasting is a difficult problem because of the influence of non-linear factors and irregular events.A novel short-term forecasting method named TIK was proposed,in which ARMA forecasting model was used t...Short-term forecasting is a difficult problem because of the influence of non-linear factors and irregular events.A novel short-term forecasting method named TIK was proposed,in which ARMA forecasting model was used to consider the load time series trend forecasting,intelligence forecasting DESVR model was applied to estimate the non-linear influence,and knowledge mining methods were applied to correct the errors caused by irregular events.In order to prove the effectiveness of the proposed model,an application of the daily maximum load forecasting was evaluated.The experimental results show that the DESVR model improves the mean absolute percentage error(MAPE) from 2.82% to 2.55%,and the knowledge rules can improve the MAPE from 2.55% to 2.30%.Compared with the single ARMA forecasting method and ARMA combined SVR forecasting method,it can be proved that TIK method gains the best performance in short-term load forecasting.展开更多
In this paper, the reinforcement learning method for cooperative multi-agent systems(MAS) with incremental number of agents is studied. The existing multi-agent reinforcement learning approaches deal with the MAS with...In this paper, the reinforcement learning method for cooperative multi-agent systems(MAS) with incremental number of agents is studied. The existing multi-agent reinforcement learning approaches deal with the MAS with a specific number of agents, and can learn well-performed policies. However, if there is an increasing number of agents, the previously learned in may not perform well in the current scenario. The new agents need to learn from scratch to find optimal policies with others,which may slow down the learning speed of the whole team. To solve that problem, in this paper, we propose a new algorithm to take full advantage of the historical knowledge which was learned before, and transfer it from the previous agents to the new agents. Since the previous agents have been trained well in the source environment, they are treated as teacher agents in the target environment. Correspondingly, the new agents are called student agents. To enable the student agents to learn from the teacher agents, we first modify the input nodes of the networks for teacher agents to adapt to the current environment. Then, the teacher agents take the observations of the student agents as input, and output the advised actions and values as supervising information. Finally, the student agents combine the reward from the environment and the supervising information from the teacher agents, and learn the optimal policies with modified loss functions. By taking full advantage of the knowledge of teacher agents, the search space for the student agents will be reduced significantly, which can accelerate the learning speed of the holistic system. The proposed algorithm is verified in some multi-agent simulation environments, and its efficiency has been demonstrated by the experiment results.展开更多
Immune evolutionary algorithms with domain knowledge were presented to solve the problem of simultaneous localization and mapping for a mobile robot in unknown environments. Two operators with domain knowledge were de...Immune evolutionary algorithms with domain knowledge were presented to solve the problem of simultaneous localization and mapping for a mobile robot in unknown environments. Two operators with domain knowledge were designed in algorithms, where the feature of parallel line segments without the problem of data association was used to construct a vaccination operator, and the characters of convex vertices in polygonal obstacle were extended to develop a pulling operator of key point grid. The experimental results of a real mobile robot show that the computational expensiveness of algorithms designed is less than other evolutionary algorithms for simultaneous localization and mapping and the maps obtained are very accurate. Because immune evolutionary algorithms with domain knowledge have some advantages, the convergence rate of designed algorithms is about 44% higher than those of other algorithms.展开更多
文摘The paper presents a cognitive science framework for the analysis of knowledge-based systems,including people, media. simulation and expert systems, resulting in a practical model for the procedures ofknowledge engineering. Starting with the construct of a social organization model driven by anticipationand thed differentiating this into pesonal scientists with diverse relations to people and their internal andexternal communication, it provides powerful and general model of society. people, and the roles of peoplein society. This model extends naturally ic the role of conventional media in the knowledge processes ofsociety and the new roles of computer-based simulation and expert systems. In particular it provides amodel of knowledge transfer that enables the processes of knowledge engineering to be analyzed andautomated.
文摘In this paper, the structure characteristics of open complex giant systems are concretely analysed in depth, thus the view and its significance to support the meta synthesis engineering with manifold knowledge models are clarified. Furthermore, the knowledge based multifaceted modeling methodology for open complex giant systems is emphatically studied. The major points are as follows: (1) nonlinear mechanism and general information partition law; (2) from the symmetry and similarity to the acquisition of construction knowledge; (3) structures for hierarchical and nonhierarchical organizations; (4) the integration of manifold knowledge models; (5) the methodology of knowledge based multifaceted modeling.
基金supported by the National Social Science Foundation Major Project(22&ZD135)the National Social Science Fund National Emergency Management System Construction Research Project(20VYJ061).
文摘With the popularization of social media,public opi-nion information on emergencies spreads rapidly on the Internet,the impact of negative public opinions on an event has become more significant.Based on the organizational form of public opinion information,the knowledge graph is used to construct the knowledge base of public opinion risk cases on the emer-gency network.The emotion recognition model of negative pub-lic opinion information based on the bi-directional long short-term memory(BiLSTM)network is studied in the model layer design,and a linear discriminant analysis(LDA)topic extraction method combined with association rules is proposed to extract and mine the semantics of negative public opinion topics to real-ize further in-depth analysis of information topics.Focusing on public health emergencies,knowledge acquisition and knowl-edge processing of public opinion information are conducted,and the experimental results show that the knowledge graph framework based on the construction can facilitate in-depth theme evolution analysis of public opinion events,thus demon-strating important research significance for reducing online pub-lic opinion risks.
基金the National Natural Science Foundation of China (Grants No. 12072090 and No.12302056) to provide fund for conducting experiments。
文摘Recently, high-precision trajectory prediction of ballistic missiles in the boost phase has become a research hotspot. This paper proposes a trajectory prediction algorithm driven by data and knowledge(DKTP) to solve this problem. Firstly, the complex dynamics characteristics of ballistic missile in the boost phase are analyzed in detail. Secondly, combining the missile dynamics model with the target gravity turning model, a knowledge-driven target three-dimensional turning(T3) model is derived. Then, the BP neural network is used to train the boost phase trajectory database in typical scenarios to obtain a datadriven state parameter mapping(SPM) model. On this basis, an online trajectory prediction framework driven by data and knowledge is established. Based on the SPM model, the three-dimensional turning coefficients of the target are predicted by using the current state of the target, and the state of the target at the next moment is obtained by combining the T3 model. Finally, simulation verification is carried out under various conditions. The simulation results show that the DKTP algorithm combines the advantages of data-driven and knowledge-driven, improves the interpretability of the algorithm, reduces the uncertainty, which can achieve high-precision trajectory prediction of ballistic missile in the boost phase.
基金supported by the National Natural Science Foundation of China(72101263).
文摘Tracking and analyzing data from research projects is critical for understanding research trends and supporting the development of science and technology strategies.However,the data from these projects is often complex and inadequate,making it challenging for researchers to conduct in-depth data mining to improve policies or management.To address this problem,this paper adopts a top-down approach to construct a knowledge graph(KG)for research projects.Firstly,we construct an integrated ontology by referring to the metamodel of various architectures,which is called the meta-model integration conceptual reference model.Subsequently,we use the dependency parsing method to extract knowledge from unstructured textual data and use the entity alignment method based on weakly supervised learning to classify the extracted entities,completing the construction of the KG for the research projects.In addition,a knowledge inference model based on representation learning is employed to achieve knowledge completion and improve the KG.Finally,experiments are conducted on the KG for research projects and the results demonstrate the effectiveness of the proposed method in enriching incomplete data within the KG.
文摘基于ISI Web of Knowledge平台,对8所纺织背景高校在2001—2011年间的科技论文进行了多角度的统计和分析,探讨了8所纺织背景高校近年来的学科发展现状和趋势,客观评价其学科研究特点和学术影响力,为纺织背景高校增强自身自然科学基础研究提供参考和帮助,并为其进一步发展提供可参考的定量依据.
文摘随着学校对图书馆经费投入的不断增加,数字环境下图书馆的合理使用变得越来越重要,其表现在数字资源上要有较多的用户访问和下载.从成员馆对比、登录情况、检索情况和成本等角度统计分析了东华大学ISI Web of Knowledge数据库使用情况,并分析讨论了读者群和多校区使用情况等,为图书馆电子资源订购提供有效依据.
文摘The conceptions of the knowledge screen generated by S-rough sets are given: f- screen and - screen , and then puts forward - filter theorem, - filter theorem of knowledge. At last, the applications of knowledge separation are given according to - screen and - screen.
基金This project was supported by the National Natural Science Foundation of China (60364001), the Shandong ProvincialNatural Science Foundation of China (Y2004A04) and Fujian Provincial Education Foundation of China(JA04268).
文摘Using S-rough sets, this paper gives the concepts off-heredity knowledge and its heredity coefficient, and f-variation coefficient of knowledge; presents the theorem of f-attribute dependence of variation coefficient and the relation theorem of heredity-variation. The attribute dependence of f-variation coefficient and the relation of heredity-variation are important characteristics of S-rough sets. From such discussion, this paper puts forward the heredity mining off-knowledge and the algorithm of heredity mining, also gives its relative application.
基金supported partly by the Natural Science Foundation of Fujian Province of China(2009J01293)the Natural Science Foundation of Shandong Province of China(Y2007H02).
文摘The concept of F-knowledge is presented by employing S-rough sets. By engrafting and penetrating between the F-knowledge generated by S-rough sets and the RSA algorithm, the security transmission and recognition of multi-agent F-knowledge are proposed, which includes the security transmission of multi-agent F-knowledge with positive direction secret key and the security transmission of multi-agent F-knowledge with reverse direction secret key. Finally, the recognition criterion and the applications of F-knowledge are presented. The security of F-knowledge is a new application research direction of S-rough sets in information systems.
文摘In this paper,a novel method of ultra-lightweight convolution neural network(CNN)design based on neural architecture search(NAS)and knowledge distillation(KD)is proposed.It can realize the automatic construction of the space target inverse synthetic aperture radar(ISAR)image recognition model with ultra-lightweight and high accuracy.This method introduces the NAS method into the radar image recognition for the first time,which solves the time-consuming and labor-consuming problems in the artificial design of the space target ISAR image automatic recognition model(STIIARM).On this basis,the NAS model’s knowledge is transferred to the student model with lower computational complexity by the flow of the solution procedure(FSP)distillation method.Thus,the decline of recognition accuracy caused by the direct compression of model structural parameters can be effectively avoided,and the ultralightweight STIIARM can be obtained.In the method,the Inverted Linear Bottleneck(ILB)and Inverted Residual Block(IRB)are firstly taken as each block’s basic structure in CNN.And the expansion ratio,output filter size,number of IRBs,and convolution kernel size are set as the search parameters to construct a hierarchical decomposition search space.Then,the recognition accuracy and computational complexity are taken as the objective function and constraint conditions,respectively,and the global optimization model of the CNN architecture search is established.Next,the simulated annealing(SA)algorithm is used as the search strategy to search out the lightweight and high accuracy STIIARM directly.After that,based on the three principles of similar block structure,the same corresponding channel number,and the minimum computational complexity,the more lightweight student model is designed,and the FSP matrix pairing between the NAS model and student model is completed.Finally,by minimizing the loss between the FSP matrix pairs of the NAS model and student model,the student model’s weight adjustment is completed.Thus the ultra-lightweight and high accuracy STIIARM is obtained.The proposed method’s effectiveness is verified by the simulation experiments on the ISAR image dataset of five types of space targets.
文摘In previous researches on a model-based diagnostic system, the components are assumed mutually independent. Howerver , the assumption is not always the case because the information about whether a component is faulty or not usually influences our knowledge about other components. Some experts may draw such a conclusion that 'if component m 1 is faulty, then component m 2 may be faulty too'. How can we use this experts' knowledge to aid the diagnosis? Based on Kohlas's probabilistic assumption-based reasoning method, we use Bayes networks to solve this problem. We calculate the posterior fault probability of the components in the observation state. The result is reasonable and reflects the effectiveness of the experts' knowledge.
文摘To detect high frequency (HF) first-order sea echo spectra contaminated with ships, ionosphere interference, and other, a new characteristic-knowledge-aided detection method is proposed. With 2-D image features in range-Doppler spectrum, the trend of first-order sea echoes is extracted as indicative information by a multi-scale filter. Detection rules for both single and splitting first-order sea echoes are given based on the characteristic knowledge combining the indicative information with the global characteristics such as amplitude, symmetry, continuity, etc. Compared with the classical algorithms, the proposed method can detect and locate the first-order sea echo in the HF band more accurately especially in the environment with targets/clutters smearing. Experiments with real data verify the validity of the algorithm.
基金Projects(70671039,71071052) supported by the National Natural Science Foundation of ChinaProjects(10QX44,09QX68) supported by the Fundamental Research Funds for the Central Universities in China
文摘Short-term forecasting is a difficult problem because of the influence of non-linear factors and irregular events.A novel short-term forecasting method named TIK was proposed,in which ARMA forecasting model was used to consider the load time series trend forecasting,intelligence forecasting DESVR model was applied to estimate the non-linear influence,and knowledge mining methods were applied to correct the errors caused by irregular events.In order to prove the effectiveness of the proposed model,an application of the daily maximum load forecasting was evaluated.The experimental results show that the DESVR model improves the mean absolute percentage error(MAPE) from 2.82% to 2.55%,and the knowledge rules can improve the MAPE from 2.55% to 2.30%.Compared with the single ARMA forecasting method and ARMA combined SVR forecasting method,it can be proved that TIK method gains the best performance in short-term load forecasting.
基金supported by the National Key R&D Program of China (2018AAA0101400)the National Natural Science Foundation of China (62173251+3 种基金61921004U1713209)the Natural Science Foundation of Jiangsu Province of China (BK20202006)the Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control。
文摘In this paper, the reinforcement learning method for cooperative multi-agent systems(MAS) with incremental number of agents is studied. The existing multi-agent reinforcement learning approaches deal with the MAS with a specific number of agents, and can learn well-performed policies. However, if there is an increasing number of agents, the previously learned in may not perform well in the current scenario. The new agents need to learn from scratch to find optimal policies with others,which may slow down the learning speed of the whole team. To solve that problem, in this paper, we propose a new algorithm to take full advantage of the historical knowledge which was learned before, and transfer it from the previous agents to the new agents. Since the previous agents have been trained well in the source environment, they are treated as teacher agents in the target environment. Correspondingly, the new agents are called student agents. To enable the student agents to learn from the teacher agents, we first modify the input nodes of the networks for teacher agents to adapt to the current environment. Then, the teacher agents take the observations of the student agents as input, and output the advised actions and values as supervising information. Finally, the student agents combine the reward from the environment and the supervising information from the teacher agents, and learn the optimal policies with modified loss functions. By taking full advantage of the knowledge of teacher agents, the search space for the student agents will be reduced significantly, which can accelerate the learning speed of the holistic system. The proposed algorithm is verified in some multi-agent simulation environments, and its efficiency has been demonstrated by the experiment results.
基金Projects(60234030 60404021) supported by the National Natural Science Foundation of China
文摘Immune evolutionary algorithms with domain knowledge were presented to solve the problem of simultaneous localization and mapping for a mobile robot in unknown environments. Two operators with domain knowledge were designed in algorithms, where the feature of parallel line segments without the problem of data association was used to construct a vaccination operator, and the characters of convex vertices in polygonal obstacle were extended to develop a pulling operator of key point grid. The experimental results of a real mobile robot show that the computational expensiveness of algorithms designed is less than other evolutionary algorithms for simultaneous localization and mapping and the maps obtained are very accurate. Because immune evolutionary algorithms with domain knowledge have some advantages, the convergence rate of designed algorithms is about 44% higher than those of other algorithms.