基于帝国竞争演化与深度强化学习的背包问题优化算法

Knapsack Problem Optimization Algorithm Based on Imperialist Competitive Evolution and Deep Reinforcement Learning

在线阅读下载PDF

导出

摘要 0-1背包问题(knapsack problem,KP)是组合优化领域中一个具有广泛应用的经典NP难问题。针对原始帝国竞争算法(imperialist competition algorithm,ICA)在高维复杂问题中易陷入局部最优、全局探索能力不足的局限性,提出一种改进帝国竞争算法与融入多头注意力机制深度强化学习方法相结合的优化算法(improved imperialist competition algorithm incorporating deep reinforcement learning,IICA-DRL)。该算法通过引入插入交叉同化算子、双位变异机制和援助机制增强局部搜索能力和种群多样性,并利用多头注意力机制的深度强化学习模型对IICA高质量解进行优化,进一步增强了个体解的质量和算法的全局勘探能力。在4个测试集中的62个0-1 KP算例上进行性能评估,结果显示其中54个算例求解达到最优解。与20种元启发式算法进行了性能对比,实验结果表明,IICADRL算法具有较强的稳定性和有效性,初步验证了改进策略的可行性,为ICA求解背包问题提供了一个有效的算法设计方案。 The 0-1 knapsack problem(KP)is a classical NP-hard problem with wide applications in the field of combinatorial optimization.To address the limitations of the original imperialist competition algorithm(ICA),which is prone to fall into local optimality and lack of global exploration ability in high-dimensional complex problems,an optimization algorithm that combines the improved imperialist competition algorithm incorporating deep reinforcement learning with a multi-head attention mechanism(IICA-DRL)is proposed.The algorithm enhances the local search capability and population diversity by introducing the insertion cross assimilation operator,the two-bit mutation mechanism and the assistance mechanism,and optimizes the high quality solutions of IICA by using the deep reinforcement learning model with the multi-head attention mechanism,which further enhances the quality of the individual solutions and the global exploration capability of the algorithm.Performance evaluation is performed on 620-1 KP instances in 4 test sets,and the results show that 54 of the instances solved reach the optimal solution.The performance is compared with 20 meta-heuristic algorithms.The experimental results show that the IICA-DRL algorithm has strong stability and effectiveness,preliminarily verifies the feasibility of the improved strategy,and provides an effective algorithmic design scheme for ICA to solve the knapsack problem.

作者李斌潘智成 LI Bin;PAN Zhicheng(School of Mechanical and Automotive Engineering,Fujian University of Technology,Fuzhou 350118,China;Fujian Provincial Key Laboratory of Big Data Mining and Applications,Fujian University of Technology,Fuzhou 350118,China;School of Computer Science and Mathematics,Fujian University of Technology,Fuzhou 350118,China)

机构地区福建理工大学机械与汽车工程学院福建理工大学福建省大数据挖掘与应用技术重点实验室福建理工大学计算机科学与数学学院

出处《计算机工程与应用》北大核心 2025年第22期92-113,共22页 Computer Engineering and Applications

基金教育部人文社会科学研究规划基金(19YJA630031)。

关键词 0-1背包问题帝国竞争算法同化算子多样性机制多头注意力机制深度强化学习 0-1 knapsack problem imperialist competitive algorithm assimilation operator diversity mechanism multihead attention mechanism deep reinforcement learning

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

作者简介通信作者:李斌(1979-),男,博士(后),教授,CCF高级会员,研究方向为机器学习、群集智能与智慧港航,E-mail:whutmse2007_lb@126.com;潘智成(1998-),男,硕士研究生,研究方向为机器学习、群集智能。

引文网络
相关文献

1韩俊芳,任瑞仙,李军红.无线传感器网络安全时序数据流多层次提取方法[J].传感技术学报,2025,38(10):1892-1897.
2张海阳,陈耀登,孙涛,陈敏,黄向宇,孙健,王瑞春,范水勇.变分框架下双偏振雷达直接同化算子的构建及其初步应用[J].气象学报,2024,82(6):774-788. 被引量：2
3辛春花,闫凤,何婷.资源共享平台大数据负载均衡性控制方法[J].现代电子技术,2025,48(20):160-164.
4邱琦枫.浅析小米公司的公司战略与风险管理[J].电子商务评论,2024,13(3):8812-8819.
5王雨,李志强,韩帅,Abderrahim BENSLIMANE,李成.面向具备智能在轨服务功能的NTN-IoT的上行帧资源部署方案设计[J].中国科学:信息科学,2025,55(10):2491-2500.
6陆未央.班主任参与完善中学生心理健康“一生一档”的思路和策略[J].中小学心理健康教育,2025(33):70-72.
7张翾,李红月.基于MSCSO-Transformer-BiLSTM的短期电力负荷预测[J].佳木斯大学学报(自然科学版),2025,43(11):15-20.
8葛啸慈,钟莲.基于实时需求的上门取件员在线调度方法及其应用研究[J].电子制作,2025,33(20):58-62.
9周毓华,李欣蔚.铸牢中华民族共同体意识视阈下的对口援藏三十年——以粤、闽两省对口援助林芝市为例[J].西藏民族大学学报(哲学社会科学版),2025,46(4):128-136.
10张若枫.阿联酋对外援助:机制、特点与动因[J].北大中东研究,2025(1):59-73.

计算机工程与应用

2025年第22期

浏览历史

内容加载中请稍等...

基于帝国竞争演化与深度强化学习的背包问题优化算法

相关作者

相关机构

相关主题

浏览历史