检索结果-维普期刊中文期刊服务平台

视觉深度学习模型压缩加速综述被引量：2: 1; 作者丁贵广陈辉 +3 位作者王澳杨帆熊翊哲梁伊雯《智能系统学报》 CSCD 北大核心 2024年第5期1072-1081,共10页; 近年来,深度学习模型规模越来越大,在嵌入式设备等资源受限环境中,大规模视觉深度学习模型难以实现高效推理部署。模型压缩加速可以有效解决该挑战。尽管已经出现相关工作的综述,但相关工作集中在卷积神经网络的压缩加速,缺乏对视觉Tran... 展开更多; 关键词视觉深度学习模型压缩轻量化结构模型剪枝模型量化模型蒸馏 TRANSFORMER 序列剪枝; 在线阅读下载PDF 职称材料

视觉深度学习的三维重建方法综述被引量：15: 2; 作者李明阳陈伟 +3 位作者王珊珊黎捷田子建张帆《计算机科学与探索》 CSCD 北大核心 2023年第2期279-302,共24页; 近年来,三维重建作为计算机视觉的重要任务之一,得到广泛的关注和深入的研究。重点分析近年来使用深度学习重建通用对象的三维形状的研究进展。以深度学习进行三维重建环节为脉络,根据三维重建过程中数据深度特征表示方法将三维重建研... 展开更多; 关键词三维重建视觉深度学习表征重建几何重建三维补全动态人体重建; 在线阅读下载PDF 职称材料

Open TBM Tunnel Intelligent Construction Technology 被引量：2: 3; 作者 LIU Yongsheng CHEN Qiao +4 位作者 ZHANG Hepei LI Shu′ao LIN Chungang YIN Long LI Mengyu 《隧道建设(中英文)》北大核心 2025年第4期816-833,I0025-I0042,共36页; To fully leverage the advantages of mechanization and informatization in tunnel boring machine(TBM)operations,the authors aim to promote the advancement of tunnel construction technology toward intelligent development... 展开更多; 关键词 TUNNEL open TBM intelligent construction deep learning machine vision; 在线阅读下载PDF 职称材料

A survey of deep learning-based visual question answering 被引量：1: 4; 作者 HUANG Tong-yuan YANG Yu-ling YANG Xue-jiao 《Journal of Central South University》 SCIE EI CAS CSCD 2021年第3期728-746,共19页; With the warming up and continuous development of machine learning,especially deep learning,the research on visual question answering field has made significant progress,with important theoretical research significanc... 展开更多; 关键词 computer vision natural language processing visual question answering deep learning attention mechanism; 在线阅读下载PDF 职称材料

题名视觉深度学习模型压缩加速综述被引量：2: 1; 作者丁贵广陈辉王澳杨帆熊翊哲梁伊雯; 机构清华大学软件学院清华大学北京信息科学与技术国家研究中心涿溪脑与智能研究所; 出处《智能系统学报》 CSCD 北大核心 2024年第5期1072-1081,共10页; 基金国家自然科学基金项目(61925107,62271281) 浙江省自然科学基金项目(LDT23F01013F01).; 文摘近年来,深度学习模型规模越来越大,在嵌入式设备等资源受限环境中,大规模视觉深度学习模型难以实现高效推理部署。模型压缩加速可以有效解决该挑战。尽管已经出现相关工作的综述,但相关工作集中在卷积神经网络的压缩加速,缺乏对视觉Transformer模型压缩加速方法的整理和对比分析。因此,本文以视觉深度学习模型压缩技术为核心,对卷积神经网络和视觉Transformer模型2个最重要的视觉深度模型进行了相关技术手段的整理,并对技术热点和挑战进行了总结和分析。本文旨在为研究者提供一个全面了解模型压缩和加速领域的视角,促进深度学习模型压缩加速技术的发展。; 关键词视觉深度学习模型压缩轻量化结构模型剪枝模型量化模型蒸馏 TRANSFORMER 序列剪枝; Keywords visual deep learning model compression lightweight structure model pruning model quantization model distillation Transformer token pruning; 分类号 TP18 [自动化与计算机技术—控制理论与控制工程]; 在线阅读下载PDF 职称材料

题名视觉深度学习的三维重建方法综述被引量：15: 2; 作者李明阳陈伟王珊珊黎捷田子建张帆; 机构中国矿业大学计算机科学与技术学院中国矿业大学矿山数字化教育部工程研究中心中国矿业大学(北京)机电与信息工程学院; 出处《计算机科学与探索》 CSCD 北大核心 2023年第2期279-302,共24页; 基金国家自然科学基金(51874300,52074305) 国家自然科学基金山西省煤炭基地低碳联合资助项目(U1510115)。; 文摘近年来,三维重建作为计算机视觉的重要任务之一,得到广泛的关注和深入的研究。重点分析近年来使用深度学习重建通用对象的三维形状的研究进展。以深度学习进行三维重建环节为脉络,根据三维重建过程中数据深度特征表示方法将三维重建研究分为体素、点云、曲面网格、隐式曲面四类。再根据输入二维图像的数目分为单视图三维重建和多视图三维重建两类,根据网络架构以及它们使用的训练机制进行具体细分,在讨论每个类别的研究进展的同时,分析每种训练方法的发展前景及优缺点。研究近年来在特定三维重建领域的新热点,例如动态人体三维重建和不完整几何数据的三维补全,对一些关键论文进行比较,总结了这些领域存在的问题。介绍现阶段的三维数据集的重点应用场景和参数。总结现阶段三维重建领域存在数据集缺失、模型训练复杂、缺少特定领域针对性识别等问题。对三维重建在未来的具体应用领域发展前景进行了例证分析,并对三维重建的研究方向进行了展望。; 关键词三维重建视觉深度学习表征重建几何重建三维补全动态人体重建; Keywords 3D reconstruction visual deep learning characterization reconstruction geometric reconstruction 3D completion dynamic human reconstruction; 分类号 TP391 [自动化与计算机技术—计算机应用技术]; 在线阅读下载PDF 职称材料

题名Open TBM Tunnel Intelligent Construction Technology 被引量：2: 3; 作者 LIU Yongsheng CHEN Qiao ZHANG Hepei LI Shu′ao LIN Chungang YIN Long LI Mengyu; 机构 State Key Laboratory of Shield Machine and Boring Technology China Railway Tunnel Group Co. Guangdong Provincial Key Laboratory of Intelligent Monitoring and Maintenance of Tunnel Structure China Railway Tunnel Group(Shanghai)Special High-tech Co.; 出处《隧道建设(中英文)》北大核心 2025年第4期816-833,I0025-I0042,共36页; 文摘 To fully leverage the advantages of mechanization and informatization in tunnel boring machine(TBM)operations,the authors aim to promote the advancement of tunnel construction technology toward intelligent development.This involved exploring the deep integration of next-generation artificial intelligence technologies,such as sensing technology,automatic control technology,big data technology,deep learning,and machine vision,with key operational processes,including TBM excavation,direction adjustment,step changes,inverted arch block assembly,material transportation,and operation status assurance.The results of this integration are summarized as follows.(1)TBM key excavation parameter prediction algorithm was developed with an accuracy rate exceeding 90%.The TBM intelligent step-change control algorithm,based on machine vision,achieved an image segmentation accuracy rate of 95%and gripper shoe positioning error of±5 mm.(2)An automatic positioning system for inverted arch blocks was developed,enabling real-time perception of the spatial position and deviation during the assembly process.The system maintains an elevation positioning deviation within±3 mm and a horizontal positioning deviation within±10 mm,reducing the number of surveyors in each work team.(3)A TBM intelligent rail transportation system that achieves real-time human-machine positioning,automatic switch opening and closing,automatic obstacle avoidance,intelligent transportation planning,and integrated scheduling and command was designed.Each locomotive formation reduces one shunter and improves comprehensive transportation efficiency by more than 20%.(4)Intelligent analysis and prediction algorithms were developed to monitor and predict the trends of the hydraulic and gear oil parameters in real time,enhancing the proactive maintenance and system reliability.; 关键词 TUNNEL open TBM intelligent construction deep learning machine vision; Keywords 隧道敞开式TBM 智能施工深度学习机器视觉; 分类号 U455.43 [建筑科学—桥梁与隧道工程]; 在线阅读下载PDF 职称材料

题名A survey of deep learning-based visual question answering 被引量：1: 4; 作者 HUANG Tong-yuan YANG Yu-ling YANG Xue-jiao; 机构 School of Artificial Intelligence School of Computer Science and Engineering; 出处《Journal of Central South University》 SCIE EI CAS CSCD 2021年第3期728-746,共19页; 基金 Project(61702063)supported by the National Natural Science Foundation of China。; 文摘 With the warming up and continuous development of machine learning,especially deep learning,the research on visual question answering field has made significant progress,with important theoretical research significance and practical application value.Therefore,it is necessary to summarize the current research and provide some reference for researchers in this field.This article conducted a detailed and in-depth analysis and summarized of relevant research and typical methods of visual question answering field.First,relevant background knowledge about VQA(Visual Question Answering)was introduced.Secondly,the issues and challenges of visual question answering were discussed,and at the same time,some promising discussion on the particular methodologies was given.Thirdly,the key sub-problems affecting visual question answering were summarized and analyzed.Then,the current commonly used data sets and evaluation indicators were summarized.Next,in view of the popular algorithms and models in VQA research,comparison of the algorithms and models was summarized and listed.Finally,the future development trend and conclusion of visual question answering were prospected.; 关键词 computer vision natural language processing visual question answering deep learning attention mechanism; Keywords 计算机视觉自然语言处理视觉问答深度学习注意力机制; 分类号 TP18 [自动化与计算机技术—控制理论与控制工程] TP391.41 [自动化与计算机技术—计算机应用技术]; 在线阅读下载PDF 职称材料

	题名	作者	出处	发文年	被引量	操作
1	视觉深度学习模型压缩加速综述	丁贵广陈辉王澳杨帆熊翊哲梁伊雯	《智能系统学报》 CSCD 北大核心	2024	2	在线阅读下载PDF 职称材料
2	视觉深度学习的三维重建方法综述	李明阳陈伟王珊珊黎捷田子建张帆	《计算机科学与探索》 CSCD 北大核心	2023	15	在线阅读下载PDF 职称材料
3	Open TBM Tunnel Intelligent Construction Technology	LIU Yongsheng CHEN Qiao ZHANG Hepei LI Shu′ao LIN Chungang YIN Long LI Mengyu	《隧道建设(中英文)》北大核心	2025	2	在线阅读下载PDF 职称材料
4	A survey of deep learning-based visual question answering	HUANG Tong-yuan YANG Yu-ling YANG Xue-jiao	《Journal of Central South University》 SCIE EI CAS CSCD	2021	1	在线阅读下载PDF 职称材料