期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
视觉深度学习模型压缩加速综述 被引量:2
1
作者 丁贵广 陈辉 +3 位作者 王澳 杨帆 熊翊哲 梁伊雯 《智能系统学报》 CSCD 北大核心 2024年第5期1072-1081,共10页
近年来,深度学习模型规模越来越大,在嵌入式设备等资源受限环境中,大规模视觉深度学习模型难以实现高效推理部署。模型压缩加速可以有效解决该挑战。尽管已经出现相关工作的综述,但相关工作集中在卷积神经网络的压缩加速,缺乏对视觉Tran... 近年来,深度学习模型规模越来越大,在嵌入式设备等资源受限环境中,大规模视觉深度学习模型难以实现高效推理部署。模型压缩加速可以有效解决该挑战。尽管已经出现相关工作的综述,但相关工作集中在卷积神经网络的压缩加速,缺乏对视觉Transformer模型压缩加速方法的整理和对比分析。因此,本文以视觉深度学习模型压缩技术为核心,对卷积神经网络和视觉Transformer模型2个最重要的视觉深度模型进行了相关技术手段的整理,并对技术热点和挑战进行了总结和分析。本文旨在为研究者提供一个全面了解模型压缩和加速领域的视角,促进深度学习模型压缩加速技术的发展。 展开更多
关键词 视觉深度学习 模型压缩 轻量化结构 模型剪枝 模型量化 模型蒸馏 TRANSFORMER 序列剪枝
在线阅读 下载PDF
视觉深度学习的三维重建方法综述 被引量:15
2
作者 李明阳 陈伟 +3 位作者 王珊珊 黎捷 田子建 张帆 《计算机科学与探索》 CSCD 北大核心 2023年第2期279-302,共24页
近年来,三维重建作为计算机视觉的重要任务之一,得到广泛的关注和深入的研究。重点分析近年来使用深度学习重建通用对象的三维形状的研究进展。以深度学习进行三维重建环节为脉络,根据三维重建过程中数据深度特征表示方法将三维重建研... 近年来,三维重建作为计算机视觉的重要任务之一,得到广泛的关注和深入的研究。重点分析近年来使用深度学习重建通用对象的三维形状的研究进展。以深度学习进行三维重建环节为脉络,根据三维重建过程中数据深度特征表示方法将三维重建研究分为体素、点云、曲面网格、隐式曲面四类。再根据输入二维图像的数目分为单视图三维重建和多视图三维重建两类,根据网络架构以及它们使用的训练机制进行具体细分,在讨论每个类别的研究进展的同时,分析每种训练方法的发展前景及优缺点。研究近年来在特定三维重建领域的新热点,例如动态人体三维重建和不完整几何数据的三维补全,对一些关键论文进行比较,总结了这些领域存在的问题。介绍现阶段的三维数据集的重点应用场景和参数。总结现阶段三维重建领域存在数据集缺失、模型训练复杂、缺少特定领域针对性识别等问题。对三维重建在未来的具体应用领域发展前景进行了例证分析,并对三维重建的研究方向进行了展望。 展开更多
关键词 三维重建 视觉深度学习 表征重建 几何重建 三维补全 动态人体重建
在线阅读 下载PDF
Open TBM Tunnel Intelligent Construction Technology 被引量:2
3
作者 LIU Yongsheng CHEN Qiao +4 位作者 ZHANG Hepei LI Shu′ao LIN Chungang YIN Long LI Mengyu 《隧道建设(中英文)》 北大核心 2025年第4期816-833,I0025-I0042,共36页
To fully leverage the advantages of mechanization and informatization in tunnel boring machine(TBM)operations,the authors aim to promote the advancement of tunnel construction technology toward intelligent development... To fully leverage the advantages of mechanization and informatization in tunnel boring machine(TBM)operations,the authors aim to promote the advancement of tunnel construction technology toward intelligent development.This involved exploring the deep integration of next-generation artificial intelligence technologies,such as sensing technology,automatic control technology,big data technology,deep learning,and machine vision,with key operational processes,including TBM excavation,direction adjustment,step changes,inverted arch block assembly,material transportation,and operation status assurance.The results of this integration are summarized as follows.(1)TBM key excavation parameter prediction algorithm was developed with an accuracy rate exceeding 90%.The TBM intelligent step-change control algorithm,based on machine vision,achieved an image segmentation accuracy rate of 95%and gripper shoe positioning error of±5 mm.(2)An automatic positioning system for inverted arch blocks was developed,enabling real-time perception of the spatial position and deviation during the assembly process.The system maintains an elevation positioning deviation within±3 mm and a horizontal positioning deviation within±10 mm,reducing the number of surveyors in each work team.(3)A TBM intelligent rail transportation system that achieves real-time human-machine positioning,automatic switch opening and closing,automatic obstacle avoidance,intelligent transportation planning,and integrated scheduling and command was designed.Each locomotive formation reduces one shunter and improves comprehensive transportation efficiency by more than 20%.(4)Intelligent analysis and prediction algorithms were developed to monitor and predict the trends of the hydraulic and gear oil parameters in real time,enhancing the proactive maintenance and system reliability. 展开更多
关键词 TUNNEL open TBM intelligent construction deep learning machine vision
在线阅读 下载PDF
A survey of deep learning-based visual question answering 被引量:1
4
作者 HUANG Tong-yuan YANG Yu-ling YANG Xue-jiao 《Journal of Central South University》 SCIE EI CAS CSCD 2021年第3期728-746,共19页
With the warming up and continuous development of machine learning,especially deep learning,the research on visual question answering field has made significant progress,with important theoretical research significanc... With the warming up and continuous development of machine learning,especially deep learning,the research on visual question answering field has made significant progress,with important theoretical research significance and practical application value.Therefore,it is necessary to summarize the current research and provide some reference for researchers in this field.This article conducted a detailed and in-depth analysis and summarized of relevant research and typical methods of visual question answering field.First,relevant background knowledge about VQA(Visual Question Answering)was introduced.Secondly,the issues and challenges of visual question answering were discussed,and at the same time,some promising discussion on the particular methodologies was given.Thirdly,the key sub-problems affecting visual question answering were summarized and analyzed.Then,the current commonly used data sets and evaluation indicators were summarized.Next,in view of the popular algorithms and models in VQA research,comparison of the algorithms and models was summarized and listed.Finally,the future development trend and conclusion of visual question answering were prospected. 展开更多
关键词 computer vision natural language processing visual question answering deep learning attention mechanism
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部