Fusing Geometrical and Visual Information via Superpoints for the Semantic Segmentation of 3D Road Scenes 被引量：8

Fusing Geometrical and Visual Information via Superpoints for the Semantic Segmentation of 3D Road Scenes

导出

摘要 This paper addresses the problem of the semantic segmentation of large-scale 3D road scenes by incorporating the complementary advantages of point clouds and images.To make full use of geometrical and visual information,this paper extracts 3D geometric features from a point cloud using a deep neural network for 3D semantic segmentation and extracts 2D visual features from images using a Convolutional Neural Network(CNN)for 2D semantic segmentation.In order to bridge the features of the two modalities,this paper uses superpoints as an intermediate representation to connect the 2D features with the 3D features.A superpoint-based pooling method is proposed to fuse the features from the two different modalities for joint learning.To evaluate the approach,the paper generates 3D scenes from the Virtual KITTI dataset.The results of the experiments demonstrate that the proposed approach is capable of segmenting large-scale 3D road scenes based on the compact and semantically homogeneous superpoints,and that it achieves considerable improvements over the 2D image and 3D point cloud semantic segmentation methods. This paper addresses the problem of the semantic segmentation of large-scale 3D road scenes by incorporating the complementary advantages of point clouds and images.To make full use of geometrical and visual information, this paper extracts 3D geometric features from a point cloud using a deep neural network for 3D semantic segmentation and extracts 2D visual features from images using a Convolutional Neural Network(CNN)for 2D semantic segmentation.In order to bridge the features of the two modalities, this paper uses superpoints as an intermediate representation to connect the 2D features with the 3D features.A superpoint-based pooling method is proposed to fuse the features from the two different modalities for joint learning.To evaluate the approach, the paper generates 3D scenes from the Virtual KITTI dataset.The results of the experiments demonstrate that the proposed approach is capable of segmenting large-scale 3D road scenes based on the compact and semantically homogeneous superpoints, and that it achieves considerable improvements over the 2D image and 3D point cloud semantic segmentation methods.

作者 Liuyuan Deng Ming Yang Zhidong Liang Yuesheng He Chunxiang Wang

机构地区 the Department of Automation the Key Laboratory of System Control and Information Processing the Research Institute of Robotics

出处《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2020年第4期498-507,共10页 清华大学学报（自然科学版（英文版）

基金 supported by the National Natural Science Foundation of China(No.U1764264/61873165) Shanghai Automotive Industry Science and Technology Development Foundation(No.1807) the International Chair on Automated Driving of Ground Vehicle.

关键词 SCENE understanding point cloud SEMANTIC segmentation MULTI-MODAL information fusion deep learning scene understanding point cloud semantic segmentation multi-modal information fusion deep learning

分类号 U495 [交通运输工程—交通运输规划与管理]

作者简介 Liuyuan Deng,received the BS degree from the University of Electronic Science and Technology of China,Chengdu,China,in 2013.He is currently a PhD candidate at the Department of Automation,Shanghai Jiao Tong University,Shanghai,China.His research interests include computer vision,deep learning,semantic scene understanding,and visual localization for intelligent driving.E-mail:lydeng@sjtu.edu.cn;Corresponding author:Ming Yang,received the MS and PhD degrees from Tsinghua University,Beijing,China,in 1999 and 2003,respectively.He is currently a full tenure professor at Shanghai Jiao Tong University,director of the Department of Automation,and the deputy director of the Innovation Center of Intelligent Connected Vehicles.He has been working in the field of intelligent vehicles for more than 20 years.E-mail:MingYang@sjtu.edu.cn;Zhidong Liang,is currently a master student in Shanghai Jiao Tong University.He received the BS degree from the same institution in 2017.His research interests include deep learning,3D semantic instance segmentation,and autonomous driving.Email:709800954@qq.com;Yuesheng He,received the PhD degree from the Department of Computer Science,Hong Kong Baptist University,Hong Kong,in 2012.He held a post-doctoral position at the Department of Computer and Information Science,Faculty of Science and Technology,University of Macao.He is currently a research fellow with the Department of Automation,Shanghai Jiao Tong University.His research areas are machine learning,computer graphics,virtual reality,computer 3D animation,computer image,and signal processing.E-mail:heyuesh@sjtu.edu.cn;Chunxiang Wang,received the PhD degree from Harbin Institute of Technology,China,in 1999.She is currently an associate professor with the Department of Automation,Shanghai Jiao Tong University,Shanghai,China.Her research interests include autonomous driving,assistant driving,and mobile robots,etc.She has been working in the field of intelligent vehicles for more than 10 years and participated in several related research projects,such as European CyberC3 project,ITER transfer cask project,etc.E-mail:wangcx@sjtu.edu.cn

引文网络
相关文献

同被引文献52

1周继苗,李必军,陈世增.一种多层特征融合的道路场景实时分割方法[J].测绘通报,2020(1):10-15. 被引量：8
2唐方勤,任爱珠.Agent-Based Evacuation Model Incorporating Fire Scene and Building Geometry[J].Tsinghua Science and Technology,2008,13(5):708-714. 被引量：5
3张青松,赵国敏,刘金兰.Performance-Based Design for Large Crowd Venue Control Using a Multi-Agent Model[J].Tsinghua Science and Technology,2009,14(3):352-359. 被引量：2
4杨宇鹏,赵卫东,王志成,陈刚.基于图论的Normalized Cut图像分割方法研究[J].计算机与现代化,2010(1):113-116. 被引量：6
5刘松涛,殷福亮.基于图割的图像分割方法及其新进展[J].自动化学报,2012,38(6):911-922. 被引量：143
6刘磊,石志国,宿浩茹,李红.基于高阶马尔可夫随机场的图像分割[J].计算机研究与发展,2013,50(9):1933-1942. 被引量：13
7王春瑶,陈俊周,李炜.超像素分割算法研究综述[J].计算机应用研究,2014,31(1):6-12. 被引量：117
8高如新,王俊孟.基于双目立体视觉的煤体积测量[J].计算机系统应用,2014,23(5):126-133. 被引量：20
9严云洋,瞿学新,朱全银,李翔,赵阳.基于离群点检测的分类结果置信度的度量方法[J].南京大学学报（自然科学版）,2019,55(1):102-109. 被引量：4
10熊胜军,赵飞,赵恒,敖磊.线结构光自同步扫描三维形貌测量系统[J].光子学报,2014,43(11):147-152. 被引量：9

引证文献8

1Ling Zhang,Jianchao Liu,Fangxing Shang,Gang Li,Juming Zhao,Yueqin Zhang.Robust Segmentation Method for Noisy Images Based on an Unsupervised Denosing Filter[J].Tsinghua Science and Technology,2021,26(5):736-748. 被引量：3
2王龙飞,严春满.道路场景语义分割综述[J].激光与光电子学进展,2021,58(12):36-58. 被引量：25
3崔峥,王增才,张杰,闫明,王普圣.基于三维点云分割的煤堆体积计算方法研究[J].中国矿业,2022,31(4):96-101. 被引量：8
4Wenhan Wu,Maoyin Chen,Jinghai Li,Binglu Liu,Xiaolu Wang,Xiaoping Zheng.Visual Information Based Social Force Model for Crowd Evacuation[J].Tsinghua Science and Technology,2022,27(3):619-629. 被引量：1
5钟志峰,何佳伟,侯瑞洁,晏阳天,刘梦娜,赵明俊.改进UNet的轻量化道路图像语义分割算法[J].现代电子技术,2022,45(19):71-76. 被引量：7
6宋亮,谷玉海,石文天.基于改进BiSeNet的非结构化道路分割算法研究[J].应用光学,2023,44(3):556-564. 被引量：3
7Chao Qi,Jianqin Yin,Zhicheng Zhang,Jin Tang.Dynamic Scene Graph Generation of Point Clouds with Structural Representation Learning[J].Tsinghua Science and Technology,2024,29(1):232-243.
8毛亚纯,杨哲玺,曹旺,齐迹.基于多特征约束的露天采场道路点云提取[J].东北大学学报（自然科学版）,2024,45(9):1326-1333.

二级引证文献47

1王飞,夏建超,闫飞.基于三维激光雷达的料堆检测与高度分布建模[J].冶金自动化,2023,47(S01):213-217.
2贺海涛,王佳豪,张海峰,荣耀,崔耀.基于U-Net的放煤状态控制关键技术研究[J].煤炭科学技术,2022,50(S02):237-243. 被引量：5
3王伯涛,周福强,吴国新,王少红.基于改进YOLOv7的输电线路绝缘子识别检测研究[J].电子测量技术,2023,46(23):127-134. 被引量：2
4陶伟.基于时序图像语义分割的隧道拥堵检测方法探析[J].自动化应用,2021(4):144-147.
5谷湘煜,刘晓熠,周仁彬.多特征融合的道路场景语义分割算法[J].科学技术与工程,2021,21(33):14251-14257. 被引量：5
6龚志力,谷玉海,朱腾腾,石文天.融合注意力机制与轻量化DeepLabv3+的非结构化道路识别[J].微电子学与计算机,2022,39(2):26-33. 被引量：5
7姚燕,胡立坤,郭军.基于改进DeepLabv3+网络的轻量级语义分割算法[J].激光与光电子学进展,2022,59(4):192-199. 被引量：9
8闫志恒,任超,李毅,徐宁辉,张胜国.一种改进的融合不同尺度特征的遥感影像道路提取新方法[J].测绘通报,2022(9):58-62. 被引量：4
9Junfen Chen,Jie Han,Xiangjie Meng,Yan Li,Haifeng Li.Graph Convolutional Network Combined with Semantic Feature Guidance for Deep Clustering[J].Tsinghua Science and Technology,2022,27(5):855-868. 被引量：2
10亢洁,田野,杨刚.基于改进SSD的人群异常行为检测算法研究[J].红外技术,2022,44(12):1316-1323. 被引量：5

1谢梦,刘伟,李二珠,杨梦圆,王晓檀.深度卷积神经网络支持下的遥感影像语义分割[J].测绘通报,2020(5):36-42. 被引量：8
2Bilal Gondal,Haider Haider,Yuga Komaki,Fukiko Komaki,Dejan Micic,David T Rubin,Atsushi Sakuraba.Efficacy of various endoscopic modalities in detecting dysplasia in ulcerative colitis:A systematic review and network meta-analysis[J].World Journal of Gastrointestinal Endoscopy,2020,12(5):159-171. 被引量：1
3马琳杰,刘伟嵬,唐梓珏,王灏,杨征宇,孙慧,王奉涛.一种基于PPCNN的金属激光熔化沉积熔池状态识别方法[J].内燃机与配件,2020(10):23-26. 被引量：1
4杨奎河,张超.基于卷积神经网络的评论监管模型的设计与实现[J].信息通信,2020(4):39-40.
5姜洪权,贺帅,高建民,王荣喜,高智勇,王晓桥,夏锋社,程雷.一种改进卷积神经网络模型的焊缝缺陷识别方法[J].机械工程学报,2020,56(8):235-242. 被引量：33
6Mo-Mo Sun,Jie Shen.Positron emission tomography/computed tomography findings of multiple cystic lymphangiomas in an adult: A case report[J].World Journal of Clinical Cases,2020,8(10):1973-1978.
7Abdulghani Saadi,Arun Kanmanthareddy,Mahesh Anantha-Narayanan,Karen Hardy,Mark Williams,Venkata M Alla.Access to smart devices and utilization of online health resources among older cardiac rehabilitation participants[J].World Journal of Cardiology,2020,12(5):203-209.

Tsinghua Science and Technology

2020年第4期

浏览历史

内容加载中请稍等...

Fusing Geometrical and Visual Information via Superpoints for the Semantic Segmentation of 3D Road Scenes 被引量：8

同被引文献52

引证文献8

二级引证文献47

相关作者

相关机构

相关主题

浏览历史