期刊文献+

图文识别技术综述 被引量:14

Overview of image and text recognition technology
在线阅读 下载PDF
导出
摘要 本文概括性的介绍了图文识别所涉及的技术。首先介绍了图文识别的背景知识,包括应用领域、技术难点及挑战和系统实施流程等;其次介绍了图文识别技术的预处理方法及流程,包括旋转校正、线检测、特征匹配、字符轮廓提取及分割、OCR识别流程;接着介绍了图文识别过程中常用的特征提取基础网络和检测网络,以及它们的场景适配问题;然后介绍了近年来出现的各种图文检测深度学习网络、图文识别深度学习网络、端到端图文检测与识别深度学习网络,并分析了各类检测和识别网络的网络架构、算法思路及其特点;最后介绍了公开的图文识别训练、测试数据集以及不同算法的性能比较。 This paper gives a general introduction for the technology of image to text recognition.Firstly,the background of the image to text recognition is introduced,including application scenarios、technical difficulties and challenges、and system implementation process.Secondly,the preprocessing methods and processes of image to text recognition technology are introduced,including rotation correction、line detection、feature matching、extraction and segmentation of the character contour、and the whole processing of the OCR(Optical Character Recognition).Thirdly,we introduce the basic feature extraction network and the detection network framework commonly used in the process of image to text recognition;also,we discuss about the problem of scene adaptation when they are applied to the task of image to text recognition.Then,we introduce the various text detection deep learning network,text recognition deep learning network,end-to-end text detection and recognition network that have emerged in recent years;at the same time,we analyze the algorithm ideas and characteristics of various detection and recognition networks.Finally,we list the open data sets used in the domain of image to text recognition and performance comparison of different algorithm.
作者 牛小明 毕可骏 唐军 NIU Xiaoming;BI Kejun;TANG Jun(Sichuan Changhong Electric Co.,Ltd.,Software&Service Center,Mianyang 621000,China)
出处 《中国体视学与图像分析》 2019年第3期241-256,共16页 Chinese Journal of Stereology and Image Analysis
关键词 图文检测 文本识别 端到端识别 image to text detection text recognition end to end recognition
作者简介 牛小明(1983-),男(汉),硕士,四川长虹电器股份有限公司资深专家。E-mail:xiaoming1.niu@changhong.com
  • 相关文献

参考文献3

二级参考文献7

  • 1张黔,胡庆,杨静宇,蒋韧.统计和结构模式识别方法结合的多特征印鉴真伪鉴别方法[J].计算机学报,1995,18(3):190-198. 被引量:12
  • 2Jain R, Kasturi R, Schunck B G. Machine Vision [M]. New York:McGraw-Hill Inc, 1995.
  • 3章毓晋.图像工程-图像处理与分析 [M].北京:清华大学出版社,1999..
  • 4Canny J F. A computational approach to edge detection [J]. IEEE Trans Pattern Analysis and Machine Intelligence, 1986, 8(6):679-698.
  • 5Ballard D H. Generalizing the Hough transform to detect arbitrary shapes [J]. Pattern Recognition, 1981, 13(2):111-122.
  • 6Tupin F, Maitre H, Mangin J, et al. Detection of linear features in SAR images:Application to road network extraction [J]. IEEE Trans Geoscience Remote Sensing, 1998, 36(2):434-453.
  • 7Fischler M A, Tenenbaum J M, Wolf H C. Detection of roads and linear structures in low resolution aerial imagery using a multisource knowledge integration technique [J]. Comput Graph Image Processing, 1981, 15(3):201-223.

共引文献36

同被引文献97

引证文献14

二级引证文献60

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部