Semantic segmentation is a crucial step for document understanding.In this paper,an NVIDIA Jetson Nano-based platform is applied for implementing semantic segmentation for teaching artificial intelligence concepts and...Semantic segmentation is a crucial step for document understanding.In this paper,an NVIDIA Jetson Nano-based platform is applied for implementing semantic segmentation for teaching artificial intelligence concepts and programming.To extract semantic structures from document images,we present an end-to-end dilated convolution network architecture.Dilated convolutions have well-known advantages for extracting multi-scale context information without losing spatial resolution.Our model utilizes dilated convolutions with residual network to represent the image features and predicting pixel labels.The convolution part works as feature extractor to obtain multidimensional and hierarchical image features.The consecutive deconvolution is used for producing full resolution segmentation prediction.The probability of each pixel decides its predefined semantic class label.To understand segmentation granularity,we compare performances at three different levels.From fine grained class to coarse class levels,the proposed dilated convolution network architecture is evaluated on three document datasets.The experimental results have shown that both semantic data distribution imbalance and network depth are import factors that influence the document’s semantic segmentation performances.The research is aimed at offering an education resource for teaching artificial intelligence concepts and techniques.展开更多
To quickly find documents with high similarity in existing documentation sets, fingerprint group merging retrieval algorithm is proposed to address both sides of the problem:a given similarity threshold could not be t...To quickly find documents with high similarity in existing documentation sets, fingerprint group merging retrieval algorithm is proposed to address both sides of the problem:a given similarity threshold could not be too low and fewer fingerprints could lead to low accuracy. It can be proved that the efficiency of similarity retrieval is improved by fingerprint group merging retrieval algorithm with lower similarity threshold. Experiments with the lower similarity threshold r=0.7 and high fingerprint bits k=400 demonstrate that the CPU time-consuming cost decreases from 1 921 s to 273 s. Theoretical analysis and experimental results verify the effectiveness of this method.展开更多
The document image segmentation is very useful for printing, faxing and data processing. An algorithm is developed for segmenting and classifying document image. Feature used for classification is based on the histogr...The document image segmentation is very useful for printing, faxing and data processing. An algorithm is developed for segmenting and classifying document image. Feature used for classification is based on the histogram distribution pattern of different image classes. The important attribute of the algorithm is using wavelet correlation image to enhance raw image's pattern, so the classification accuracy is improved. In this paper document image is divided into four types; background, photo, text and graph. Firstly, the document image background has been distingusished easily by former normally method;secondly, three image types will be distinguished by their typical histograms, in order to make histograms feature clearer, each resolution's HH wavelet subimage is used to add to the raw image at their resolution. At last, the photo, text and praph have been devided according to how the feature fit to the Laplacian distrbution by 2 and L . Simulations show that classification accuracy is significantly improved. The comparison with related shows that our algorithm provides both lower classification error rates and better visual results.展开更多
Valid as folk law,Ming Bai document is the code of conduct for the villagers,which is applied in Danjiang T own,Langde T own and Wangfeng T ownship in Leishan county,Qiandongnan Miao and Dong autonomous prefecture in ...Valid as folk law,Ming Bai document is the code of conduct for the villagers,which is applied in Danjiang T own,Langde T own and Wangfeng T ownship in Leishan county,Qiandongnan Miao and Dong autonomous prefecture in Guizhou province.It refers to comprehensive administration,safety construction,safety production,fire fighting,traffic safety,prohibition of opium,fire protection of forest,family plan,and traditional festival etc. Mostly they are posted on the conspicuous wooden buildings and bricks wall of villages,Ming Bai document plays an irreplaceable role in adjusting local social relations. Moreover,its legal culture and the value of legal sociology are worthy of having a discussion and research.展开更多
自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作...自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作机制,并通过出台一系列的指南、指导性文件和推荐性技术文件,规范救援能力和队伍建设,强化国际救援协调和现场救援的效率。该文系统介绍了INSARAG标准和技术文件组成体系架构,并阐述了各标准及技术文件的出台背景、主要内容及对中国的搜救队伍建设的推动作用,并讨论其对我国灾害救援工作的启示与借鉴意义。展开更多
基金Project(61806107)supported by the National Natural Science Foundation of ChinaProject supported by the Shandong Key Laboratory of Wisdom Mine Information Technology,ChinaProject supported by the Opening Project of State Key Laboratory of Digital Publishing Technology,China。
文摘Semantic segmentation is a crucial step for document understanding.In this paper,an NVIDIA Jetson Nano-based platform is applied for implementing semantic segmentation for teaching artificial intelligence concepts and programming.To extract semantic structures from document images,we present an end-to-end dilated convolution network architecture.Dilated convolutions have well-known advantages for extracting multi-scale context information without losing spatial resolution.Our model utilizes dilated convolutions with residual network to represent the image features and predicting pixel labels.The convolution part works as feature extractor to obtain multidimensional and hierarchical image features.The consecutive deconvolution is used for producing full resolution segmentation prediction.The probability of each pixel decides its predefined semantic class label.To understand segmentation granularity,we compare performances at three different levels.From fine grained class to coarse class levels,the proposed dilated convolution network architecture is evaluated on three document datasets.The experimental results have shown that both semantic data distribution imbalance and network depth are import factors that influence the document’s semantic segmentation performances.The research is aimed at offering an education resource for teaching artificial intelligence concepts and techniques.
基金Project(60873081) supported by the National Natural Science Foundation of ChinaProject(NCET-10-0787) supported by the Program for New Century Excellent Talents in University, ChinaProject(11JJ1012) supported by the Natural Science Foundation of Hunan Province, China
文摘To quickly find documents with high similarity in existing documentation sets, fingerprint group merging retrieval algorithm is proposed to address both sides of the problem:a given similarity threshold could not be too low and fewer fingerprints could lead to low accuracy. It can be proved that the efficiency of similarity retrieval is improved by fingerprint group merging retrieval algorithm with lower similarity threshold. Experiments with the lower similarity threshold r=0.7 and high fingerprint bits k=400 demonstrate that the CPU time-consuming cost decreases from 1 921 s to 273 s. Theoretical analysis and experimental results verify the effectiveness of this method.
文摘The document image segmentation is very useful for printing, faxing and data processing. An algorithm is developed for segmenting and classifying document image. Feature used for classification is based on the histogram distribution pattern of different image classes. The important attribute of the algorithm is using wavelet correlation image to enhance raw image's pattern, so the classification accuracy is improved. In this paper document image is divided into four types; background, photo, text and graph. Firstly, the document image background has been distingusished easily by former normally method;secondly, three image types will be distinguished by their typical histograms, in order to make histograms feature clearer, each resolution's HH wavelet subimage is used to add to the raw image at their resolution. At last, the photo, text and praph have been devided according to how the feature fit to the Laplacian distrbution by 2 and L . Simulations show that classification accuracy is significantly improved. The comparison with related shows that our algorithm provides both lower classification error rates and better visual results.
基金Supported by National High Technology Research and Development Program of China (863 Program) (2008AA01Z144) National Natural Science Foundation of China (60803093 60975055)
基金the Li Jianjun presided over Kong Xuechang tender “Yangming culture and modern social governance”(KXT ZD201507)Sub-topics “Yangming culture and the rule of law social construction research” part of the research results
文摘Valid as folk law,Ming Bai document is the code of conduct for the villagers,which is applied in Danjiang T own,Langde T own and Wangfeng T ownship in Leishan county,Qiandongnan Miao and Dong autonomous prefecture in Guizhou province.It refers to comprehensive administration,safety construction,safety production,fire fighting,traffic safety,prohibition of opium,fire protection of forest,family plan,and traditional festival etc. Mostly they are posted on the conspicuous wooden buildings and bricks wall of villages,Ming Bai document plays an irreplaceable role in adjusting local social relations. Moreover,its legal culture and the value of legal sociology are worthy of having a discussion and research.
文摘自联合国国际搜索与救援咨询团(The International Search and Rescue Advisory Group,简称INSARAG)成立30多年来,通过不断总结巨灾国际救援经验,形成了一套覆盖国际救援准备阶段、行动阶段到撤离阶段的国际救援全流程全要素的协调工作机制,并通过出台一系列的指南、指导性文件和推荐性技术文件,规范救援能力和队伍建设,强化国际救援协调和现场救援的效率。该文系统介绍了INSARAG标准和技术文件组成体系架构,并阐述了各标准及技术文件的出台背景、主要内容及对中国的搜救队伍建设的推动作用,并讨论其对我国灾害救援工作的启示与借鉴意义。