Abstract
The digitization of humanity's cultural heritage is an important part of digital library initiatives. Digitized handwritten Chinese antique books, however, still lack effective means of content retrieval. This paper proposes a computer-based content retrieval method for antique books built on visual similarity and develops its key supporting techniques. The method extracts morphological, global positional, and page features from visual objects in page images, organizes the feature space formed by the morphological features with a high-dimensional spatial index, and uses range search to quickly retrieve all objects visually similar to the query sample. A precision-control parameter dynamically adjusts the mapping from morphology to semantics, and a constraint verification technique improves retrieval precision for a group of related objects. A working prototype system confirms the feasibility of the method and shows that content retrieval can be performed automatically and directly on digitized page images.
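The retrieval step described in the abstract (index the morphological feature vectors, then range-search around a query sample) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not name the index structure, the distance measure, or the feature layout, so a small k-d tree with Euclidean distance stands in for the paper's high-dimensional index, and the `radius` argument plays the role of the precision-control parameter. The names `Node`, `build`, and `range_search` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple

Vector = Tuple[float, ...]


@dataclass
class Node:
    point: Vector                  # morphological feature vector (assumed layout)
    label: str                     # identifier of the visual object on a page
    axis: int                      # splitting dimension at this node
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build(items: List[Tuple[Vector, str]], depth: int = 0) -> Optional[Node]:
    """Build a k-d tree over (feature_vector, label) pairs."""
    if not items:
        return None
    axis = depth % len(items[0][0])
    items = sorted(items, key=lambda it: it[0][axis])
    mid = len(items) // 2
    point, label = items[mid]
    return Node(point, label, axis,
                build(items[:mid], depth + 1),
                build(items[mid + 1:], depth + 1))


def range_search(node: Optional[Node], query: Vector, radius: float,
                 out: Optional[List[str]] = None) -> List[str]:
    """Collect labels of all objects within `radius` of `query`.

    A smaller radius accepts only near-identical shapes; a larger one
    admits looser visual matches (assumed reading of the precision parameter).
    """
    if out is None:
        out = []
    if node is None:
        return out
    if math.dist(node.point, query) <= radius:
        out.append(node.label)
    diff = query[node.axis] - node.point[node.axis]
    # Descend into a subtree only if the search ball can reach it.
    if diff <= radius:
        range_search(node.left, query, radius, out)
    if diff >= -radius:
        range_search(node.right, query, radius, out)
    return out
```

For example, after `index = build([((0.0, 0.0), "a"), ((0.1, 0.0), "b"), ((1.0, 1.0), "c")])`, calling `range_search(index, (0.0, 0.0), 0.15)` collects the objects labelled "a" and "b", while widening the radius admits more distant candidates. The constraint verification stage mentioned in the abstract is not sketched here because the abstract gives no detail on it.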
Source
《软件学报》
EI
CSCD
PKU Core Journal (北大核心)
2001, Issue 9, pp. 1336-1342 (7 pages)
Journal of Software
Funding
National Natural Science Foundation of China (69933010)
Shanghai Natural Science Foundation (00ZD14006)
Keywords
Image retrieval
Feature extraction
Spatial indexing
Antique book retrieval
Chinese antique books
Computer-based retrieval
Visual similarity
Constraint theory
Digital libraries
Image processing
Indexing (of information)
Semantics
Software prototyping