This paper presents a novel efficient semantic image classification algorithm for high-level feature indexing of high-dimension image database. Experiments show that the algorithm performs well. The size of the train ...This paper presents a novel efficient semantic image classification algorithm for high-level feature indexing of high-dimension image database. Experiments show that the algorithm performs well. The size of the train set and the test set is 7 537 and 5 000 respectively. Based on this theory, another ground is built with 12,000 images, which are divided into three classes: city, landscape and person, the total result of the classifications is 88.92%, meanwhile, some preliminary results are presented for image understanding based on semantic image classification and low level features. The groundtruth for the experiments is built with the images from Corel database, photos and some famous face databases.展开更多
The problem considered in this paper is how to detect the degree of similarity in the content of digital images useful in image retrieval,i.e.,to what extent is the content of a query image similar to content of other...The problem considered in this paper is how to detect the degree of similarity in the content of digital images useful in image retrieval,i.e.,to what extent is the content of a query image similar to content of other images.The solution to this problem results from the detection of subsets that are rough sets contained in covers of digital images determined by perceptual tolerance relations(PTRs).Such relations are defined within the context of perceptual representative spaces that hearken back to work by J.H.Poincare on representative spaces as models of physical continua.Classes determined by a PTR provide content useful in content-based image retrieval(CBIR).In addition,tolerance classes provide a means of determining when subsets of image covers are tolerance rough sets(TRSs).It is the nearness of TRSs present in image tolerance spaces that provide a promising approach to CBIR,especially in cases such as satellite images or aircraft identification where there are subtle differences between pairs of digital images,making it difficult to quantify the similarities between such images.The contribution of this article is the introduction of the nearness of tolerance rough sets as an effective means of measuring digital image similarities and,as a significant consequence,successfully carrying out CBIR.展开更多
A novel image retrieval approach based on color features and anisotropic directional information is proposed for content based image retrieval systems (CBIR). The color feature is described by the color histogram ...A novel image retrieval approach based on color features and anisotropic directional information is proposed for content based image retrieval systems (CBIR). The color feature is described by the color histogram (CH), which is translation and rotation invariant. However, the CH does not contain spatial information which is very important for the image retrieval. To overcome this shortcoming, the subband energy of the lifting directionlet transform (L-DT) is proposed to describe the directional information, in which L-DT is characterized by multi-direction and anisotropic basis functions compared with the wavelet transform. A global similarity measure is designed to implement the fusion of both color feature and anisotropic directionality for the retrieval process. The retrieval experiments using a set of COREL images demonstrate that the higher query precision and better visual effect can be achieved.展开更多
Two lines of image representation based on multiple features fusion demonstrate excellent performance in image retrieval.However,there are some problems in both of them:1)the methods defining directly texture in color...Two lines of image representation based on multiple features fusion demonstrate excellent performance in image retrieval.However,there are some problems in both of them:1)the methods defining directly texture in color space put more emphasis on color than texture feature;2)the methods extract several features respectively and combine them into a vector,in which bad features may lead to worse performance after combining directly good and bad features.To address the problems above,a novel hybrid framework for color image retrieval through combination of local and global features achieves higher retrieval precision.The bag-of-visual words(BoW)models and color intensity-based local difference patterns(CILDP)are exploited to capture local and global features of an image.The proposed fusion framework combines the ranking results of BoW and CILDP through graph-based density method.The performance of our proposed framework in terms of average precision on Corel-1K database is86.26%,and it improves the average precision by approximately6.68%and12.53%over CILDP and BoW,respectively.Extensive experiments on different databases demonstrate the effectiveness of the proposed framework for image retrieval.展开更多
To realize content-hased retrieval of large image databases, it is required to develop an efficient index and retrieval scheme. This paper proposes an index algorithm of clustering called CMA, which supports fast retr...To realize content-hased retrieval of large image databases, it is required to develop an efficient index and retrieval scheme. This paper proposes an index algorithm of clustering called CMA, which supports fast retrieval of large image databases. CMA takes advantages of k-means and self-adaptive algorithms. It is simple and works without any user interactions. There are two main stages in this algorithm. In the first stage, it classifies images in a database into several clusters, and automatically gets the necessary parameters for the next stage-k-means iteration. The CMA algorithm is tested on a large database of more than ten thousand images and compare it with k-means algorithm. Experimental results show that this algorithm is effective in both precision and retrieval time.展开更多
A novel content based image retrieval (CBIR) algorithmusing relevant feedback is presented. The proposed frameworkhas three major contributions: a novel feature descriptor calledcolor spectral histogram (CSH) to ...A novel content based image retrieval (CBIR) algorithmusing relevant feedback is presented. The proposed frameworkhas three major contributions: a novel feature descriptor calledcolor spectral histogram (CSH) to measure the similarity betweenimages; two-dimensional matrix based indexing approach proposedfor short-term learning (STL); and long-term learning (LTL).In general, image similarities are measured from feature representationwhich includes color quantization, texture, color, shapeand edges. However, CSH can describe the image feature onlywith the histogram. Typically the image retrieval process starts byfinding the similarity between the query image and the imagesin the database; the major computation involved here is that theselection of top ranking images requires a sorting algorithm to beemployed at least with the lower bound of O(n log n). A 2D matrixbased indexing of images can enormously reduce the searchtime in STL. The same structure is used for LTL with an aim toreduce the amount of log to be maintained. The performance ofthe proposed framework is analyzed and compared with the existingapproaches, the quantified results indicates that the proposedfeature descriptor is more effectual than the existing feature descriptorsthat were originally developed for CBIR. In terms of STL,the proposed 2D matrix based indexing minimizes the computationeffort for retrieving similar images and for LTL, the proposed algorithmtakes minimum log information than the existing approaches.展开更多
Traditional inverse synthetic aperture radar(ISAR)imaging methods for maneuvering targets have low resolution and poor capability of noise suppression. An ISAR imaging method of maneuvering targets based on phase retr...Traditional inverse synthetic aperture radar(ISAR)imaging methods for maneuvering targets have low resolution and poor capability of noise suppression. An ISAR imaging method of maneuvering targets based on phase retrieval is proposed,which can provide a high-resolution and focused map of the spatial distribution of scatterers on the target. According to theoretical derivation, the modulus of raw data from the maneuvering target is not affected by radial motion components for ISAR imaging system, so the phase retrieval algorithm can be used for ISAR imaging problems. However, the traditional phase retrieval algorithm will be not applicable to ISAR imaging under the condition of random noise. To solve this problem, an algorithm is put forward based on the range Doppler(RD) algorithm and oversampling smoothness(OSS) phase retrieval algorithm. The algorithm captures the target information in order to reduce the influence of the random phase on ISAR echoes, and then applies OSS for focusing imaging based on prior information of the RD algorithm. The simulated results demonstrate the validity of this algorithm, which cannot only obtain high resolution imaging for high speed maneuvering targets under the condition of random noise, but also substantially improve the success rate of the phase retrieval algorithm.展开更多
随着社交网络的普及和多媒体数据的急剧增长,有效的跨模态检索引起了人们越来越多的关注.由于哈希有效的检索效率和低存储成本,其被广泛用于跨模态检索任务中.然而,这些基于深度学习的跨模态哈希检索方法大多数是利用图像网络和文本网...随着社交网络的普及和多媒体数据的急剧增长,有效的跨模态检索引起了人们越来越多的关注.由于哈希有效的检索效率和低存储成本,其被广泛用于跨模态检索任务中.然而,这些基于深度学习的跨模态哈希检索方法大多数是利用图像网络和文本网络各自生成对应模态的哈希码,难以获得更加有效的哈希码,无法进一步减小不同模态数据之间的模态鸿沟.为了更好地提高跨模态哈希检索的性能,本文提出了一种基于迁移知识的跨模态双重哈希(Cross-modal Dual Hashing based on Transfer Knowledge,CDHTK).CDHTK通过结合图像网络、知识迁移网络以及文本网络进行跨模态哈希检索任务.对于图像模态,CDHTK融合图像网络和知识迁移网络各自生成的哈希码,进而生成具有判别性的图像哈希码;对于文本模态,CDHTK融合文本网络和知识迁移网络各自生成的哈希码,从而生成有效的文本哈希码.CDHTK通过采用预测标签的交叉熵损失、生成哈希码的联合三元组量化损失以及迁移知识的差分损失来共同优化哈希码的生成过程,从而提高模型的检索效果,在2个常用的数据集(IAPR TC-12,MIR-Flickr 25K)上进行的实验验证了CDHTK的有效性,比当前最先进的跨模态哈希方法(Adaptive Label correlation based asymm Etric Cross-modal Hashing,ALECH)分别高出6.82%和5.13%.展开更多
文摘This paper presents a novel efficient semantic image classification algorithm for high-level feature indexing of high-dimension image database. Experiments show that the algorithm performs well. The size of the train set and the test set is 7 537 and 5 000 respectively. Based on this theory, another ground is built with 12,000 images, which are divided into three classes: city, landscape and person, the total result of the classifications is 88.92%, meanwhile, some preliminary results are presented for image understanding based on semantic image classification and low level features. The groundtruth for the experiments is built with the images from Corel database, photos and some famous face databases.
基金supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) research grants 194376 and 185986Manitoba Centre of Excellence Fund(MCEF) grant and Canadian Network Centre of Excellence(NCE) and Canadian Arthritis Network(CAN) grant SRI-BIO-05.
文摘The problem considered in this paper is how to detect the degree of similarity in the content of digital images useful in image retrieval,i.e.,to what extent is the content of a query image similar to content of other images.The solution to this problem results from the detection of subsets that are rough sets contained in covers of digital images determined by perceptual tolerance relations(PTRs).Such relations are defined within the context of perceptual representative spaces that hearken back to work by J.H.Poincare on representative spaces as models of physical continua.Classes determined by a PTR provide content useful in content-based image retrieval(CBIR).In addition,tolerance classes provide a means of determining when subsets of image covers are tolerance rough sets(TRSs).It is the nearness of TRSs present in image tolerance spaces that provide a promising approach to CBIR,especially in cases such as satellite images or aircraft identification where there are subtle differences between pairs of digital images,making it difficult to quantify the similarities between such images.The contribution of this article is the introduction of the nearness of tolerance rough sets as an effective means of measuring digital image similarities and,as a significant consequence,successfully carrying out CBIR.
基金supported by the National High Technology Research and Development Program of China (863 Program) (2007AA12Z1362007AA12Z223)+2 种基金the National Basic Research Program of China (973Program) (2006CB705707)the National Natural Science Foundation of China (60672126, 60607010)the Program for Cheung Kong Scholars and Innovative Research Team in University (IRT0645)
文摘A novel image retrieval approach based on color features and anisotropic directional information is proposed for content based image retrieval systems (CBIR). The color feature is described by the color histogram (CH), which is translation and rotation invariant. However, the CH does not contain spatial information which is very important for the image retrieval. To overcome this shortcoming, the subband energy of the lifting directionlet transform (L-DT) is proposed to describe the directional information, in which L-DT is characterized by multi-direction and anisotropic basis functions compared with the wavelet transform. A global similarity measure is designed to implement the fusion of both color feature and anisotropic directionality for the retrieval process. The retrieval experiments using a set of COREL images demonstrate that the higher query precision and better visual effect can be achieved.
基金Projects(61370200,61672130,61602082) supported by the National Natural Science Foundation of ChinaProject(1721203049-1) supported by the Science and Technology Research and Development Plan Project of Handan,Hebei Province,China
文摘Two lines of image representation based on multiple features fusion demonstrate excellent performance in image retrieval.However,there are some problems in both of them:1)the methods defining directly texture in color space put more emphasis on color than texture feature;2)the methods extract several features respectively and combine them into a vector,in which bad features may lead to worse performance after combining directly good and bad features.To address the problems above,a novel hybrid framework for color image retrieval through combination of local and global features achieves higher retrieval precision.The bag-of-visual words(BoW)models and color intensity-based local difference patterns(CILDP)are exploited to capture local and global features of an image.The proposed fusion framework combines the ranking results of BoW and CILDP through graph-based density method.The performance of our proposed framework in terms of average precision on Corel-1K database is86.26%,and it improves the average precision by approximately6.68%and12.53%over CILDP and BoW,respectively.Extensive experiments on different databases demonstrate the effectiveness of the proposed framework for image retrieval.
基金This project was supported by National High Tech Foundation of 863 (2001AA115123)
文摘To realize content-hased retrieval of large image databases, it is required to develop an efficient index and retrieval scheme. This paper proposes an index algorithm of clustering called CMA, which supports fast retrieval of large image databases. CMA takes advantages of k-means and self-adaptive algorithms. It is simple and works without any user interactions. There are two main stages in this algorithm. In the first stage, it classifies images in a database into several clusters, and automatically gets the necessary parameters for the next stage-k-means iteration. The CMA algorithm is tested on a large database of more than ten thousand images and compare it with k-means algorithm. Experimental results show that this algorithm is effective in both precision and retrieval time.
文摘A novel content based image retrieval (CBIR) algorithmusing relevant feedback is presented. The proposed frameworkhas three major contributions: a novel feature descriptor calledcolor spectral histogram (CSH) to measure the similarity betweenimages; two-dimensional matrix based indexing approach proposedfor short-term learning (STL); and long-term learning (LTL).In general, image similarities are measured from feature representationwhich includes color quantization, texture, color, shapeand edges. However, CSH can describe the image feature onlywith the histogram. Typically the image retrieval process starts byfinding the similarity between the query image and the imagesin the database; the major computation involved here is that theselection of top ranking images requires a sorting algorithm to beemployed at least with the lower bound of O(n log n). A 2D matrixbased indexing of images can enormously reduce the searchtime in STL. The same structure is used for LTL with an aim toreduce the amount of log to be maintained. The performance ofthe proposed framework is analyzed and compared with the existingapproaches, the quantified results indicates that the proposedfeature descriptor is more effectual than the existing feature descriptorsthat were originally developed for CBIR. In terms of STL,the proposed 2D matrix based indexing minimizes the computationeffort for retrieving similar images and for LTL, the proposed algorithmtakes minimum log information than the existing approaches.
基金supported by the National Natural Science Foundation of China(6157138861601398)the National Natural Science Foundation of Hebei Province(F2016203251)
文摘Traditional inverse synthetic aperture radar(ISAR)imaging methods for maneuvering targets have low resolution and poor capability of noise suppression. An ISAR imaging method of maneuvering targets based on phase retrieval is proposed,which can provide a high-resolution and focused map of the spatial distribution of scatterers on the target. According to theoretical derivation, the modulus of raw data from the maneuvering target is not affected by radial motion components for ISAR imaging system, so the phase retrieval algorithm can be used for ISAR imaging problems. However, the traditional phase retrieval algorithm will be not applicable to ISAR imaging under the condition of random noise. To solve this problem, an algorithm is put forward based on the range Doppler(RD) algorithm and oversampling smoothness(OSS) phase retrieval algorithm. The algorithm captures the target information in order to reduce the influence of the random phase on ISAR echoes, and then applies OSS for focusing imaging based on prior information of the RD algorithm. The simulated results demonstrate the validity of this algorithm, which cannot only obtain high resolution imaging for high speed maneuvering targets under the condition of random noise, but also substantially improve the success rate of the phase retrieval algorithm.
文摘随着社交网络的普及和多媒体数据的急剧增长,有效的跨模态检索引起了人们越来越多的关注.由于哈希有效的检索效率和低存储成本,其被广泛用于跨模态检索任务中.然而,这些基于深度学习的跨模态哈希检索方法大多数是利用图像网络和文本网络各自生成对应模态的哈希码,难以获得更加有效的哈希码,无法进一步减小不同模态数据之间的模态鸿沟.为了更好地提高跨模态哈希检索的性能,本文提出了一种基于迁移知识的跨模态双重哈希(Cross-modal Dual Hashing based on Transfer Knowledge,CDHTK).CDHTK通过结合图像网络、知识迁移网络以及文本网络进行跨模态哈希检索任务.对于图像模态,CDHTK融合图像网络和知识迁移网络各自生成的哈希码,进而生成具有判别性的图像哈希码;对于文本模态,CDHTK融合文本网络和知识迁移网络各自生成的哈希码,从而生成有效的文本哈希码.CDHTK通过采用预测标签的交叉熵损失、生成哈希码的联合三元组量化损失以及迁移知识的差分损失来共同优化哈希码的生成过程,从而提高模型的检索效果,在2个常用的数据集(IAPR TC-12,MIR-Flickr 25K)上进行的实验验证了CDHTK的有效性,比当前最先进的跨模态哈希方法(Adaptive Label correlation based asymm Etric Cross-modal Hashing,ALECH)分别高出6.82%和5.13%.