Abstract: [Objective] Fish pose estimation (FPE) provides fish physiological information, facilitating health monitoring in aquaculture. It aids decision-making in areas such as fish behavior recognition. When fish are injured or deficient, they often display abnormal behaviors and noticeable changes in the positioning of their body parts. Moreover, the unpredictable posture and orientation of fish during swimming, combined with their rapid swimming speed, restrict the current scope of research in FPE. In this research, an FPE model named HPFPE is presented to capture the swimming posture of fish and accurately detect their key points. [Methods] On the one hand, the model incorporated the CBAM module into the HRNet framework. The attention module enhanced accuracy without adding computational complexity, while effectively capturing a broader range of contextual information. On the other hand, the model incorporated dilated convolution to increase the receptive field, allowing it to capture more spatial context. [Results and Discussions] Experiments showed that, compared with the baseline method, the average precision (AP) of HPFPE with different backbones and input sizes on the Oplegnathus punctatus dataset increased by 0.62, 1.35, 1.76, and 1.28 percentage points, respectively, while the average recall (AR) increased by 0.85, 1.50, 1.40, and 1.00, respectively. Additionally, HPFPE outperformed other mainstream methods, including DeepPose, CPM, SCNet, and Lite-HRNet. Furthermore, when compared with other methods on the ornamental fish data, HPFPE achieved the highest AP and AR values of 52.96% and 59.50%, respectively. [Conclusions] The proposed HPFPE can accurately estimate fish posture and assess swimming patterns, serving as a valuable reference for applications such as fish behavior recognition.
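To make the receptive-field claim concrete, here is a minimal sketch of the standard arithmetic for dilated convolutions. The layer configurations are illustrative assumptions and are not taken from HPFPE; the helper names `effective_kernel` and `receptive_field` are hypothetical.

```python
def effective_kernel(kernel, dilation):
    """Span covered by a dilated kernel: d * (k - 1) + 1 input positions."""
    return dilation * (kernel - 1) + 1


def receptive_field(layers):
    """Receptive field of a stack of conv layers.

    layers: list of (kernel, dilation, stride) tuples, first layer first.
    The layer settings used below are assumptions for illustration only.
    """
    rf, jump = 1, 1  # jump = distance between adjacent outputs, in input pixels
    for kernel, dilation, stride in layers:
        rf += (effective_kernel(kernel, dilation) - 1) * jump
        jump *= stride
    return rf


# Three plain 3x3 layers versus three 3x3 layers with dilations 1, 2, 4.
plain = receptive_field([(3, 1, 1)] * 3)
dilated = receptive_field([(3, 1, 1), (3, 2, 1), (3, 4, 1)])
print(plain, dilated)
```

With the same number of weights per layer, the dilated stack sees a wider input window than the plain one, which is the effect the abstract attributes to dilated convolution.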
Funding: National Natural Science Foundation of China (32301718); Chinese Academy of Agricultural Sciences under the Special Institute-level Coordination Project for Basic Research Operating Costs (S202328).
Abstract: The cold chain in the production area of fruits and vegetables is the primary link for reducing product loss and improving product quality, but it is also a weak link. With the application of big data technology in cold chain logistics, intelligent devices and technologies have become important carriers for improving the efficiency of cold chain logistics in fruit and vegetable production areas, extending the shelf life of fruits and vegetables, and reducing losses. They offer many advantages in pre-cooling, sorting and packaging, testing, warehousing, transportation, and other aspects. This article summarizes the rapidly developing and widely used intelligent technologies at home and abroad in recent years, including automated guided vehicle intelligent handling based on electromagnetic or optical technology; intelligent sorting based on sensors, electronic optics, and other technologies; intelligent detection based on computer vision technology; and intelligent transportation based on perspective imaging technology. It analyzes the innovative research and achievements of various scholars in applying intelligent technology to fruit and vegetable cold chain storage, sorting, detection, transportation, and other links to improve the efficiency of fruit and vegetable cold chain logistics. However, applying intelligent technology in fruit and vegetable cold chain logistics also faces many problems. High cost, difficulty in technological integration, and talent shortages have limited the development of intelligent technology in this field. To address these problems, it is proposed that costs be controlled through independent research and development, technological innovation, and other means, lowering the entry threshold for small enterprises. The integration of intelligent technology with cold chain logistics systems should be strengthened to improve data security and system compatibility. At the same time, the government should introduce relevant policies, provide the necessary financial support, and establish talent training mechanisms. The development and improvement of intelligent technology standards in the field of cold chain logistics should be accelerated. Through technological innovation, cost control, talent cultivation, and policy guidance, the article aims to promote the upgrading of the agricultural industry and provide ideas for improving the quality and efficiency of fruit and vegetable cold chain logistics.
Funding: Supported by the Fundamental Research Funds for the Central Universities Project (CDJZR10170010).
Abstract: The mean shift tracker has difficulty tracking fast-moving targets and suffers from the accumulation of tracking errors. To overcome these limitations, a new approach is proposed that integrates the mean shift algorithm with frame-difference methods. The rough position of the moving target is first located by the direct frame-difference algorithm for stationary-camera scenes and by the three-frame-difference algorithm for moving-camera scenes. Then, the mean shift algorithm is used to achieve precise tracking of the target. Several tracking experiments show that the proposed method can effectively track fast-moving targets and overcome the tracking error accumulation problem.
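The coarse localization step described above can be sketched as follows. This is a minimal illustration, assuming grayscale frames stored as nested lists and a hand-picked threshold; it omits any camera-motion handling and the subsequent mean shift refinement, and the function names are hypothetical rather than the authors' own.

```python
def diff_mask(frame_a, frame_b, thresh):
    """Direct (two-frame) difference: True where the pixels differ by more than thresh."""
    return [[abs(pa - pb) > thresh for pa, pb in zip(row_a, row_b)]
            for row_a, row_b in zip(frame_a, frame_b)]


def three_frame_mask(prev, cur, nxt, thresh):
    """Three-frame difference: a pixel counts as moving only if it changed
    both between prev and cur and between cur and nxt."""
    m1 = diff_mask(cur, prev, thresh)
    m2 = diff_mask(nxt, cur, thresh)
    return [[a and b for a, b in zip(r1, r2)] for r1, r2 in zip(m1, m2)]


def centroid(mask):
    """Rough target position as the (row, col) centroid of the moving pixels."""
    pts = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    if not pts:
        return None  # no motion detected
    n = len(pts)
    return (sum(r for r, _ in pts) / n, sum(c for _, c in pts) / n)


def frame_with_dot(col, rows=3, cols=5):
    """Hypothetical test frame: black image with one bright pixel in row 1."""
    frame = [[0] * cols for _ in range(rows)]
    frame[1][col] = 255
    return frame


prev, cur, nxt = frame_with_dot(1), frame_with_dot(2), frame_with_dot(3)
rough_two = centroid(diff_mask(cur, prev, 50))
rough_three = centroid(three_frame_mask(prev, cur, nxt, 50))
print(rough_two, rough_three)
```

In this toy sequence the two-frame mask contains both the old and the new position of the dot, while the three-frame mask keeps only the position in the current frame. A tracker would then start the mean shift iteration from that rough position.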
Funding: Project (51178193) supported by the National Natural Science Foundation of China; Project (2009 353-344-570) supported by the Ministry of Transport of China; Project (2010-02-051) supported by the Transportation Department of Guangdong Province, China.
Abstract: Using digital image technology, a crack detection method for reinforced concrete bridges was studied for performance assessment. The effects of image gray level, pixel rate, noise filtering, and edge detection were analyzed with respect to crack qualities. A computer program was developed in Visual C++ 6.0 to detect the cracks and was tested on 15 cases of bridge video images. The results indicate that the relative error is within 6% for cracks wider than 0.3 mm and less than 10% for crack widths between 0.2 mm and 0.3 mm. In addition, for cracks below 0.1 mm, the relative error is more than 30%, because the bridge is in a safe stage and it is very difficult to detect the actual crack width.
Funding: Project (61702063) supported by the National Natural Science Foundation of China.
Abstract: With the rise and continuous development of machine learning, especially deep learning, research on visual question answering (VQA) has made significant progress and has both theoretical significance and practical application value. It is therefore necessary to summarize the current research and provide a reference for researchers in this field. This article presents a detailed, in-depth analysis and summary of relevant research and typical methods in VQA. First, relevant background knowledge about VQA was introduced. Second, the issues and challenges of VQA were discussed, together with some promising methodologies. Third, the key sub-problems affecting VQA were summarized and analyzed. Then, the commonly used datasets and evaluation metrics were summarized. Next, the popular algorithms and models in VQA research were compared and listed. Finally, future development trends of VQA were discussed and conclusions were drawn.
Abstract: A computer vision system for tomato images was set up. Using this system, the RGB values of a tomato image were converted into HSI values, and the H (hue) component was used to characterize the surface color of the tomato. A multilayer feed-forward neural network combined with a genetic algorithm (GA) then performed automatic identification of tomato maturity. Experimental results showed an accuracy of up to 94%.
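The hue extraction step can be illustrated with the textbook RGB-to-HSI hue formula. This is a sketch of that standard conversion and is not confirmed to be the exact variant used by the authors; the function name `rgb_to_hue` is hypothetical.

```python
import math


def rgb_to_hue(r, g, b):
    """Hue in degrees [0, 360) from the textbook HSI conversion.

    theta = arccos( 0.5 * ((R - G) + (R - B)) / sqrt((R - G)^2 + (R - B)(G - B)) )
    H = theta if B <= G else 360 - theta
    """
    num = 0.5 * ((r - g) + (r - b))
    den = math.sqrt(max(0.0, (r - g) ** 2 + (r - b) * (g - b)))
    if den == 0:
        return 0.0  # gray pixel: hue is undefined, 0 is returned by convention
    ratio = max(-1.0, min(1.0, num / den))  # guard against rounding outside [-1, 1]
    theta = math.degrees(math.acos(ratio))
    return theta if b <= g else 360.0 - theta


# Pure red, green, and blue land roughly 120 degrees apart on the hue circle.
for name, rgb in [("red", (255, 0, 0)), ("green", (0, 255, 0)), ("blue", (0, 0, 255))]:
    print(name, round(rgb_to_hue(*rgb), 1))
```

Hue separates red from green regardless of brightness, which is presumably why the H channel is a convenient ripeness feature; the per-pixel hues would then be summarized into features for the classifier.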
Funding: Projects (61173122, 61262032) supported by the National Natural Science Foundation of China; Projects (11JJ3067, 12JJ2038) supported by the Natural Science Foundation of Hunan Province, China.
Abstract: Low-rank matrix recovery is an important problem extensively studied in the machine learning, data mining, and computer vision communities. A novel method is proposed for low-rank matrix recovery, targeting higher recovery accuracy and a stronger theoretical guarantee. Specifically, the proposed method is based on a nonconvex optimization model, by solving which the low-rank matrix can be recovered from noisy observations. To solve the model, an effective algorithm is derived by minimizing over the variables alternately. It is proved theoretically that this algorithm has a stronger guarantee than existing work. In natural image denoising experiments, the proposed method achieves lower recovery error than the two compared methods. The proposed method is also applied to two real-world problems, namely removing noise from verification codes and removing watermarks from images, in which the images recovered by the proposed method are less noisy than those of the two compared methods.
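The abstract does not state the optimization model, so the following is only a generic illustration of alternating minimization on the simplest nonconvex low-rank problem, a rank-1 factorization that minimizes the squared error between M and the outer product of u and v. The function names and the all-ones initialization are assumptions, and this is not the proposed method.

```python
def rank1_als(M, iters=20):
    """Alternating least squares for min over u, v of ||M - u v^T||_F^2.

    Each step fixes one factor and solves for the other in closed form:
        u = M v / (v . v),   v = M^T u / (u . u)
    Starts from v = all ones (an arbitrary choice); if M v happens to be
    zero for that start, the loop stops early with a zero factor.
    """
    rows, cols = len(M), len(M[0])
    u, v = [0.0] * rows, [1.0] * cols
    for _ in range(iters):
        vv = sum(x * x for x in v)
        if vv == 0:
            break
        u = [sum(M[i][j] * v[j] for j in range(cols)) / vv for i in range(rows)]
        uu = sum(x * x for x in u)
        if uu == 0:
            break
        v = [sum(M[i][j] * u[i] for i in range(rows)) / uu for j in range(cols)]
    return u, v


def outer(u, v):
    """Rank-1 matrix u v^T as nested lists."""
    return [[ui * vj for vj in v] for ui in u]


# An exactly rank-1 matrix is reproduced by the factorization.
M = [[3.0, 4.0, 5.0], [6.0, 8.0, 10.0]]
u, v = rank1_als(M)
recovered = outer(u, v)
```

The objective is nonconvex in (u, v) jointly but convex in each factor separately, which is what makes the alternating scheme attractive.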
Funding: Supported by the National High Technology Research and Development Program of China (863 Program) (2007AA04Z227).
Abstract: This paper proposes a robust method of parameter estimation and data classification for multiple-structural data based on the linear errors-in-variables (EIV) model. The traditional EIV model fitting problem is analyzed, and a robust growing algorithm is developed to extract the underlying linear structure of the observed data. Under the structural density assumption, the C-step technique borrowed from Rousseeuw's robust MCD estimator is used to keep the algorithm robust, and the mean-shift algorithm is adopted to ensure a good initialization. To eliminate the model ambiguities of multiple-structural data, statistical hypothesis tests are used to refine the data classification and improve the accuracy of the model parameter estimation. Experiments demonstrate the efficiency and robustness of the proposed algorithm.
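For orientation, the non-robust baseline of a linear EIV fit in 2D is total least squares (orthogonal regression), where both coordinates are treated as noisy. The sketch below shows that classical fit only; it contains none of the robust growing, C-step, or mean-shift components of the proposed method, and the function names are hypothetical.

```python
import math


def fit_line_tls(points):
    """Total least squares line a*x + b*y + c = 0 with a^2 + b^2 = 1.

    Minimizes the sum of squared orthogonal distances, i.e. errors are
    allowed in both x and y (the linear EIV setting in two dimensions).
    """
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    syy = sum((y - my) ** 2 for _, y in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    # Direction of largest scatter (principal axis of the centered points).
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    a, b = -math.sin(theta), math.cos(theta)  # unit normal to that direction
    c = -(a * mx + b * my)                    # the line passes through the centroid
    return a, b, c


def orthogonal_distance(line, point):
    """Distance from a point to the line; valid because (a, b) has unit length."""
    a, b, c = line
    return abs(a * point[0] + b * point[1] + c)


# Points lying exactly on y = 2x + 1 give zero orthogonal residuals.
pts = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0)]
line = fit_line_tls(pts)
residuals = [orthogonal_distance(line, p) for p in pts]
```

A single outlier can pull this fit arbitrarily far, which motivates the robust machinery described in the abstract.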
Funding: Supported by the National Science Foundation of China (60872109) and the Program for New Century Excellent Talents in University (NCET-06-0900).
Abstract: To obtain both a high compression ratio and a high-quality reconstructed image, an effective image compression scheme named irregular segmentation region coding based on spiking cortical model (ISRCS) is presented. The scheme is region-based and focuses on two issues. The first is segmentation: an appropriate algorithm is developed to partition an image into irregular regions and tidy contours, in which the crucial regions corresponding to objects are retained and many tiny parts are eliminated. The irregular regions and the contours are then coded using different methods. The second issue is the coding of contours, for which an efficient and novel chain code is employed. The scheme seeks a compromise between the quality of reconstructed images and the compression ratio. The underlying principles are described and experiments are conducted; the results show better performance than other compression technologies in terms of reconstructed image quality, compression ratio, and time consumption.
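The abstract does not specify its novel chain code, so as background the sketch below shows the classical 8-direction Freeman chain code, which stores a contour as a start pixel plus one small direction symbol per step. The direction numbering and function names are assumptions chosen for this illustration.

```python
# (d_row, d_col) -> Freeman direction; rows grow downward, 0 = east,
# numbered counter-clockwise as seen on the image.
DIRS = {(0, 1): 0, (-1, 1): 1, (-1, 0): 2, (-1, -1): 3,
        (0, -1): 4, (1, -1): 5, (1, 0): 6, (1, 1): 7}
STEPS = {code: step for step, code in DIRS.items()}


def encode_chain(contour):
    """Encode a list of 8-connected (row, col) pixels as (start, codes)."""
    codes = []
    for (r0, c0), (r1, c1) in zip(contour, contour[1:]):
        step = (r1 - r0, c1 - c0)
        if step not in DIRS:
            raise ValueError("consecutive pixels must be 8-connected neighbours")
        codes.append(DIRS[step])
    return contour[0], codes


def decode_chain(start, codes):
    """Rebuild the pixel list from the start pixel and the direction codes."""
    r, c = start
    pixels = [(r, c)]
    for code in codes:
        dr, dc = STEPS[code]
        r, c = r + dr, c + dc
        pixels.append((r, c))
    return pixels


contour = [(0, 0), (0, 1), (1, 2), (2, 2), (2, 1)]
start, codes = encode_chain(contour)
print(start, codes)
```

Each step needs only 3 bits instead of a full coordinate pair, which is why chain codes are a natural fit for contour coding in a region-based compression scheme.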