AIM: To establish pupil diameter measurement algorithms based on infrared images that can be used in real-world clinical settings. METHODS: A total of 188 patients from the outpatient clinic at He Eye Specialist Shenyang Hospital from September to December 2022 were included, and 13470 infrared pupil images were collected for the study. All infrared images for pupil segmentation were labeled using the Labelme software. The computation of pupil diameter is divided into four steps: image pre-processing, pupil identification and localization, pupil segmentation, and diameter calculation. Two major models are used in the computation process: the modified YoloV3 and DeeplabV3+ models, which must be trained beforehand. RESULTS: The test dataset included 1348 infrared pupil images. On the test dataset, the modified YoloV3 model had a detection rate of 99.98% and an average precision (AP) of 0.80 for pupils. The DeeplabV3+ model achieved a background intersection over union (IOU) of 99.23%, a pupil IOU of 93.81%, and a mean IOU of 96.52%. The pupil diameters in the test dataset ranged from 20 to 56 pixels, with a mean of 36.06±6.85 pixels. The absolute error in pupil diameters between predicted and actual values ranged from 0 to 7 pixels, with a mean absolute error (MAE) of 1.06±0.96 pixels. CONCLUSION: This study demonstrates a robust infrared image-based pupil diameter measurement algorithm that is accurate and reliable for clinical application.
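The segmentation and error figures in this abstract rest on a few simple computations, sketched below in plain Python. The abstract does not say how the diameter is derived from the segmented mask, so the equal-area circle used in `equivalent_diameter` is an assumption, as are all function names and the toy 4×4 masks. The reported mean IOU is consistent with the arithmetic mean of the two class IOUs: (99.23% + 93.81%) / 2 = 96.52%.

```python
import math


def iou(pred, truth, label):
    """Intersection over union of one class label across two same-sized masks."""
    inter = union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            p_hit, t_hit = (p == label), (t == label)
            inter += int(p_hit and t_hit)
            union += int(p_hit or t_hit)
    # If the label is absent from both masks, treat the agreement as perfect.
    return inter / union if union else 1.0


def equivalent_diameter(mask, label=1):
    """Diameter in pixels of the circle whose area equals the labelled region.

    Assumption: the source does not state its diameter formula.
    """
    area = sum(row.count(label) for row in mask)
    return 2.0 * math.sqrt(area / math.pi)


def mean_absolute_error(predicted, actual):
    """Mean of |predicted - actual| over paired measurements."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# Toy masks (hypothetical data): 1 = pupil, 0 = background.
truth = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]

pupil_iou = iou(pred, truth, 1)        # 3 shared pixels / 4 in the union
background_iou = iou(pred, truth, 0)   # 12 shared pixels / 13 in the union
mean_iou = (pupil_iou + background_iou) / 2
diameter_error = abs(equivalent_diameter(pred) - equivalent_diameter(truth))
```

An ellipse fit or the longest axis of the mask would be equally plausible choices for the diameter step; the equal-area circle is used here only because it needs no extra dependencies.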
Deep learning techniques are revolutionizing the development of medical image segmentation. Transformer models, especially ViT and Swin-Transformer, strengthen long-range dependency modeling through the self-attention mechanism, so better segmentation performance can be achieved. However, the high computational cost of Transformers has motivated researchers to explore more efficient models, such as the Mamba model based on state-space modeling (SSM), and in medical segmentation it is also necessary to reduce the number of model parameters. In this study, a novel asymmetric model called LA-UMamba was proposed, which integrates a visual Mamba module to efficiently capture complex visual features and long-range dependencies. The classical U-Net design was adopted in the upsampling phase to help reduce the number of parameters and recover more details. To mitigate information loss, an auxiliary U-Net downsampling layer was designed that only resizes its input without extracting features, thus better preserving the input information while maintaining the efficiency of the model. Experiments were conducted on the ACDC MRI cardiac segmentation dataset, and the results showed that the proposed LA-UMamba achieves improved performance compared to the baseline model on several evaluation metrics, such as IoU, Accuracy, Precision, HD and ASD. This shows that the model succeeds in optimizing detail processing and reducing model complexity, providing a new perspective for further optimization of medical image segmentation techniques.
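The efficiency argument for state-space models can be illustrated with the basic linear recurrence they build on: h_t = a * h_(t-1) + b * x_t and y_t = c * h_t, which costs one pass over the sequence instead of the all-pairs comparison of self-attention. The sketch below is a plain diagonal SSM, not the LA-UMamba or Mamba implementation; Mamba additionally makes its parameters depend on the input, which is omitted here. The function name `ssm_scan` and the parameter values are hypothetical.

```python
def ssm_scan(xs, a, b, c):
    """Run a diagonal discrete state-space model over a 1-D input sequence.

    For each state dimension i:  h_i <- a_i * h_i + b_i * x
    Output at each step:         y = sum_i c_i * h_i
    Cost is linear in len(xs).
    """
    h = [0.0] * len(a)
    ys = []
    for x in xs:
        h = [a_i * h_i + b_i * x for a_i, h_i, b_i in zip(a, h, b)]
        ys.append(sum(c_i * h_i for c_i, h_i in zip(c, h)))
    return ys


# Impulse input: the output decays geometrically with rate a = 0.5.
ys = ssm_scan([1.0, 0.0, 0.0], a=[0.5], b=[1.0], c=[2.0])
```

With a single state dimension and an impulse input, the output is the impulse response c * a^t * b, which is how the state carries information from early positions to later ones.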
Building information modeling (BIM) object classification takes a lot of time and energy. Misclassification or omission of any object may lead to abnormal results, which have a great impact on the project workflow and outcomes. To address BIM object classification, this study tunes the parameters of a Swin Transformer classifier and applies deep learning to 7 categories of BIM objects, using model primitives extracted from IFC-format BIM model files. The performance and evaluation indicators obtained in training show that the approach improves classification accuracy.
To improve the accuracy of short text matching, a short text matching method with knowledge and structure enhancement for BERT (KS-BERT) was proposed in this study. This method first introduced external knowledge to the input text, and then sent the expanded text to both the context encoder BERT and the structure encoder GAT to capture the contextual relationship features and structural features of the input text. Finally, the match was determined based on the fusion result of the two features. Experimental results on the public datasets BQ_corpus and LCQMC showed that KS-BERT outperforms advanced models such as ERNIE 2.0. This study showed that knowledge enhancement and structure enhancement are two effective ways to improve BERT in short text matching. In BQ_corpus, ACC was improved by 0.2% and 0.3%, respectively, while in LCQMC, ACC was improved by 0.4% and 0.9%, respectively.
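The final step, deciding a match from the fused contextual and structural features, might look like the following minimal sketch. The abstract does not specify the fusion operator or the classifier, so concatenation followed by a logistic layer is an assumption, and every name, weight, and vector here is hypothetical; in KS-BERT the two input vectors would come from the BERT and GAT encoders.

```python
import math


def fuse_and_score(context_vec, structure_vec, weights, bias):
    """Concatenate two feature vectors and map them to a match probability.

    Assumption: fusion by concatenation plus one logistic layer; the source
    only says the decision uses "the fusion result of the two features".
    """
    fused = list(context_vec) + list(structure_vec)
    if len(weights) != len(fused):
        raise ValueError("weights must have one entry per fused feature")
    logit = sum(w * f for w, f in zip(weights, fused)) + bias
    return 1.0 / (1.0 + math.exp(-logit))


def is_match(probability, threshold=0.5):
    """Binary decision on the fused score."""
    return probability >= threshold


# Hypothetical 2-d contextual and 1-d structural features.
score = fuse_and_score([1.0, 2.0], [3.0], weights=[1.0, 1.0, 1.0], bias=0.0)
neutral = fuse_and_score([0.0], [0.0], weights=[1.0, 1.0], bias=0.0)
```

Concatenation keeps the two feature sources separate until the classifier weights them, which is one simple way to let the contextual and structural signals contribute independently.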