研究动态
Articles below are published ahead of final publication in an issue. Please cite articles in the following format: authors, (year), title, journal, DOI.

CCGL-YOLOV5:一种跨模态、跨尺度的全局-局部注意力YOLOV5肺肿瘤检测模型。

CCGL-YOLOV5:A cross-modal cross-scale global-local attention YOLOV5 lung tumor detection model.

发表日期:2023 Aug 28
作者: Tao Zhou, Fengzhen Liu, Xinyu Ye, Hongwei Wang, Huiling Lu
来源: COMPUTERS IN BIOLOGY AND MEDICINE

摘要:

多模态医学图像检测是医学图像分析中的关键技术,在肿瘤诊断中起着重要作用。多模态肺部肿瘤图像中存在不同大小和形状的病变,这使得有效提取肺部肿瘤病变的关键特征变得困难。本文提出了一种跨模态跨尺度全局-局部注意力YOLOV5肺部肿瘤检测模型(CCGL-YOLOV5)。主要工作如下:首先,设计了跨模态融合变换器模块(CMFTM),通过多模态特征的交互辅助融合,提高了多模态关键病变特征的提取能力和融合能力;其次,提出了全局-局部特征交互模块(GLFIM),通过双向交互分支增强了多模态全局特征和多模态局部特征之间的交互能力。第三,设计了跨尺度注意力融合模块(CSAFM),通过对特征融合进行多尺度注意力分组,获得丰富的多尺度特征。进行了与先进网络的对比实验。CCGL-YOLOV5模型在多模态肺部肿瘤PET / CT数据集上的Acc、Rec、mAP、F1 score和FPS分别为97.83%、97.39%、96.67%、97.61%和98.59。实验结果表明,本文中CCGL-YOLOV5模型的性能优于其他典型模型。CCGL-YOLOV5模型可以有效利用多模态特征信息,对多模态医学图像研究和临床疾病诊断具有重要意义。版权所有©2023 Elsevier Ltd.保留所有权利。
Multimodal medical image detection is a key technology in medical image analysis, which plays an important role in tumor diagnosis. There are different sizes lesions and different shapes lesions in multimodal lung tumor images, which makes it difficult to effectively extract key features of lung tumor lesions.A Cross-modal Cross-scale Clobal-Local Attention YOLOV5 Lung Tumor Detection Model (CCGL-YOLOV5) is proposed in this paper. The main works are as follows: Firstly, the Cross-Modal Fusion Transformer Module (CMFTM) is designed to improve the multimodal key lesion feature extraction ability and fusion ability through the interactive assisted fusion of multimodal features; Secondly, the Global-Local Feature Interaction Module (GLFIM) is proposed to enhance the interaction ability between multimodal global features and multimodal local features through bidirectional interactive branches. Thirdly, the Cross-Scale Attention Fusion Module (CSAFM) is designed to obtain rich multi-scale features through grouping multi-scale attention for feature fusion.The comparison experiments with advanced networks are done. The Acc, Rec, mAP, F1 score and FPS of CCGL-YOLOV5 model on multimodal lung tumor PET/CT dataset are 97.83%, 97.39%, 96.67%, 97.61% and 98.59, respectively; The experimental results show that the performance of CCGL-YOLOV5 model in this paper are better than other typical models.The CCGL-YOLOV5 model can effectively use the multimodal feature information. There are important implications for multimodal medical image research and clinical disease diagnosis in CCGL-YOLOV5 model.Copyright © 2023 Elsevier Ltd. All rights reserved.