融合Transformer和线索交叉聚合的结直肠息肉分割方法
DOI:
作者:
作者单位:

江西理工大学

作者简介:

通讯作者:

中图分类号:

TP391.4

基金项目:

国家自然科学基金 ;江西省自然科学基金


Colorectal polyp segmentation method fusing Transformer and cue cross-polymerization
Author:
Affiliation:

Jiangxi University

Fund Project:

National Natural Science Foundation of China; Natural Science Foundation of Jiangxi Province

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对结直肠息肉图像分割时动态信息处理和边缘细节捕捉能力不足,导致边界信息损失和错误分割等问题,本文提出一种建立在Swin Transformer框架上的线索交叉聚合结肠息肉分割方法。该方法首先从全局上下文信息特征入手,通过Transformer编码器对图像的病变特征进行逐级提取。其次利用改进的二阶通道注意力机制实现高阶特征间的相互依赖,增强跨层级信息交互能力,有效提取丰富的多尺度上下文特征信息。再次采用反向频率通道注意力机制中的离散余弦变换,在通道中嵌入更多的信息,突出多尺度上下文信息的通道特征。最后通过线索交叉聚合模块将上述多尺度特征与单目特征进行结合,从动态和静态深度两个层面增强图像特征,进而提升动态信息处理和细节捕捉能力。在数据集CVC-ClinicDB、Kvasir 、CVC-ColonDB 和ETIS-LaribPolypDB上进行测试,Dice指数分别为0.942、0.924、0.800和0.774。MIou指数分别为0.896、0.878、0.726和0.697。实验数据表明,本文提出的方法能有效分割结直肠息肉图像,为结直肠息肉的诊断提供了新思路。

    Abstract:

    In order to solve the problems of insufficient dynamic information processing and edge detail capture in colorectal polyp image segmentation, such as boundary information loss and wrong segmentation, this paper proposes a colon polyp segmentation method based on Swin Transformer framework. The method starts with the global context information features and extracts the lesion features step by step using Transformer encoder. Secondly, the improved second-order channel attention mechanism is used to realize the interdependence between higher-order features, enhance the cross-level information interaction ability, and effectively extract rich multi-scale context feature information. Again, the discrete cosine transform (DCT) in the attention mechanism of the reverse frequency channel is used to embed more information in the channel and highlight the channel characteristics of multi-scale context information. Finally, the above multi-scale features are combined with monocular features through the cross-cue aggregation module to enhance the image features from both dynamic and static depth levels, thus improving the dynamic information processing and detail capture capabilities. When tested on the datasets CVC-ClinicDB, Kvasir, CVC-ColonDB, and ETIS-LaribPolypDB, Dice indices were 0.942, 0.924, 0.800, and 0.774, respectively. The MIou index was 0.896, 0.878, 0.726 and 0.697, respectively. The experimental data show that the proposed method can effectively segment colorectal polyp images and provide a new idea for the diagnosis of colorectal polyp.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2023-07-05
  • 最后修改日期:2023-09-22
  • 录用日期:2023-10-31
  • 在线发布日期:
  • 出版日期: