位置信息特征增强与多尺度融合的皮肤镜图像分类
DOI:
CSTR:
作者:
作者单位:

西安建筑科技大学

作者简介:

通讯作者:

中图分类号:

TP391.41

基金项目:

国家自然科学基金项目(面上项目,重点项目,重大项目)


Classification of dermoscopic images based on enhanced positional information and feature fusion
Author:
Affiliation:

1.Xi'2.'3.an University of Architecture and Technology

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Research Plan)

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对皮肤镜图像分类存在的特征提取过程中信息丢失、边界模糊、多尺度特征利用不足等问题,提出了一种改进FastViT(Fast Hybrid Vision Transformer)网络的EFFViT(Enhanced Positional Information and Feature Fusion Vision Transformer)网络以提高皮肤病变分类精度。首先,设计双通道门控坐标注意力特征令牌混合器,增强局部空间位置信息表达,提升病灶定位与细节提取能力;其次,构建特征增强模块,加强细节信息捕捉;最后,提出多尺度特征融合模块,整合不同尺度信息,增强全局与局部特征感知。EFFViT网络在ISIC 2018与 ISIC 2019数据集的实验结果表明,该模型的分类准确率、精确率、召回率、和F1-Score分别达到94.7%、92.5%、93.3%、92.0%和93.8%、90.6%、90.2%、90.3%,与当前主流算法相比,EFFViT在皮肤病变图像分类任务中表现优异。

    Abstract:

    To address the issues of information loss, blurred boundaries, and insufficient utilization of multi-scale features in the feature extraction process of dermoscopic image classification, an improved FastViT (Fast Hybrid Vision Transformer) network, named EFFViT (Enhanced Positional Information and Feature Fusion Vision Transformer), is proposed to enhance skin lesion classification accuracy. First, a dual-channel gated coordinate attention feature token mixer is designed to enhance local spatial positional information representation, thereby improving lesion localization and detail extraction capabilities. Second, a feature enhancement module is constructed to strengthen the model’s ability to capture fine details. Finally, a multi-scale feature fusion module is introduced to integrate information from different scales, enhancing the perception of both global and local features. Experimental results on the ISIC 2018 and ISIC 2019 datasets demonstrate that the proposed EFFViT model achieves classification accuracy, precision, recall, and F1-score of 94.7%, 92.5%, 93.3%, and 92.0% on ISIC 2018, and 93.8%, 90.6%, 90.2%, and 90.3% on ISIC 2019, respectively. Compared with current state-of-the-art algorithms, EFFViT demonstrates superior performance in skin lesion image classification tasks.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-06-06
  • 最后修改日期:2025-08-19
  • 录用日期:2025-09-17
  • 在线发布日期:
  • 出版日期:
文章二维码