多尺度信息交互与融合的乳腺病理图像分类

丁维龙; 朱峰龙; 郑魁; 贾秀鹏

发布时间： 2024-04-15
摘要点击次数： 1060
全文下载次数： 717
DOI: 10.11834/jig.221178
2024 | Volume 29 | Number 4

多尺度信息交互与融合的乳腺病理图像分类

丁维龙¹, 朱峰龙¹, 郑魁², 贾秀鹏³(1.浙江工业大学计算机科学技术学院, 杭州 310023;2.上海派影医疗科技有限公司, 上海 201306;3.宁波市临床病理诊断中心, 宁波 315031)

摘要

目的基于深度学习方法进行乳腺癌识别是一项具有挑战的任务,目前较多研究使用单一倍率下的乳腺组织病理图像作为模型的输入,忽略了乳腺组织病理图像固有的多倍率特点,而少数将不同倍率下的图像作为模型输入的研究,存在特征利用率较低以及不同倍率的图像之间缺乏信息交互等问题。方法针对上述问题,提出一种基于多尺度和分组注意力机制的卷积神经网络改进策略。该策略主要包括信息交互模块和特征融合模块。前者通过空间注意力加强不同倍率的图像之间的相关性,然后将加权累加的结果反馈给原始分支进行动态选择实现特征流通;后者则利用一种分组注意力来提升特征的利用率,同时基于特征金字塔来消除图像之间的感受野差异。结果本文将上述策略应用到多种卷积网络中,并与最新的方法进行比较。在Camelyon16公开数据集上进行五折交叉验证实验,并对每一项评价指标计算均值和标准差。相比于单一尺度图像作为输入的卷积网络,本文改进的方法在准确率上提升0.9%～1.1%,F1分数提升1.1%～1.2%;相较于对比方法中性能最好的TransPath网络,本文改进的DenseNet201（dense convolutional network）在准确率上提升0.6%,精确率提升0.8%,F1分数提升0.6%,并且各项指标的标准差低于Transpath,表明加入策略的网络具有更好的稳定性。结论本文所提出的策略能弥补一般多尺度网络的缺陷,并具备一定的通用性,可获得更好的乳腺癌分类性能。

关键词

乳腺病理图像分类密集卷积网络多尺度注意力特征融合

Classification of breast pathological images based on multiscale information interaction and fusion

Ding Weilong¹, Zhu Fenglong¹, Zheng Kui², Jia Xiupeng³(1.College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China;2.Shanghai Paiying Medical Technology Co., Ltd., Shanghai 201306, China;3.Ningbo Clinical Pathology Diagnostic Center, Ningbo 315031, China)

Abstract

Objective Breast cancer recognition based on deep learning methods is a challenging task due to the large size of breast histopathology images(single image size is approximately 1 GB).Thus,these images must be cut and then identified due to the current computational power limitations.Current research on breast cancer recognition focuses on singlescale networks,ignoring the characteristics of multiple magnifications and pyramidal structure storage of breast histopathology images.Several studies on multiscale networks only input images of different magnifications into the network model and concatenate or aggregate various features after multilayer convolutional layer operations.The feature fusion is simple and ignores the correlation between images of different scales as well as the guidance between images of different scales when extracting their texture features in the shallow part of the network model.Therefore,problems such as low feature utiliza- tion and lack of information interaction exist between images of different magnifications.Method This paper proposes a convolutional neural network improvement strategy based on multiscale and group attention mechanisms to address the above problems.The strategy mainly includes the following two modules:information interaction and feature fusion modules.The first module extracts clear cell morphological structure and global context information from high- and low-magnification images,respectively,through a spatial attention mechanism.The feature information with high relevance to the classification target of the main branch will be given additional weight.Finally,these features are weighted and accumulated,and the results are fed back to the original branch for dynamic selection to achieve feature interaction and circulation.The second module considers that the number of channels on the feature map will multiply as the depth of the network increases,and the general channel attention encounters problems of large computation and low feature activation rate.Therefore,this paper proposes group attention based on group convolution and combines it into the feature fusion module.In addition,a difference in the receptive field of the images is observed at different magnifications(i.e.,the actual length of each pixel is different).Thus,this paper uses a feature pyramid to eliminate the perceptual domain difference in the feature fusion process .Result In this paper,the above strategy is applied to a variety of convolutional neural networks and compared with the latest methods.A fivefold cross-validation experiment is conducted on the Camelyon16 public dataset,and the mean and standard deviation are calculated for each evaluation metric.Compared with the single-scale convolutional network,the introduced method in this paper demonstrated 0.9%-1.1% improvement in accuracy and 1.1%-1.2% in F1-score.Compared with the best-performing TransPath network in the single-scale network,the enhanced DenseNet201 in this paper demonstrated a 0.6% improvement in accuracy,0.8% in precision,0.6% in F1-score,and the standard deviation of the indicators is lower than that of TransPath,indicating that the network incorporating the strategy has a better stability.Conclusion Overall,the proposed strategy in this paper can compensate for the shortcomings of general multiscale networks and has certain generality to obtain superior performance in breast cancer image classification.Thus,this strategy is useful for future multiscale research and feature extraction for downstream tasks.

Keywords

classification of breast pathological images dense convolutional network multiscale attention fusion of features

在线采编平台

论文出版

年度会议

下载中心

年度信息