医学 AI - arXivDaily 专题

2606.20250 2026-06-19 cs.CV 新提交 90%

Single-Stage Hierarchical Rectification for Weakly Supervised Histopathology Segmentation

单阶段层次化校正用于弱监督组织病理学分割

Duc T. Nguyen, Hoang-Long Nguyen, Thanh-Ha DO, Huy-Hieu Pham

发表机构 * VinUni-Illinois Smart Health Center, VinUniversity, Hanoi, Vietnam（越南河内VinUniversity VinUni-Illinois智慧健康中心）； The Computer Vision and Medical AI Lab, VinUniversity, Hanoi, Vietnam（越南河内VinUniversity计算机视觉与医学人工智能实验室）； Posts and Telecommunications Institute of Technology, Hanoi, Vietnam（越南河内邮电技术学院）

专题命中病理影像：弱监督组织病理学分割

AI总结提出单阶段层次化校正框架，通过层次化特征校正模块在单次训练中直接生成高保真激活图，解决多阶段弱监督分割中的误差传播和计算开销问题。

Comments Accepted to MICCAI 2026. This is the pre-review submitted version, not the camera-ready version. The final authenticated version will be available in the MICCAI 2026 proceedings

详情

AI中文摘要

现有的计算病理学中的弱监督语义分割方法依赖于多阶段范式：类激活图生成、离线伪掩码细化和全监督再训练。虽然这种解耦方法已被广泛采用，但它存在根本性缺陷。多阶段过程不仅导致高计算训练成本，还遭受误差传播：浅层CNN中的局部纹理偏差产生假阳性伪影，后续细化步骤往往无法纠正。为了通过简单而高效的方法解决这些持续存在的挑战，我们提出了单阶段层次化校正（SSHR）框架。我们的方法不是事后被动地细化CAM，而是在前向传播过程中主动净化中间特征表示。我们引入了一个层次化特征校正模块（HFRM），利用深层全局语义上下文过滤浅层中的局部异常。该机制在单个训练循环内直接生成高保真激活图。在LUAD-HistoSeg和BCSS数据集上的实验表明，SSHR优于最先进的多阶段方法。此外，SSHR将训练时间减少了2到5倍。这种效率降低了计算开销，并加速了大规模组织病理学工作流的临床转化。代码可在以下网址获取：this https URL

英文摘要

Existing weakly supervised semantic segmentation (WSSS) methods in computational pathology rely on a multi-stage paradigm: class activation map (CAM) generation, offline pseudo-mask refinement, and fully supervised retraining. While established, this decoupled approach presents fundamental limitations. The multi-stage process not only incurs high computational training costs but also suffers from error propagation: local texture biases in shallow CNN layers generate false-positive artifacts that subsequent refinement steps often fail to correct. To address these persistent challenges through a simple yet highly effective approach, we propose the Single-Stage Hierarchical Rectification (SSHR) framework. Rather than passively refining CAMs post-hoc, our method proactively purifies intermediate feature representations during the forward pass. We introduce a Hierarchical Feature Rectification Module (HFRM) that utilizes deep global semantic context to filter out local anomalies in shallow layers. This mechanism generates high-fidelity activation maps directly within a single training loop. Experiments on the LUAD-HistoSeg and BCSS datasets demonstrate that SSHR outperforms state-of-the-art multi-stage methods. Furthermore, SSHR reduces training duration by 2 to 5 times. This efficiency minimizes computational overhead and accelerates clinical translation for large-scale histopathology workflows. The code is available at: https://github.com/trongduc-nguyen/SSHR

URL PDF HTML ☆

赞 0 踩 0

2606.19966 2026-06-19 cs.CV cs.LG 新提交 90%

Semantic-Anchored Evidential Fusion for Domain-Robust Whole-Slide Survival Analysis

语义锚定证据融合用于域鲁棒的全切片生存分析

Yucheng Xing, Ling Huang, Pei Liu, Jingying Ma, Jiaqing Xu, Kai He, Mengling Feng

发表机构 * National University of Singapore（新加坡国立大学）； Imperial College London（帝国理工学院）； Hunan University（湖南大学）

专题命中病理影像：提出SAEFS框架用于全切片生存分析

AI总结提出SAEFS框架，通过视觉问答提取语义锚点，结合双流证据提取和狄利克雷主观逻辑建模不确定性，实现跨域零样本生存分析，平均C-index提升10.2%。

详情

AI中文摘要

全切片图像（WSIs）广泛用于计算癌症预后。然而，现有方法主要关注域内性能，难以泛化到不同临床中心。这一局限性源于它们依赖像素级表示，极易受到染色协议和扫描硬件导致的域特定伪影影响。我们假设高级病理语义（如肿瘤分级和微环境结构）提供了域不变的语义表示，反映了人类病理学家的鲁棒诊断逻辑。因此，我们提出了语义锚定证据融合生存（SAEFS）框架，其中SAEFS通过视觉问答（VQA）从WSIs中推导语义锚点，采用双流WSI证据提取架构，使用基于狄利克雷的主观逻辑建模不确定性，并通过谨慎合取规则融合语义和视觉证据，以避免来自相关源的过度自信融合。仅在单一源域上训练并在四个未见域上进行零样本评估，SAEFS在预测准确性和可靠性上均一致优于最先进模型，平均C-index提升10.2%。定量分析进一步表明，VQA导出的语义特征比像素级特征表现出显著更低的跨中心差异，突显了其在跨中心临床应用中的鲁棒性。

英文摘要

Whole-slide images (WSIs) are widely used for computational cancer prognosis. However, most existing methods primarily focus on in-domain performance and fail to generalize across clinical centers. This limitation stems from their reliance on pixel-derived representations that are highly susceptible to domain-specific artifacts caused by staining protocols and scanner hardware. We hypothesize that high-level pathology semantics, such as tumor grade and micro-environmental architecture, provide a domain-invariant semantic representation that mirrors the robust diagnostic logic of human pathologists. Therefore, we propose a Semantic-Anchored Evidential Fusion Survival (SAEFS) framework, where SAEFS derives semantic anchors from WSIs via Visual Question Answering (VQA), employs a dual-stream WSI evidence extraction architecture, uses Dirichlet-based Subjective Logic to model uncertainty, and fuses semantic and visual evidence through a cautious conjunction rule to avoid overconfident fusion from correlated sources. Trained exclusively on one source domain and evaluated zero-shot across four unseen domains, SAEFS consistently outperforms state-of-the-art models both in prediction accuracy and reliability, improving the average C-index by 10.2%. Quantitative analyses further show that VQA-derived semantic features exhibit significantly lower cross-center divergence than pixel-derived features, highlighting their robustness for cross-center clinical applications.

URL PDF HTML ☆

赞 0 踩 0