arXiv 论文速递

Snapshot: 20260210_0409

PANC: Prior-Aware Normalized Cut for Object Segmentation

Authors: Juan Gutiérrez, Victor Gutiérrez-Garcia, José Luis Blanco-Murillo

First: 2026-02-06T18:07:20+00:00 · Latest: 2026-02-06T18:07:20+00:00

Abstract

Fully unsupervised segmentation pipelines naively seek the most salient object, should this be present. As a result, most of the methods reported in the literature deliver non-deterministic partitions that are sensitive to initialization, seed order, and threshold heuristics. We propose PANC, a weakly supervised spectral segmentation framework that uses a minimal set of annotated visual tokens to produce stable, controllable, and reproducible object masks. From the TokenCut approach, we augment the token-token affinity graph with a handful of priors coupled to anchor nodes. By manipulating the graph topology, we bias the spectral eigenspace toward partitions that are consistent with the annotations. Our approach preserves the global grouping enforced by dense self-supervised visual features, trading annotated tokens for significant gains in reproducibility, user control, and segmentation quality. Using 5 to 30 annotations per dataset, our training-free method achieves state-of-the-art performance among weakly and unsupervised approaches on standard benchmarks (e.g., DUTS-TE, ECSSD, MS COCO). Contrarily, it excels in domains where dense labels are costly or intra-class differences are subtle. We report strong and reliable results on homogeneous, fine-grained, and texture-limited domains, achieving 96.8% (+14.43% over SotA), 78.0% (+0.2%), and 78.8% (+0.37%) average mean intersection-over-union (mIoU) on CrackForest (CFD), CUB-200-2011, and HAM10000 datasets, respectively. For multi-object benchmarks, the framework showcases explicit, user-controllable semantic segmentation.

中文标题/摘要

标题：PANC：先验归一化切分用于对象分割

完全无监督的分割管道通常会寻找最显眼的对象，如果存在的话。因此，文献中报道的大多数方法会生成非确定性的分区，这些分区对初始化、种子顺序和阈值启发式方法敏感。我们提出了一种弱监督的谱分割框架PANC，该框架使用少量注释的视觉标记来生成稳定、可控和可重复的对象掩码。从TokenCut方法出发，我们通过将少量先验与锚节点结合来增强标记-标记亲和图。通过操纵图的拓扑结构，我们偏向于与注释一致的谱特征空间。我们的方法保留了由密集自监督视觉特征强制执行的全局分组，用注释的标记换取更高的可重复性、用户控制和分割质量。使用每数据集5到30个注释，我们的无需训练方法在标准基准（如DUTS-TE、ECSSD、MS COCO）上实现了弱监督和无监督方法中的最佳性能。在密集标签成本高或类内差异细微的领域，它表现出色。我们在同质、细粒度和纹理受限领域报告了强而可靠的结果，分别在CrackForest (CFD)、CUB-200-2011和HAM10000数据集上实现了96.8%（+14.43%超过SotA）、78.0%（+0.2%）和78.8%（+0.37%）的平均交并比（mIoU）。对于多对象基准，该框架展示了明确、用户可控的语义分割。

Summary / 总结

PANC is a weakly supervised spectral segmentation framework that uses a minimal set of annotated visual tokens to produce stable and controllable object masks. By augmenting the token-token affinity graph with priors, PANC biases the spectral eigenspace towards partitions consistent with the annotations, while preserving global grouping enforced by dense self-supervised visual features. On standard benchmarks, PANC achieves state-of-the-art performance with 5 to 30 annotations per dataset, reporting 96.8% (14.43% over SotA), 78.0% (+0.2%), and 78.8% (+0.37%) average mIoU on CrackForest, CUB-200-2011, and HAM10000 datasets, respectively.

PANC 是一种弱监督光谱分割框架，使用少量注释的视觉标记来生成稳定且可控的对象掩码。通过在标记标记图中添加先验知识，PANC 将谱特征空间偏向与注释一致的分割，同时保留由密集自监督视觉特征强制执行的全局分组。该方法在每个数据集使用 5 到 30 个注释时实现了最先进的性能，并在同质性、细粒度和纹理受限领域展示了强大的结果，分别在 CrackForest (CFD)、CUB-200-2011 和 HAM10000 数据集上实现了 96.8%、78.0% 和 78.8% 的平均交并比 (mIoU)。

Prompt Reinjection: Alleviating Prompt Forgetting in Multimodal Diffusion Transformers

Authors: Yuxuan Yao, Yuxuan Chen, Hui Li, Kaihui Cheng, Qipeng Guo, Yuwei Sun, Zilong Dong, Jingdong Wang, Siyu Zhu

First: 2026-02-06T17:19:53+00:00 · Latest: 2026-02-06T17:19:53+00:00

Comments: 18 pages