arXivDaily arXiv每日学术速递 周一至周五更新

视觉与机器人

3D 视觉

三维重建、NeRF、Gaussian Splatting、点云和空间智能。

今日/当前日期收录 1 信号源:cs.CV, cs.GR, cs.RO
2606.20531 2026-06-19 cs.CV 新提交 专题 85

VisDom: Sparse Novel View Synthesis with Visible Domain Constraint

VisDom: 具有可见域约束的稀疏新视角合成

Mariia Gladkova*, Tarun Yenamandra*, Edmond Boyer, Robert Maier, Tony Tung, Daniel Cremers

发表机构 * TU Munich(慕尼黑工业大学) MCML(慕尼黑机器学习中心)

专题命中 NeRF :提出可见域约束增强NeRF和GS的稀疏视图合成

AI总结 提出VisDom,一种无学习的几何约束,通过最小多视角可见性要求增强视觉外壳重建,作为稀疏新视角合成中的空间先验,集成到NeRF和GS管线中,从四张输入图像实现高质量重建。

详情
AI中文摘要

稀疏新视角合成(NVS)由于从少量输入视角恢复3D几何的歧义性仍然具有挑战性。虽然基于NeRF和高斯泼溅(GS)的方法在密集监督下表现良好,但在稀疏设置中它们往往过拟合,产生漂浮伪影和不一致的几何。轮廓一致性通常用作正则化器,但还不够,因为轮廓一致区域可能超出真实物体几何。我们引入VisDom,一种无学习的几何约束,通过强制执行最小多视角可见性要求来增强经典的基于雕刻的视觉外壳重建。具体地,我们将可见域定义为至少被$K$个视角观察到的3D空间子集,并将其用作标准基于轮廓重建之上的额外过滤标准。这在稀疏视角设置中提供了更强的空间先验。我们通过限制体积采样和指导优化过程中的高斯放置,将VisDom集成到隐式(NeRF)和显式(GS)管线中。在三个具有挑战性的数据集上的实验表明,稀疏NVS的一致改进,使得从仅四张输入图像就能实现高质量以物体为中心的重建。我们的方法是领域无关的,仅需要轮廓,并且不引入学习参数,使其成为现有方法的简单补充。在GaussianObject之上应用VisDom进一步提高了在Omni3D和MipNeRF360上的性能,同时以22倍的训练成本匹配或超越它。

英文摘要

Sparse novel view synthesis (NVS) remains challenging due to the ambiguity of recovering 3D geometry from few input views. While NeRF- and Gaussian Splatting (GS)-based methods perform well with dense supervision, they often overfit in sparse settings, producing floating artifacts and inconsistent geometry. Silhouette consistency is commonly used as a regularizer, but it remains insufficient, as silhouette-consistent regions can extend beyond the true object geometry. We introduce VisDom, a learning-free geometric constraint that augments classical carving-based visual hull reconstruction by enforcing a minimum multi-view visibility requirement. Specifically, we define a visible domain as the subset of 3D space observed by at least $K$ views and use it as an additional filtering criterion on top of standard silhouette-based reconstruction. This provides a stronger spatial prior in sparse-view settings. We integrate VisDom into both implicit (NeRF) and explicit (GS) pipelines by restricting volumetric sampling and guiding Gaussian placement during optimization. Experiments on three challenging datasets show consistent improvements in sparse-view NVS, enabling high-quality object-centric reconstruction from as few as four input images. Our method is domain-agnostic, requires only silhouettes, and introduces no learned parameters, making it a simple complement to existing approaches. Applying VisDom on top of GaussianObject further improves performance on Omni3D and MipNeRF360, while matching or surpassing it at 22 $\times$ lower training cost.