DarkVGGT: Seeing Through Darkness Using Thermal Geometry without Daylight Tax

arXiv cs.CV·Minseong Kweon, Wenyuan Zhao, Nuo Chen, Lulin Liu, Huiwen Han, Zihao Zhu, Srinivas Shakkottai, Chao Tian, Zhiwen Fan

2d ago

·~1 min·6/11/2026·en·0

Quick Answer

DarkVGGT introduces a novel RGB-T feed-forward geometry framework that enhances 3D scene estimation in low-light conditions using thermal modeling.

Quick Take

DarkVGGT introduces a novel RGB-T feed-forward geometry framework that enhances 3D scene estimation in low-light conditions using thermal modeling. It outperforms existing methods in depth and camera pose estimation on low-visibility benchmarks, addressing the limitations of traditional RGB-based approaches.

Key Points

DarkVGGT employs physics-aware thermal modeling for robust 3D estimation.
It features thermal factorization to extract reliable thermal cues.
The framework improves depth and camera pose estimation in low-light scenarios.
Experiments show consistent performance gains over existing geometry baselines.
It maintains effectiveness in well-lit environments while enhancing low-light performance.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Article Content

From source RSS / original summary

arXiv:2606. 11326v1 Announce Type: new Abstract: Recent feed-forward 3D reconstruction methods have demonstrated strong performance and flexibility in efficient end-to-end scene geometry estimation from image streams. However, their reliance on visible-light appearance makes them vulnerable in dark and low-visibility environments, where RGB cues are severely degraded and geometric evidence becomes ambiguous.

To address this challenge, we propose DarkVGGT, an RGB-T feed-forward geometry framework that uses physics-aware thermal modeling for robust 3D estimation in low-light scenes. DarkVGGT introduces two complementary modules. First, physics-inspired thermal factorization extracts emissive-dominant, geometry-consistent thermal cues while isolating sparse reflective residuals that may introduce geometric ambiguity.

Second, geometry-shared thermal routing isolates modality-invariant geometric structures from thermal-specific patterns, selectively injecting reliability-aware structural guidance into the RGB stream. Together, these components enable accurate thermal-informed geometry estimation under degraded RGB conditions while largely preserving performance in well-lit environments.

Experiments on low-visibility RGB-T benchmarks demonstrate consistent improvements in both depth and camera pose estimation over existing feed-forward geometry baselines.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CV

See more →

arXiv cs.CV·Shahrzad Esmat, Chaunte W. Lacewell, Sameh Gobriel, Nilesh Jain, Ali Jannesari

1w ago

FeaturedOriginal

LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval

AI Summary

A phase-aware LLM agent optimizes human-object interaction retrieval, outperforming Optuna TPE by 33.3% and VDTuner by 34.2% on the HICO-DET benchmark. This method enhances throughput by 15.3x over UniIR and demonstrates strong transferability across vector database management systems.

#LLM #Agent #Inference #AI Startup