Principled Reflection Separation via Nonlinear Superposition and Feature Interaction

arXiv cs.CV·Qiming Hu, Mingjia Li, Yuntong Li, Xiaojie Guo

6/3/2026 · ~2 min read · en

Quick Take

This study introduces a learnable nonlinear superposition model for single-image reflection separation, addressing limitations of linear composition models. The proposed dual-stream interactive framework enhances decomposition fidelity and generalization across real-world benchmarks, revealing that effective reflection separation involves learning nonlinear interactions rather than merely reversing linear mixtures.

Key Points

Introduces a nonlinear superposition model for improved reflection separation.
Proposes a dual-stream framework modeling bidirectional dependencies.
Demonstrates superior performance on diverse real-world benchmarks.
Unifies activation-, gating-, and attention-based interaction mechanisms in one framework.
Reveals reflection separation involves learning nonlinear interactions.

Article Content

From source RSS / original summary

arXiv:2606.02831v1 Announce Type: new Abstract: Single-image reflection separation is fundamentally challenged by the entanglement of transmission and reflection layers under complex image formation processes. Existing approaches largely rely on simplified assumptions or independent modeling, limiting their ability to handle real-world scenarios. In this work, we revisit the problem from a unified perspective and identify a key issue of existing approaches, i.e., the widely adopted linear composition model in the sRGB domain fails to capture the nonlinear coupling introduced by real-world image signal processing pipelines. To address this, we introduce a learnable nonlinear superposition model that more faithfully characterizes layer interactions and improves decomposition fidelity.
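The gap between the linear sRGB model and a nonlinear formation process can be illustrated numerically. The sketch below is not the paper's learnable model: it assumes a plain power-law tone curve (GAMMA = 2.2) as a stand-in for a real image signal processing pipeline, and the function names are hypothetical. It compares adding two layers directly in the display domain with adding them as linear light and encoding the result afterwards.

```python
import numpy as np

GAMMA = 2.2  # assumption: a simple power-law curve standing in for a camera ISP


def encode(linear):
    """Map linear-light intensities in [0, 1] to display-domain values."""
    return np.clip(linear, 0.0, 1.0) ** (1.0 / GAMMA)


def decode(display):
    """Invert the assumed tone curve: display-domain values to linear light."""
    return np.clip(display, 0.0, 1.0) ** GAMMA


def linear_display_mix(t_disp, r_disp):
    """The common simplification: add the two layers in the display domain."""
    return np.clip(t_disp + r_disp, 0.0, 1.0)


def linear_light_mix(t_disp, r_disp):
    """Add the layers as linear light, then apply the tone curve once."""
    return encode(decode(t_disp) + decode(r_disp))


# Hypothetical transmission and reflection intensities in the display domain.
t = np.array([0.2, 0.5, 0.7])
r = np.array([0.1, 0.3, 0.2])

gap = linear_display_mix(t, r) - linear_light_mix(t, r)
print("display-domain sum:", linear_display_mix(t, r))
print("linear-light sum:  ", linear_light_mix(t, r))
print("difference:        ", gap)
```

Under this assumed curve the display-domain sum overestimates the mixed intensity at every sample, so subtracting an estimated reflection in the display domain would not recover the transmission exactly. Real pipelines add further nonlinear steps (tone mapping, white balance, compression), which is the kind of coupling the abstract refers to.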

Building upon this formulation, we propose a generalized dual-stream interactive framework that explicitly models bidirectional dependencies between transmission and reflection through feature exchange. This framework unifies activation-, gating-, and attention-based interaction mechanisms, and is compatible with both CNN and Transformer backbones. Extensive experiments on diverse real-world benchmarks demonstrate that the proposed approach achieves superior performance with strong generalization capability.
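As a rough illustration of bidirectional feature exchange, the sketch below shows one gating-style interaction step between a transmission stream and a reflection stream. It is an assumption-laden toy, not the authors' architecture: the function `gated_exchange`, the weight matrices `w_t` and `w_r`, and the feature shapes are all hypothetical, and the abstract does not specify how the gates are computed.

```python
import numpy as np


def sigmoid(x):
    """Elementwise logistic function, mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def gated_exchange(feat_t, feat_r, w_t, w_r):
    """One hypothetical bidirectional exchange step between two streams.

    Each stream computes a gate from its own features and uses it to
    decide how much of the other stream's features to add to itself.
    """
    gate_t = sigmoid(feat_t @ w_t)  # gate controlling what the T stream admits
    gate_r = sigmoid(feat_r @ w_r)  # gate controlling what the R stream admits
    new_t = feat_t + gate_t * feat_r
    new_r = feat_r + gate_r * feat_t
    return new_t, new_r


# Toy features: 4 spatial positions, 8 channels per stream (assumed sizes).
rng = np.random.default_rng(0)
feat_t = rng.standard_normal((4, 8))
feat_r = rng.standard_normal((4, 8))
w_t = rng.standard_normal((8, 8)) * 0.1
w_r = rng.standard_normal((8, 8)) * 0.1

new_t, new_r = gated_exchange(feat_t, feat_r, w_t, w_r)
print(new_t.shape, new_r.shape)
```

In this toy version the gate could be replaced by a fixed activation or by attention weights computed across positions, which loosely mirrors the three interaction families named in the abstract. In a trained model the weights would be learned rather than random.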

More importantly, our study reveals that reflection separation is not about undoing a linear mixture, but about learning nonlinear formation and interaction, offering new insights into the design of principled image decomposition models. Code and models are publicly available at https://mingcv.github.io/DIRS-Page.

