FS-DVS: A Frequency-Selective Dynamic Visual Sensing Paradigm for Enhancing Information Completeness

arXiv cs.CV·Feiyu Ji, Xiaokang Yang, Xiaoyun Yuan

6/8/2026

Quick Answer

This paper shows that FS-DVS (Frequency-Selective Dynamic Vision Sensor) enhances information completeness by integrating a learnable spatial filter that mimics retinal ganglion cell aggregation, improving object detection and action recognition performance.

Quick Take

FS-DVS places a learnable spatial filter before event triggering to mimic retinal ganglion cell aggregation. The learned filter emphasizes mid-spatial-frequency components, in line with the human contrast sensitivity function (CSF). Rather than simply increasing sensor sensitivity or relying on post-processing, the paradigm enhances information selectively while remaining resilient to noise.

Key Points

FS-DVS integrates a learnable spatial filter before event triggering.
Starting from a delta function, the learned filter evolves into center-surround patterns that emphasize mid-frequency components.
The paper reports substantial performance gains in object detection and action recognition.
The learned filters converge to human-like contrast sensitivity function (CSF) characteristics across different tasks.
The authors present FS-DVS as a robust, biologically plausible blueprint for next-generation neuromorphic sensors.

Article Content

From source RSS / original summary

arXiv:2606.06856v1. Abstract: Dynamic vision sensors (DVS) offer exceptional temporal resolution and dynamic range by asynchronously reporting pixel-level intensity changes. However, conventional DVS rely on a per-pixel independent triggering mechanism, ignoring the spatial integration performed by biological retinal ganglion cells (RGCs).
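For readers new to event cameras, the per-pixel triggering described above can be sketched as follows. This is the commonly used idealized model (a pixel fires when its log-intensity moves past a contrast threshold), not code from the paper; the function name `dvs_events`, the threshold of 0.2, and the one-event-per-step simplification are assumptions made for illustration.

```python
import math

def dvs_events(prev_ref, frame, threshold=0.2, eps=1e-6):
    """Idealized per-pixel DVS step (illustrative, not the paper's simulator).

    prev_ref: 2D list of reference log-intensities, one per pixel.
    frame:    2D list of current linear intensities.
    Returns (events, new_ref); events is a list of (row, col, polarity).
    """
    events = []
    new_ref = [row[:] for row in prev_ref]
    for r, row in enumerate(frame):
        for c, value in enumerate(row):
            log_i = math.log(value + eps)
            diff = log_i - prev_ref[r][c]
            # Each pixel decides alone: neighbours are never consulted.
            if abs(diff) >= threshold:
                events.append((r, c, 1 if diff > 0 else -1))
                new_ref[r][c] = log_i  # reset the reference after firing
    return events, new_ref

# Usage: one pixel brightens by 50%, another by only 10%.
ref = [[math.log(1.0 + 1e-6)] * 2 for _ in range(2)]
frame = [[1.5, 1.1], [1.0, 1.0]]
events, ref = dvs_events(ref, frame)
```

In this toy run the 10% change never reaches the threshold, so it produces no event and is lost, which is the sub-threshold signal loss the abstract refers to.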

Consequently, they lack the contrast sensitivity function (CSF) and its inherent sensitivity to mid-spatial frequencies, which inevitably leads to information incompleteness due to sub-threshold signal loss. To bridge this gap, we propose FS-DVS (Frequency-Selective Dynamic Vision Sensor), a novel paradigm that integrates a learnable spatial filter strictly preceding the event triggering process to mimic the RGC aggregation mechanism.
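A rough sketch of what filtering before the trigger changes, under stated assumptions: a hand-picked 3x3 center-surround kernel stands in for the learned filter, the inputs are log-intensity change maps, and the threshold is 0.2. The kernel values and helper names (`filter2d`, `count_events`) are hypothetical. A 3x3 kernel is too small to show a true mid-frequency peak; the point is only that spatial weighting ahead of the threshold treats a localized change differently from a uniform one.

```python
def filter2d(image, kernel):
    """Correlate a 2D list with an odd-sized kernel, replicating edge pixels."""
    h, w = len(image), len(image[0])
    kh, kw = len(kernel) // 2, len(kernel[0]) // 2
    out = [[0.0] * w for _ in range(h)]
    for r in range(h):
        for c in range(w):
            acc = 0.0
            for dr in range(-kh, kh + 1):
                for dc in range(-kw, kw + 1):
                    rr = min(max(r + dr, 0), h - 1)  # clamp to the border
                    cc = min(max(c + dc, 0), w - 1)
                    acc += kernel[dr + kh][dc + kw] * image[rr][cc]
            out[r][c] = acc
    return out

def count_events(change, threshold=0.2):
    """Count pixels whose (filtered) log-intensity change reaches the threshold."""
    return sum(abs(v) >= threshold for row in change for v in row)

# Hypothetical kernels: a delta (identity) and a hand-picked center-surround.
DELTA = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
CENTER_SURROUND = [[-0.125] * 3, [-0.125, 2.0, -0.125], [-0.125] * 3]

# A localized change of 0.15 and a uniform change of 0.15, both sub-threshold.
spot = [[0.0] * 5 for _ in range(5)]
spot[2][2] = 0.15
uniform = [[0.15] * 5 for _ in range(5)]

spot_plain = count_events(filter2d(spot, DELTA))
spot_filtered = count_events(filter2d(spot, CENTER_SURROUND))
uniform_filtered = count_events(filter2d(uniform, CENTER_SURROUND))
```

With the delta kernel the spot stays silent. With the center-surround kernel the spot crosses the threshold while the uniform change still does not, which is the difference between selective enhancement and simply lowering the threshold for every pixel.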

By developing a differentiable event simulation framework, the spatial filter can be optimized end-to-end with downstream tasks. Our study reveals that starting from a delta function, the learned spatial filters spontaneously evolve into center-surround patterns that emphasize mid-frequency components, consistently aligning with human CSF.
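To make the "delta function versus center-surround" contrast concrete, the sketch below compares the magnitude response of a 1D delta kernel with that of a difference-of-Gaussians (DoG) kernel, the classical model of a center-surround receptive field. The DoG and its sigmas (1.0 and 3.0) are assumptions chosen for illustration; the paper's filters are learned, and their actual shape is not given in the abstract.

```python
import math

def gaussian(x, sigma):
    return math.exp(-x * x / (2 * sigma * sigma))

def normalized(values):
    total = sum(values)
    return [v / total for v in values]

def magnitude_response(kernel, n_freqs=17):
    """|DFT| of a 1D kernel at n_freqs frequencies from 0 to 0.5 cycles/pixel."""
    half = len(kernel) // 2
    response = []
    for i in range(n_freqs):
        f = 0.5 * i / (n_freqs - 1)
        re = sum(k * math.cos(2 * math.pi * f * (j - half)) for j, k in enumerate(kernel))
        im = sum(k * math.sin(2 * math.pi * f * (j - half)) for j, k in enumerate(kernel))
        response.append(math.hypot(re, im))
    return response

xs = range(-10, 11)
# Delta kernel: the initialization mentioned in the abstract.
delta = [1.0 if x == 0 else 0.0 for x in xs]
# Hypothetical center-surround: a narrow Gaussian minus a wider one.
center = normalized([gaussian(x, 1.0) for x in xs])
surround = normalized([gaussian(x, 3.0) for x in xs])
dog = [c - s for c, s in zip(center, surround)]

delta_resp = magnitude_response(delta)
dog_resp = magnitude_response(dog)
# Index of the frequency at which the DoG response is largest.
peak_index = max(range(len(dog_resp)), key=dog_resp.__getitem__)
```

The delta kernel passes every spatial frequency equally, so it reproduces conventional per-pixel behavior. The DoG response is near zero at the lowest frequency, rises to a peak at an intermediate frequency, and falls off again toward the highest, which is the band-pass shape associated with a contrast sensitivity function.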

Beyond achieving substantial performance gains in object detection and action recognition, the consistent convergence to human-like CSF characteristics across different tasks underscores the universality of this mid-frequency selective mechanism. Compared to naively increasing sensor sensitivity or relying on post-processing, our paradigm achieves selective information enhancement with high noise resilience, providing a robust, biologically plausible blueprint for next-generation neuromorphic sensors.
