Beyond Benchmarks: Continuous Edge Inference for Fine-Grained Roadside Perception
Quick Answer
Edge-TSR is a continuous edge inference system for roadside perception on NVIDIA Jetson Orin Nano, addressing performance degradation of 20-30% in real-world deployments compared to benchmarks.
Quick Take
Edge-TSR is a continuous edge inference system for roadside perception on NVIDIA Jetson Orin Nano, addressing performance degradation of 20-30% in real-world deployments compared to benchmarks. It enhances classification accuracy by up to 10.16% while maintaining 16.18 FPS under thermal limits without cloud offload, highlighting the need for deployment-aware evaluation in edge AI systems.
Key Points
- Edge-TSR integrates detection, tracking, and classification with minimal computational overhead.
- Real-world deployment shows 20-30% performance drop compared to static-image benchmarks.
- Achieves up to 10.16% accuracy improvement over per-frame inference baselines.
- Sustains 16.18 FPS during a 55-minute vehicular deployment without cloud offload.
- Releases a sample annotated video dataset for reproducible evaluation.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2606. 17241v1 Announce Type: new Abstract: Continuous AI inference on resource-constrained edge hardware introduces deployment effects that are largely invisible to conventional benchmark evaluation, including temporal instability in streaming video, thermal throttling under sustained load, and workload-dependent performance variability. We present Edge-TSR, a deployment-oriented continuous edge inference system for sustained roadside perception on the NVIDIA Jetson Orin Nano.
Edge-TSR integrates detection, tracking, fine-grained classification, and a lightweight track-aware temporal stabilization mechanism that improves streaming inference consistency with negligible computational overhead. Our central finding is that benchmark-centric evaluation systematically overstates deployed edge inference performance. Across three state-of-the-art baselines, we observe consistent 20-30% relative degradation when transitioning from static-image evaluation to real-world streaming deployment.
Edge-TSR addresses this gap through temporal inference stabilization, recovering up to 10. 16% classification accuracy over per-frame inference baselines while maintaining sustained real-time performance under continuous operation. We evaluate the complete system under diverse real-world deployment conditions, jointly characterizing inference quality, latency, throughput, and thermal behavior during long-duration operation. A 55-minute vehicular deployment over a 26 km route demonstrates sustained operation at 16.
18 FPS within safe thermal limits on a single embedded device without cloud offload. Our findings show that deployment-aware evaluation and temporal inference stabilization are necessary components of continuously operating edge AI systems intended for real-world sensing deployments. We release a sample annotated streaming video evaluation dataset and full system implementation to support reproducible deployment-centric evaluation.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CV
See more →LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval
A phase-aware LLM agent optimizes human-object interaction retrieval, outperforming Optuna TPE by 33.3% and VDTuner by 34.2% on the HICO-DET benchmark. This method enhances throughput by 15.3x over UniIR and demonstrates strong transferability across vector database management systems.


