RAMS: Resource-Adaptive and Detection-Conditioned Model Switching for Embedded Edge Perception

arXiv cs.CV·Kushal Khemani, Evan Leri, George Xu, Amit Hod

6/16/2026

·~2 min·6/16/2026·en·1

Quick Answer

RAMS introduces a lightweight runtime controller for dynamic model switching among YOLOv8 tiers on embedded devices, achieving up to 5.6x faster inference with 74% accuracy retention.

Quick Take

Under heavy load, detection-conditioned switching improves accuracy scores significantly, demonstrating effective resource adaptation for edge perception tasks.

Key Points

RAMS dynamically selects among YOLOv8 NANO, SMALL, and MEDIUM tiers without model-reload latency.
The safety2 policy on Jetson Orin achieves 3.41 ms mean latency, 5.6x faster than fixed-MEDIUM.
Detection-conditioned switching improves SWAS by 25.4% under oracle scoring and 47.3% under detector-derived scoring.
Live KITTI evaluation shows VRU recall rates of 24.2%, 41.2%, and 59.0% across tiers.
RAMS operates effectively across various platforms, including Raspberry Pi 5 and x86 laptops.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Source Excerpt

arXiv:2606. 14716v1 Announce Type: new Abstract: Edge object detection on embedded hardware requires balancing inference latency and detection quality under changing resource pressure. We present RAMS, a lightweight runtime controller that monitors device pressure, calibrates switching thresholds from idle behavior, and dynamically selects among three resident YOLOv8 tiers (NANO/SMALL/MEDIUM at 320/416/640 px) without model-reload latency.

RAMS defines five switching policies, including two detection-conditioned variants that prevent aggressive downgrades after recent vulnerable-road-user (VRU) detections. …

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CV

See more →

arXiv cs.CV·Aavash Chhetri, Bibek Niroula, Eduard Vazquez, Yash Raj Shrestha, Prashnna Gyawali, Loris Bazzani, Binod Bhattarai

3w ago

FeaturedOriginal

ProMoE-FL: Prototype-conditioned Mixture of Experts for Multimodal Federated Learning with Missing Modalities

AI Summary

ProMoE-FL introduces a Prototype-conditioned Mixture-of-Experts framework for multimodal federated learning, effectively addressing missing modalities. It outperforms existing methods on four chest X-ray datasets, demonstrating superior feature synthesis capabilities in both homogeneous and heterogeneous settings.

#LLM #AI Coding #AI Startup #Enterprise AI

RAMS: Resource-Adaptive and Detection-Conditioned Model Switching for Embedded Edge Perception

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CV

ProMoE-FL: Prototype-conditioned Mixture of Experts for Multimodal Federated Learning with Missing Modalities

-Guided ANN Index Optimization for Human-Object Interaction Retrieval

Eddeep: a deep-learning framework for fast eddy-current distortion correction in diffusion MRI

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CV

ProMoE-FL: Prototype-conditioned Mixture of Experts for Multimodal Federated Learning with Missing Modalities

LLM-Guided ANN Index Optimization for Human-Object Interaction Retrieval

Eddeep: a deep-learning framework for fast eddy-current distortion correction in diffusion MRI

-Guided ANN Index Optimization for Human-Object Interaction Retrieval