Forced Deferral: Manipulating Routing Decisions in Multimodal LLM Cascades

arXiv cs.AI·Zhongye Liu, Yaopei Zeng, Yurui Chang, Lu Lin

6/16/2026

Quick Answer

This paper introduces the Forced Deferral Attack (FDA), which exploits multimodal large language model (MLLM) cascades by lowering the confidence of the weaker model so that queries are rerouted to the stronger, more expensive model.

Quick Take

This adversarial image attack consistently increases strong-model routing across various datasets, outperforming baselines such as image perturbation and prompt injection. The findings highlight a security risk in MLLM cascades: an adversary can influence compute allocation without directly affecting answer correctness.

Key Points

FDA reduces weak model confidence, forcing queries to strong models in MLLM cascades.
The attack uses a temperature-flattened objective to optimize universal border triggers.
FDA outperforms image perturbation and prompt injection methods across various benchmarks.
MLLM cascades expose vulnerabilities that can be exploited to manipulate compute resources.
This research highlights the need for improved security measures in MLLM cascade systems.
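
To make the link between temperature and confidence concrete, here is a minimal Python sketch of how a higher softmax temperature flattens a distribution and lowers a max-probability confidence score. It is an illustration only, not the paper's objective: the function names, the example logits, and the choice of max probability as the confidence measure are all assumptions.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; temperature > 1 flattens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def max_prob_confidence(logits, temperature=1.0):
    """Hypothetical confidence score: the largest class probability."""
    return max(softmax(logits, temperature))

# Made-up logits for three answer options (assumption, not from the paper).
logits = [4.0, 1.0, 0.5]
sharp = max_prob_confidence(logits, temperature=1.0)  # about 0.93
flat = max_prob_confidence(logits, temperature=5.0)   # about 0.49
```

With these made-up numbers, the same logits yield a much lower confidence score once the distribution is flattened, which is the kind of drop that would push a query below a deferral threshold.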

Source Excerpt

arXiv:2606.15308v1 Announce Type: new. Abstract: While multimodal large language models (MLLMs) have shown strong visual reasoning abilities, serving a large model for every query is computationally expensive. MLLM cascades mitigate this cost by first querying a weak but cheaper model and deferring to a strong model when the weak model's output is unconfident.

However, since the weak model's confidence directly controls compute allocation, these systems expose a new attack surface: an adversary can manipulate confidence so that their queries are consistently deferred to the strong model. …
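
The deferral rule described in the excerpt can be sketched as a simple confidence-threshold router. This is a minimal illustration under stated assumptions, not the system studied in the paper: the `route` function, the `CascadeResult` type, the 0.7 threshold, and the stub models are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

@dataclass
class CascadeResult:
    answer: str
    routed_to: str          # "weak" or "strong"
    weak_confidence: float  # confidence reported by the weak model

def route(
    query: str,
    weak_model: Callable[[str], Tuple[str, float]],
    strong_model: Callable[[str], str],
    threshold: float = 0.7,  # assumed deferral threshold
) -> CascadeResult:
    """Answer with the weak model unless its confidence falls below the threshold."""
    answer, confidence = weak_model(query)
    if confidence >= threshold:
        return CascadeResult(answer, "weak", confidence)
    # Low confidence: defer to the stronger, more expensive model.
    return CascadeResult(strong_model(query), "strong", confidence)

# Stub models for illustration only (hypothetical behavior).
def stub_weak(query: str) -> Tuple[str, float]:
    return ("cat", 0.9) if "clear" in query else ("cat", 0.4)

def stub_strong(query: str) -> str:
    return "tabby cat"

confident = route("clear photo of a cat", stub_weak, stub_strong)
deferred = route("blurry photo of a cat", stub_weak, stub_strong)
```

Because the routing decision depends only on the weak model's confidence, anything that lowers that score moves the query onto the expensive path, even when the weak model's answer itself is unchanged.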
