ARIADNE: Agnostic Routing for Inference-time Adapter DyNamic sElection
Quick Answer
ARIADNE introduces a training-free, adapter-agnostic routing framework for dynamic adapter selection at inference time, achieving 97.44% of upper bound performance on Llama 3.2 1B Instruct across 23 NLP tasks.
Quick Take
ARIADNE introduces a training-free, adapter-agnostic routing framework for dynamic adapter selection at inference time, achieving 97.44% of upper bound performance on Llama 3.2 1B Instruct across 23 NLP tasks. It scales to 44 tasks with 89.7% average selection accuracy without requiring additional training or access to adapter internals.
Key Points
- ARIADNE uses centroids from training set embeddings for adapter selection.
- Compatible with any parameter-efficient fine-tuning methods without modifications.
- Achieves 97.44% performance on Llama 3.2 1B Instruct across 23 tasks.
- Scales to 44 tasks with an average selection accuracy of 89.7%.
- Eliminates the need for additional router training or internal adapter access.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2606. 19079v1 Announce Type: new Abstract: The increasing deployment of parameter-efficient fine-tuning (PEFT) has led to model ecosystems in which a single backbone is paired with many task-specialized adapters. In this setting, inference-time queries often arrive without task labels, requiring the system to automatically select the most appropriate adapter from a growing and heterogeneous adapter pool.
Existing routing methods either depend on access to adapter internals, such as weight decompositions or gradient-based statistics, or require additional router training, which limits scalability and portability as new adapters are added. We introduce ARIADNE, a training-free, adapter-agnostic routing framework for dynamic adapter selection at inference time.
ARIADNE represents each adapter through a set of centroids computed from embeddings of its training set, capturing the data distribution associated with that adapter. Given an unlabeled input, it selects an adapter by measuring proximity to these centroids in latent space. Because routing is performed entirely in the input embedding space, ARIADNE is compatible with arbitrary PEFT methods and requires no modification to the adapters or training procedures. Primarily evaluated with Llama 3.
2 1B Instruct on 23 diverse NLP tasks, ARIADNE recovers 97. 44% of the upper bound performance. Scaling to 44 tasks, it achieves 89. 7% average selection accuracy, without additional training or access to adapter internals.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →Arbor: Tree Search as a Cognition Layer for Autonomous Agents
Arbor introduces a multi-agent framework utilizing structured tree search for optimizing LLM inference, achieving up to 193% throughput-latency improvement compared to vendor-optimized systems. It employs an Orchestrator and Critic agent for stability and coordination, demonstrating hardware-agnostic performance with minimal variance.