LLMs Contain Multitudes: How Deployment Context Reshapes Model-Level Preferences and Values
Quick Answer
This study reveals that deployment context significantly alters the preferences and values of large language models (LLMs), with context-induced rank shifts in country preferences and utility judgments across five models.
Quick Take
This study reveals that deployment context significantly alters the preferences and values of large language models (LLMs), with context-induced rank shifts in country preferences and utility judgments across five models. The findings indicate that model-level properties are context-dependent, challenging the notion of stable preferences in LLMs.
Key Points
- Deployment context causes greater variation in model preferences than prompt perturbations.
- Significant rank shifts observed in country preferences across 15 countries.
- Utility judgments show substantial variation in fine-grained rankings within domains.
- Cardinal exchange rates between outcomes can shift by a factor of 2.47 at the median.
- Model-level preferences are better understood as context-conditioned rather than fixed.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2606. 13944v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly characterised in recent evaluation work as having stable, model-level preference and value systems. However, accompanying robustness checks are limited to incidental prompt perturbations such as syntax variation and option reordering. This leaves open whether the measured properties survive when the surrounding task context changes, as it does in most real deployments.
We test this directly across two established pairwise paradigms: ranking country preferences and eliciting utility judgements. In both, we make the deployment context -- the high-level task the model is performing while making concrete value-dependent choices -- our controlled variable, varied across framings such as writing a Reddit post or a news article. Across five LLMs and over 1. 2M pairwise decisions, deployment context produces variation far larger than prompt paraphrasing and temperature controls.
In country preference rankings over 15 countries, context induces widespread, statistically significant rank shifts; the aggregate Global North favouritism reported in prior work is itself context-dependent, with each model's bias shifting systematically across contexts. In utility elicitation over 50 outcomes, broad cross-category ordering is preserved, but fine-grained rankings within domains vary substantially, and cardinal exchange rates between outcomes (e. g.
how many lives in one region equal one in another) shift by a factor of 2. 47 at the median. Reported model-level preferences and utilities are therefore better understood as context-conditioned measurements than fixed model-level properties: safety guarantees obtained under one framing provide limited assurance in another.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CL
See more →Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?
The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.