LLMs Contain Multitudes: How Deployment Context Reshapes Model-Level Preferences and Values

arXiv cs.CL·Filip Trhlik, Aoife O'Flynn, Angela Yu, Arduin Findeis, Paula Buttery

6h ago

·~2 min·6/15/2026·en·0

Quick Answer

Quick Take

This study reveals that deployment context significantly alters the preferences and values of large language models (LLMs), with context-induced rank shifts in country preferences and utility judgments across five models. The findings indicate that model-level properties are context-dependent, challenging the notion of stable preferences in LLMs.

Key Points

Deployment context causes greater variation in model preferences than prompt perturbations.
Significant rank shifts observed in country preferences across 15 countries.
Utility judgments show substantial variation in fine-grained rankings within domains.
Cardinal exchange rates between outcomes can shift by a factor of 2.47 at the median.
Model-level preferences are better understood as context-conditioned rather than fixed.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Article Content

From source RSS / original summary

arXiv:2606. 13944v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly characterised in recent evaluation work as having stable, model-level preference and value systems. However, accompanying robustness checks are limited to incidental prompt perturbations such as syntax variation and option reordering. This leaves open whether the measured properties survive when the surrounding task context changes, as it does in most real deployments.

We test this directly across two established pairwise paradigms: ranking country preferences and eliciting utility judgements. In both, we make the deployment context -- the high-level task the model is performing while making concrete value-dependent choices -- our controlled variable, varied across framings such as writing a Reddit post or a news article. Across five LLMs and over 1. 2M pairwise decisions, deployment context produces variation far larger than prompt paraphrasing and temperature controls.

In country preference rankings over 15 countries, context induces widespread, statistically significant rank shifts; the aggregate Global North favouritism reported in prior work is itself context-dependent, with each model's bias shifting systematically across contexts. In utility elicitation over 50 outcomes, broad cross-category ordering is preserved, but fine-grained rankings within domains vary substantially, and cardinal exchange rates between outcomes (e. g.

how many lives in one region equal one in another) shift by a factor of 2. 47 at the median. Reported model-level preferences and utilities are therefore better understood as context-conditioned measurements than fixed model-level properties: safety guarantees obtained under one framing provide limited assurance in another.

Reader Mode unavailable (could not extract clean content).

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CL

See more →

arXiv cs.CL·Leyao Wang, Yanan He, Peng Chen, Asaf Yehudai, Yixin Liu, Rex Ying, Michal Shmueli-Scheuer, Arman Cohan

3w ago

FeaturedOriginal

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

AI Summary

The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.

#LLM #Agent #Inference #Policy