Quantifying Aleatoric Uncertainty of In-Context Learning for Robust Measure of LLM Prediction Confidence
Quick Answer
This paper introduces self-function vectors to quantify aleatoric uncertainty in In-Context Learning (ICL) for LLMs, enhancing prediction reliability.
Quick Take
This paper introduces self-function vectors to quantify aleatoric uncertainty in In-Context Learning (ICL) for LLMs, enhancing prediction reliability. The proposed evaluation protocol allows for precise separation of aleatoric from epistemic uncertainty, demonstrating improved measurement of LLM predictions over existing methods. This advancement is crucial for applications like hallucination detection.
Key Points
- Introduces self-function vectors for modeling aleatoric uncertainty in LLMs.
- Proposes a new evaluation protocol for separating aleatoric and epistemic uncertainty.
- Demonstrates improved reliability in measuring LLM predictions under ICL.
- Framework applicable to real-world datasets and synthetic tasks.
- Supports trustworthy applications like hallucination detection.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2606. 19353v1 Announce Type: new Abstract: In-Context Learning (ICL) allows LLMs to adapt to new tasks from a few demonstrations, but its reliability remains a concern: predictions are highly sensitive to both prompt design and the model's ability to understand the context, obscuring whether failures arise from data properties or model limitations.
Uncertainty decomposition-separating aleatoric from epistemic sources-is particularly crucial in this setting, yet existing methods, designed for standard generation tasks, fail to capture the unique dynamics of ICL. To address this, we introduce a concept of self-function vectors, built upon Bayesian views and the mechanistic interpretability of ICL.
These vectors leverage internal model representations to model the latent concept learned during in-context prompting, thereby enabling a direct estimation of aleatoric uncertainty within a Bayesian framework and circumventing the reliance on brittle input or decoding manipulations.
Given the lack of established benchmarks and suitable evaluation protocols, we also propose the first and rigorous evaluation protocol, in which data is manipulated in controlled ways so as to quantify aleatoric uncertainty precisely and separately from epistemic uncertainty.
With this new evaluation framework, initially grounded in synthetic tasks for conceptual development and subsequently extended to real-world datasets, we show that our proposed methodology can measure uncertainty of LLM predictions made under ICL more reliably than existing alternative methods. Moreover, we show it can be used as a practical tool for trustworthy-related applications, such as hallucination detection.
Our findings pave a new direction for connecting the quantitative view of uncertainty with the mechanistic understanding of model behavior.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CL
See more →Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?
The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.