Embedding by Elicitation: Dynamic Representations for Bayesian Optimization of System Prompts
Quick Take
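ReElicit is a Bayesian optimization framework for tuning system prompts from aggregate scalar feedback. An LLM elicits, and repeatedly re-elicits, an interpretable feature space over which a Gaussian process surrogate is fit.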
Key Points
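- Introduces ReElicit, a Bayesian optimization framework for system prompt tuning when only one scalar score per prompt is observed.
- Uses an LLM to elicit a compact, interpretable feature space, map prompts into it, and re-elicit it as new evaluations arrive.
- Reports the strongest aggregate performance profile among aggregate-only baselines across ten tasks with a budget of 30 evaluations.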
Abstract: System prompts are a central control mechanism in modern AI systems, shaping behavior across conversations, tasks, and user populations. Yet they are difficult to tune when feedback is available only as aggregate metrics rather than per-example labels, failures, or critiques. We study this aggregate feedback setting as sample-constrained black-box optimization over discrete, variable-length text. We introduce ReElicit, a Bayesian optimization framework based on embedding by elicitation. Given a task description, previously evaluated prompts, and scalar scores, an LLM elicits a compact, interpretable feature space and maps prompts into it. Leveraging a probabilistic Gaussian process surrogate, an acquisition function then selects target feature vectors, which the LLM realizes and refines into deployable system prompts. Re-eliciting the feature space as new evaluations arrive lets the representation adapt to the observed prompt-score history. We evaluate the setting using offline benchmark accuracy as a controlled aggregate proxy: the optimizer observes one scalar score per prompt and no per-example labels, errors, or critiques. Across ten system prompt optimization tasks with a total budget of 30 evaluations, ReElicit achieves the strongest aggregate performance profile among representative aggregate-only prompt-optimization baselines. These results suggest that LLMs can serve as adaptive semantic representation builders, not only prompt generators, for Bayesian optimization over natural-language artifacts.
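The surrogate-and-acquisition step described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the two feature names, the example scores, the RBF kernel with fixed hyperparameters, the use of expected improvement as the acquisition function, and the candidate grid. The LLM steps (eliciting features, mapping prompts into them, realizing a prompt from the chosen target) are represented only by hard-coded data and comments.

```python
import math

def rbf(a, b, length=0.5):
    """Squared-exponential kernel between two feature vectors (assumed kernel)."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-0.5 * d2 / length ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = A[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = math.sqrt(s) if i == j else s / L[j][j]
    return L

def solve_lower(L, b):
    """Solve L x = b by forward substitution."""
    x = []
    for i in range(len(b)):
        x.append((b[i] - sum(L[i][k] * x[k] for k in range(i))) / L[i][i])
    return x

def solve_upper_t(L, b):
    """Solve L^T x = b by back substitution."""
    n = len(b)
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (b[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_posterior(X, y, x_star, noise=1e-4):
    """Posterior mean and std at x_star for a GP with constant mean = mean(y)."""
    y_mean = sum(y) / len(y)
    resid = [v - y_mean for v in y]
    K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(X)]
         for i, a in enumerate(X)]
    L = cholesky(K)
    alpha = solve_upper_t(L, solve_lower(L, resid))
    k_star = [rbf(a, x_star) for a in X]
    mean = y_mean + sum(k * a for k, a in zip(k_star, alpha))
    v = solve_lower(L, k_star)
    var = max(rbf(x_star, x_star) - sum(t * t for t in v), 1e-12)
    return mean, math.sqrt(var)

def expected_improvement(mean, std, best):
    """Expected improvement over the best observed score (maximization)."""
    z = (mean - best) / std
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)
    cdf = 0.5 * (1 + math.erf(z / math.sqrt(2)))
    return (mean - best) * cdf + std * pdf

def propose_target(X, y, candidates):
    """Pick the candidate feature vector with the highest expected improvement."""
    best = max(y)
    def ei(c):
        mean, std = gp_posterior(X, y, c)
        return expected_improvement(mean, std, best)
    return max(candidates, key=ei)

# Hypothetical elicited features, each scaled to [0, 1]. In the described
# method an LLM would propose these names and score each prompt on them.
feature_names = ["step_by_step_reasoning", "brevity"]
X = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]  # feature vectors of evaluated prompts
y = [0.50, 0.70, 0.40]                    # one aggregate score per prompt (made up)

grid = [i / 4 for i in range(5)]
candidates = [[a, b] for a in grid for b in grid]
target = propose_target(X, y, candidates)

# The target would then be handed back to the LLM to write a concrete prompt.
print(dict(zip(feature_names, target)))
```

In a full loop, the chosen target would be turned into a system prompt, evaluated once for a new scalar score, and appended to the history; re-eliciting the feature space would mean recomputing `X` under a new set of feature names before refitting. Kernel hyperparameters are fixed here for brevity, whereas a practical surrogate would normally fit them to the observed scores.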
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2605.19093 [cs.AI] (or arXiv:2605.19093v1 [cs.AI] for this version)
DOI: https://doi.org/10.48550/arXiv.2605.19093 (arXiv-issued DOI via DataCite, pending registration)
Submission history
From: Zhiyuan Jerry Lin
[v1] Mon, 18 May 2026 20:28:17 UTC (142 KB)
— Originally published at arxiv.org