Towards Spec Learning: Inference-Time Alignment from Preference Pairs

arXiv cs.CL·Dhriti Krishnan, Tejas Goyal, Jaromir Savelka

4h ago

·~1 min·6/24/2026·en·0

Quick Answer

The proposed 'spec learning' framework enables large language models to align with user preferences using brief instructions and preference judgments, outperforming direct preference optimization in specialized domains without requiring parameter updates.

Quick Take

The proposed 'spec learning' framework enables large language models to align with user preferences using brief instructions and preference judgments, outperforming in specialized domains without requiring parameter updates. This method enhances interpretability and transparency of model responses.

Key Points

Spec learning compiles user instructions into natural-language prompts for LLMs.
No parameter updates are needed, making it less brittle than traditional methods.
Outperforms direct preference optimization on dense preference signal datasets.
Specifications are human-readable, enhancing interpretability and transparency.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Article Excerpt

From source RSS / original summary

arXiv:2606. 24004v1 Announce Type: new Abstract: Steering a large language model (LLM) toward a desired behavior typically relies on an iterative process of hand-crafting a prompt based on a careful inspection of the model's responses. This is an involved, brittle, and error-prone process. Preference-based fine-tuning is a more rigorous but often prohibitively expensive solution. We propose spec learning, a framework that relies on a brief user instruction and a small set of preference judgments.

These are compiled into specifications in the form of natural-language prompts for an LLM. Specifications condition LLMs at inference time, and no parameter updates to the underlying models are required. We show that the responses generated based on the compiled specifications often outperform (DPO) on datasets from specialized domains whose preference signal is dense.

Unlike opaque weight updates, the resulting specifications are human-readable and double as interpretable and transparent written embodiments of the preference signal that produced them.

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CL

See more →

arXiv cs.CL·Barak Or

4h ago

FeaturedOriginal

Quantifying Prior Dominance in Systems

AI Summary

The study introduces the Normalized Context Utilization (NCU) metric to evaluate Retrieval-Augmented Generation (RAG) systems, revealing that Small Language Models (SLMs) outperform larger models in factual extraction. The findings indicate that traditional scaling laws yield diminishing returns, with a commercial API frequently failing against adversarial evidence due to systemic confidence collapse.

#LLM #AI Coding #Inference #AI Startup

Towards Spec Learning: Inference-Time Alignment from Preference Pairs

Quick Answer

Quick Take

Key Points

Paper Resources

Article Excerpt

Want this in your inbox every morning?

More from arXiv cs.CL

Quantifying Prior Dominance in Systems

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

When Plausible Is Not Realistic: Evaluating Human Mobility in LLM-Based Urban Simulation

Quick Answer

Quick Take

Key Points

Paper Resources

Article Excerpt

Want this in your inbox every morning?

More from arXiv cs.CL

Quantifying Prior Dominance in RAG Systems

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

When Plausible Is Not Realistic: Evaluating Human Mobility in LLM-Based Urban Simulation

Quantifying Prior Dominance in Systems