Deliberate Evolution: Agentic Reasoning for Sample-Efficient Symbolic Regression with LLMs

arXiv cs.CL·Xinyu Pang, Zhanke Zhou, Xuan Li, Fangrui Lv, Shanshan Wei, Sen Cui, Bo Han, Changshui Zhang

6/4/2026

·~1 min·6/4/2026·en·3

Quick Answer

This paper shows that The Deliberate Evolution (DE) framework enhances sample efficiency in symbolic regression by decoupling proposal generation from search control, outperforming LLM-based baselines on LLM-SRBench with only 40% of the sample budget.

Quick Take

This method utilizes adaptive operators, analytical tools, and reflective memory to improve performance across various scientific domains.

Key Points

DE framework improves sample efficiency in symbolic regression by 60% using only 40% of standard samples.
Decouples symbolic generation from search guidance, enhancing performance.
Utilizes adaptive operators and analytical tools for better error diagnosis.
Demonstrated superior results across diverse scientific domains on LLM-SRBench.
Addresses limitations of existing LLM-based evolutionary methods reliant on scalar feedback.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Source Excerpt

From the original publisher, up to about 700 characters

arXiv:2606. 04360v1 Announce Type: new Abstract: Symbolic regression (SR) discovers compact mathematical expressions from data, yet recent -based evolutionary methods remain sample-inefficient because they rely mainly on scalar feedback such as MSE. We identify a core limitation: existing methods conflate candidate proposal with search guidance, requiring the LLM to infer how to evolve an expression, diagnose its errors, and reuse past experience from a single score.

To address this, we propose Deliberate Evolution (DE), an agentic framework that decouples symbolic generation from search control. …

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CL

See more →

arXiv cs.CL·Miguel Arana-Catania, Catherine Conisbee, Matthew Kidd

6d ago

FeaturedOriginal

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

AI Summary

The study evaluates three NLP approaches—Named Entity Recognition, Keyword Extraction, and Topic Modelling—using the Their Finest Hour Online Archive to automate keyword extraction from crowdsourced WWII collections. Findings suggest that while NLP methods show promise, no single approach is sufficient, and ethical considerations in automated keyword extraction are crucial for responsible stewardship.

#AI Coding #Inference #Open Source #Policy

Deliberate Evolution: Agentic Reasoning for Sample-Efficient Symbolic Regression with LLMs

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CL

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

Quantifying Prior Dominance in Systems

Time to REFLECT: Can We Trust Judges for Evidence-based Research Agents?

Quick Answer

Quick Take

Key Points

Paper Resources

Source Excerpt

Want this in your inbox every morning?

More from arXiv cs.CL

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

Quantifying Prior Dominance in RAG Systems

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

Quantifying Prior Dominance in Systems

Time to REFLECT: Can We Trust Judges for Evidence-based Research Agents?