Using Probabilistic Programs to Train Inductive Reasoning in… | AI Deep Signal

Using Probabilistic Programs to Train Inductive Reasoning in Large Language Models

arXiv cs.CL·Liyi Zhang, Akshay K. Jagadish, Brenden M. Lake, Thomas L. Griffiths

6/10/2026

Quick Answer

This paper shows that the Program-based Posterior Training (PPT) method improves inductive reasoning in large language models (LLMs). It generates 10,000 diverse scenarios from probabilistic programs for fine-tuning, which improves accuracy and alignment with human judgments.

Quick Take

This approach yields significant gains in estimation and calibration over traditional methods such as post-hoc temperature scaling, indicating a deeper understanding of uncertainty in LLMs.

Key Points

PPT fine-tunes LLMs using probabilistic programs for inductive reasoning.
10,000 scenarios generated to improve model performance on held-out tasks.
Significant accuracy gains observed in estimation and human alignment.
Raw calibration improvements exceed those from post-hoc temperature scaling.
PPT shows promise for reliable approximate inductive inference in LLMs.
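
As a rough illustration of the first two points, here is a minimal sketch of how a probabilistic program can produce fine-tuning scenarios whose targets are posterior distributions. The summary does not describe the paper's actual programs or prompt format, so everything here is a hypothetical toy: the coin-bias hypotheses, the uniform prior, and the names `posterior`, `sample_scenario`, and `build_dataset` are assumptions made for the example.

```python
import random

# Hypothetical toy setup (not from the paper): three candidate coin biases
# with a uniform prior over them.
HYPOTHESES = [0.2, 0.5, 0.8]
PRIOR = [1 / 3, 1 / 3, 1 / 3]


def posterior(flips):
    """Exact Bayesian posterior over HYPOTHESES given a list of 0/1 flips."""
    heads = sum(flips)
    tails = len(flips) - heads
    # Prior times Bernoulli likelihood for each hypothesis, then normalize.
    unnorm = [p * (h ** heads) * ((1 - h) ** tails)
              for h, p in zip(HYPOTHESES, PRIOR)]
    total = sum(unnorm)
    return [u / total for u in unnorm]


def sample_scenario(rng, n_flips=5):
    """Run the generative program once and pair the observations with the posterior."""
    bias = rng.choices(HYPOTHESES, weights=PRIOR, k=1)[0]
    flips = [1 if rng.random() < bias else 0 for _ in range(n_flips)]
    prompt = "Observed flips: " + " ".join("H" if f else "T" for f in flips)
    return {"prompt": prompt, "target": posterior(flips)}


def build_dataset(n_scenarios, seed=0):
    """Generate n_scenarios (prompt, target distribution) pairs reproducibly."""
    rng = random.Random(seed)
    return [sample_scenario(rng) for _ in range(n_scenarios)]


if __name__ == "__main__":
    for example in build_dataset(3):
        print(example["prompt"], [round(p, 3) for p in example["target"]])
```

Because the program defines both the prior and the likelihood, each target is computed rather than hand-labeled, which is why this kind of generator can scale to thousands of scenarios.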

Source Excerpt

arXiv:2606.09856v1. Abstract: Post-training large language models (LLMs) for reasoning typically focuses on deductive tasks such as mathematics and coding where correctness is verifiable. Yet, many real-world reasoning problems are inductive: agents must infer uncertain beliefs from sparse, ambiguous observations.

There are challenges to using standard fine-tuning methods for inductive reasoning, including difficulties in curating large-scale, high-quality labeled datasets and in handling targets that are inherently distributional. …
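
To make "inherently distributional" targets concrete, the sketch below scores a model's predicted distribution against a full target distribution instead of a single correct label. The excerpt does not say which loss PPT uses, so this is a generic soft-target cross-entropy and KL divergence, offered as an assumption; the function names are hypothetical.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities, shifting by the max for stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def soft_cross_entropy(target, logits):
    """Cross-entropy between a target distribution and softmax(logits)."""
    probs = softmax(logits)
    return -sum(t * math.log(p) for t, p in zip(target, probs) if t > 0)


def kl_divergence(target, logits):
    """KL(target || softmax(logits)): cross-entropy minus the target's entropy."""
    entropy = -sum(t * math.log(t) for t in target if t > 0)
    return soft_cross_entropy(target, logits) - entropy


if __name__ == "__main__":
    target = [0.7, 0.2, 0.1]  # hypothetical posterior over three hypotheses
    print(kl_divergence(target, [0.0, 0.0, 0.0]))  # uniform prediction
    print(kl_divergence(target, [math.log(t) for t in target]))  # matched prediction
```

With a one-hot target this reduces to the usual negative log-likelihood, so the distributional case is a strict generalization of standard classification-style fine-tuning.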
