PROMETHEUS: Automating Deep Causal Research Integrating Text, Data and Models

arXiv cs.AI·Sridhar Mahadevan

5/14/2026

Quick Take

PROMETHEUS automates causal research by organizing literature, source data, code, and scientific models into navigable causal atlases.

Key Points

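Transforms literature, data, code, and simulations into families of local causal predictive-state models.
Lets readers navigate local claims and see where they agree, drift, contradict each other, or remain underdetermined.
Demonstrates the approach in three literature-atlas case studies and four grounded-counterfactual case studies.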

[Submitted on 13 May 2026]

Abstract: Large language models can extract local causal claims from text, but those claims become more useful when organized as persistent, navigable world models rather than as flat summaries. We introduce PROMETHEUS, a framework that turns retrieved literature, filings, reviews, reports, agent traces, source data, code, simulations, and scientific models into causal atlases: sheaf-like families of local causal predictive-state models over an explicit cover of a research substrate. Each local region contains causal episodes, structured claim tables, predictive tests, support statistics, and provenance; restriction maps compare overlapping regions; gluing diagnostics expose agreement, drift, contradiction, and underdetermination. The resulting Topos World Model is not a single universal graph. It is a research instrument for navigating what a corpus says, where it says it, how strongly it is supported, and where local claims fail to assemble into a coherent global view. Three literature-atlas case studies -- ocean-temperature impacts on marine populations, GLP-1 weight-loss evidence, and resveratrol/red-wine health-benefit claims -- illustrate deep causal research from text with explicit locality, evidence, persistent state, and gluing tension. Four grounded-counterfactual case studies -- a Nature Climate Change microplastics forcing paper, an Indus Valley hydrology paper with VIC-derived figure data and model code, the canonical Sachs protein-signaling study with single-cell perturbation data, and a Nature singing-mouse study with MAPseq projection matrices -- show a stronger mode: when a paper ships source data, simulation outputs, or code, PROMETHEUS can evaluate a counterfactual against that scientific substrate and then rebuild the sheaf world model around the
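
The atlas idea in the abstract (local regions with claim tables, restriction maps onto overlaps, and gluing diagnostics) can be illustrated with a toy sketch. This is not the PROMETHEUS implementation: the names Claim, Region, restrict, and gluing_diagnostics, the signed-edge encoding of a claim, and the variables x, y, z, w are all hypothetical choices made for illustration. The sketch only distinguishes agreement, contradiction, and underdetermination; it does not attempt to model drift, episodes, predictive tests, or provenance.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Claim:
    """One row of a hypothetical local claim table."""
    cause: str
    effect: str
    sign: int      # +1: cause raises effect, -1: cause lowers effect
    support: int   # toy support statistic (e.g. number of sources)


@dataclass
class Region:
    """A local region of the cover: a set of variables plus its claims."""
    name: str
    variables: set
    claims: list = field(default_factory=list)

    def restrict(self, variables):
        """Toy restriction map: keep claims that live entirely in `variables`."""
        return {
            (c.cause, c.effect): c
            for c in self.claims
            if c.cause in variables and c.effect in variables
        }


def gluing_diagnostics(a, b):
    """Compare two regions on their overlap and label each causal edge."""
    overlap = a.variables & b.variables
    ra, rb = a.restrict(overlap), b.restrict(overlap)
    report = {}
    for edge in sorted(ra.keys() | rb.keys()):
        if edge in ra and edge in rb:
            same = ra[edge].sign == rb[edge].sign
            report[edge] = "agreement" if same else "contradiction"
        else:
            # Only one region makes a claim about this edge on the overlap.
            report[edge] = "underdetermined"
    return report


# Placeholder regions with made-up variables; not data from the paper.
r1 = Region("r1", {"x", "y", "z"},
            [Claim("x", "y", +1, 3), Claim("y", "z", -1, 2)])
r2 = Region("r2", {"y", "z", "w"},
            [Claim("y", "z", +1, 1), Claim("z", "w", +1, 4)])
r3 = Region("r3", {"x", "y"}, [])

print(gluing_diagnostics(r1, r2))
print(gluing_diagnostics(r1, r3))
```

In this toy, r1 and r2 overlap on y and z and disagree about the sign of the y-to-z edge, so the pair fails to glue there, while r3 says nothing about the x-to-y edge that r1 asserts. Keeping the diagnostics per overlap, rather than merging everything into one graph, mirrors the abstract's point that the model "is not a single universal graph".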

Comments:	27 pages
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2605.12835 [cs.AI]
	(or arXiv:2605.12835v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2605.12835 arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Sridhar Mahadevan
[v1] Wed, 13 May 2026 00:08:07 UTC (46 KB)

— Originally published at arxiv.org
