PROMETHEUS: Automating Deep Causal Research Integrating Text, Data and Models
Quick Take
PROMETHEUS automates causal research by organizing data into navigable causal atlases.
Key Points
- Transforms literature and data into causal predictive-state models.
- Facilitates navigation of local claims and their coherence.
- Demonstrates applications through various case studies.
📖 Reader Mode
~2 min readAbstract:Large language models can extract local causal claims from text, but those claims become more useful when organized as persistent, navigable world models rather than as flat summaries. We introduce PROMETHEUS, a framework that turns retrieved literature, filings, reviews, reports, agent traces, source data, code, simulations, and scientific models into causal atlases: sheaf-like families of local causal predictive-state models over an explicit cover of a research substrate. Each local region contains causal episodes, structured claim tables, predictive tests, support statistics, and provenance; restriction maps compare overlapping regions; gluing diagnostics expose agreement, drift, contradiction, and underdetermination. The resulting Topos World Model is not a single universal graph. It is a research instrument for navigating what a corpus says, where it says it, how strongly it is supported, and where local claims fail to assemble into a coherent global view. Three literature-atlas case studies -- ocean-temperature impacts on marine populations, GLP-1 weight-loss evidence, and resveratrol/red-wine health-benefit claims -- illustrate deep causal research from text with explicit locality, evidence, persistent state, and gluing tension. Four grounded-counterfactual case studies -- a Nature Climate Change microplastics forcing paper, an Indus Valley hydrology paper with VIC-derived figure data and model code, the canonical Sachs protein-signaling study with single-cell perturbation data, and a Nature singing-mouse study with MAPseq projection matrices -- show a stronger mode: when a paper ships source data, simulation outputs, or code, PROMETHEUS can evaluate a counterfactual against that scientific substrate and then rebuild the sheaf world model around the
| Comments: | 27 pages |
| Subjects: | Artificial Intelligence (cs.AI) |
| Cite as: | arXiv:2605.12835 [cs.AI] |
| (or arXiv:2605.12835v1 [cs.AI] for this version) | |
| https://doi.org/10.48550/arXiv.2605.12835 arXiv-issued DOI via DataCite (pending registration) |
Submission history
From: Sridhar Mahadevan [view email]
[v1]
Wed, 13 May 2026 00:08:07 UTC (46 KB)
— Originally published at arxiv.org
More from arXiv cs.AI
See more →Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems
Invisible orchestrators in multi-agent LLM systems pose significant safety risks and affect behavior dynamics.