Compositional World Models from Video Pretraining · DeepSignalCompositional World Models from Video Pretraining
A video-pretrained transformer yields compositional world models, lifting long-horizon planning benchmarks by 14%.
Key Points
- Pretrained on 5M hours of video.
- +14% on long-horizon planning suite.
- Released checkpoints and dataset list.
Reader Mode is being prepared.
Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems
AI Summary
Invisible orchestrators in multi-agent LLM systems pose significant safety risks and affect behavior dynamics.

arXiv cs.AI·Saharsh Koganti, Priyadarsi Mishra, Pierfrancesco Beneventano, Tomer Galanti 2d agoDistribution-Aware Algorithm Design with LLM Agents
AI Summary
The study presents a distribution-aware algorithm leveraging LLM agents for optimized solver code generation.
Enhanced and Efficient Reasoning in Large Learning Models
AI Summary
The paper proposes an efficient reasoning method for large language models, enhancing trust in generated content.

arXiv cs.CL·Luis Lara, Aristides Milios, Zhi Hao Luo, Aditya Sharma, Ge Ya Luo, Christopher Beckham, Florian Golemo, Christopher Pal 2d agoGenerative Floor Plan Design with LLMs via Reinforcement Learning with Verifiable Rewards
AI Summary
A new LLM-based approach generates floor plans while adhering to numerical and topological constraints using reinforcement learning.

arXiv cs.CV·Alvaro Lopez Pellicer, Plamen Angelov, Marwan Bukhari, Yi Li, Eduardo Soares, Jemma Kerns 2d agoProtoMedAgent: Multimodal Clinical Interpretability via Privacy-Aware Agentic Workflows
AI Summary
ProtoMedAgent enhances clinical interpretability by integrating multimodal reporting with privacy-aware workflows.
China bypasses US GPU bans with 1.54-exaflops 'LineShine' supercomputer — CPU-only monster packs 2.4 million Huawei-designed Armv9 cores
AI Summary
China's LineShine supercomputer achieves 1.54 exaflops using 2.4 million Armv9 cores, circumventing US GPU restrictions.
67
≥75 high · 50–74 medium · <50 low
Why Featured
Video-pretrained world models are a key piece toward general-purpose agents that plan over time.