Anthropic Researcher Mode: Claude builds and runs its own experiments · DeepSignalAnthropic Researcher Mode: Claude builds and runs its own experiments
Anthropic's Researcher Mode gives Claude persistent compute and a sandbox for multi-day investigations and experiments.
Key Points
- Persistent compute + filesystem.
- Plan-execute-report loop over days.
- Initially gated to research preview.
Reader Mode is being prepared.
More from this source
Claude Sonnet 4.5 leads SWE-Bench Verified at 64.2%
AI Summary
Claude Sonnet 4.5 jumps SWE-Bench Verified to 64.2% and adds a 200K-token context option.
Anthropic publishes Constitutional AI v3 — fewer refusals, better task completion
AI Summary
Anthropic's Constitutional AI v3 cuts over-refusal by 41% while preserving safety, using a tighter principle set and contrastive reinforcement.
Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems
AI Summary
Invisible orchestrators in multi-agent LLM systems pose significant safety risks and affect behavior dynamics.
OpenAI co-founder Greg Brockman reportedly takes charge of product strategy
AI Summary
OpenAI co-founder Greg Brockman is now leading product strategy amid plans to integrate ChatGPT and Codex.

arXiv cs.AI·Saharsh Koganti, Priyadarsi Mishra, Pierfrancesco Beneventano, Tomer Galanti 2d agoDistribution-Aware Algorithm Design with LLM Agents
AI Summary
The study presents a distribution-aware algorithm leveraging LLM agents for optimized solver code generation.
33
≥75 high · 50–74 medium · <50 low
Why Featured
Multi-day autonomous research loops are the next obvious agent capability; sets the bar for the rest of the field.