MathAtlas: A Benchmark for Autoformalization in the Wild

arXiv cs.AI · Nilay Patel, Noah Arias, Davit Babayan, Victoria Cochran, Timothy Libman, Hafsah Mahmood, Liam McCarty, Soli Munoz, Laurel Willey, Jeffrey Flanigan

5/15/2026

Quick Take

MathAtlas is a new benchmark for autoformalization of graduate-level mathematics, comprising 52k theorems drawn from 103 textbooks together with a dependency graph.

Key Points

First large-scale autoformalization benchmark for graduate-level mathematics.
Includes 52k theorems from 103 textbooks.
State-of-the-art models achieve low correctness rates on the benchmark.
