
Mistral's open-source Leanstral 1.5 aces formal math benchmarks and catches real bugs in code
Quick Answer
Mistral AI has launched Leanstral 1.5, an open-source model for formal verification in Lean 4, which excelled in formal math benchmarks and identified five previously unknown bugs across 57 open-source repositories.
Quick Take
Mistral AI has launched Leanstral 1.5, an open-source model for formal verification in Lean 4, which excelled in formal math benchmarks and identified five previously unknown bugs across 57 open-source repositories.
Key Points
- Leanstral 1.5 is designed for formal verification using Lean 4.
- The model excelled in formal math benchmarks.
- It discovered five previously unknown bugs in open-source code.
- The bugs were found while scanning 57 repositories.
- This release enhances reliability in software development.
Article Excerpt
From source RSS / original summaryMistral AI released Leanstral 1. 5, an open-source model for formal verification in Lean 4. Beyond math, the model found five previously unknown bugs while scanning 57 open-source repositories. The article Mistral's open-source Leanstral 1. 5 aces formal math benchmarks and catches real bugs in code appeared first on The Decoder.
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from The Decoder
See more →
An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run
Epoch AI's MirrorCode benchmark reveals Claude Opus 4.7 as the leader with a 56% solve rate, reconstructing a 16,000-line toolkit in 14 hours. Despite this, all models tested struggle with the most complex tasks, highlighting limitations in current AI capabilities. The single task consumed $2,600 over 19 days, raising questions about cost-effectiveness in AI development.

