Mistral's open-source Leanstral 1.5 aces formal math benchmarks and catches real bugs in code

The Decoder·Matthias Bastian

2h ago

·~1 min·7/4/2026·en·0

Quick Answer

Mistral AI has launched Leanstral 1.5, an open-source model for formal verification in Lean 4, which excelled in formal math benchmarks and identified five previously unknown bugs across 57 open-source repositories.

Quick Take

Key Points

Leanstral 1.5 is designed for formal verification using Lean 4.
The model excelled in formal math benchmarks.
It discovered five previously unknown bugs in open-source code.
The bugs were found while scanning 57 repositories.
This release enhances reliability in software development.

Article Excerpt

From source RSS / original summary

Mistral AI released Leanstral 1. 5, an open-source model for formal verification in Lean 4. Beyond math, the model found five previously unknown bugs while scanning 57 open-source repositories. The article Mistral's open-source Leanstral 1. 5 aces formal math benchmarks and catches real bugs in code appeared first on The Decoder.

Read on the-decoder.com

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from The Decoder

See more →

An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run

The Decoder·Matthias Bastian

1w ago

FeaturedOriginal

An AI model programmed nonstop for 19 days on a single MirrorCode task that cost $2,600 to run

AI Summary

Epoch AI's MirrorCode benchmark reveals Claude Opus 4.7 as the leader with a 56% solve rate, reconstructing a 16,000-line toolkit in 14 hours. Despite this, all models tested struggle with the most complex tasks, highlighting limitations in current AI capabilities. The single task consumed $2,600 over 19 days, raising questions about cost-effectiveness in AI development.

#LLM #AI Coding #Inference #AI Startup