
New benchmark exposes how badly AI struggles with real knowledge work
Quick Answer
A new benchmark reveals that even top AI models, like those from leading companies, only solve 3% of realistic knowledge work tasks.
Quick Take
A new benchmark reveals that even top AI models, like those from leading companies, only solve 3% of realistic knowledge work tasks. This stark performance gap highlights the limitations of current AI technologies in practical applications, affecting industries reliant on knowledge work.
Key Points
- Top AI models struggle with realistic knowledge work, solving only 3% of tasks.
- The benchmark highlights significant limitations in AI's practical applications.
- Industries relying on knowledge work may face challenges due to AI performance.
- Current AI technologies are not yet equipped for complex knowledge tasks.
Article Excerpt
From source RSS / original summaryEven the best AI model fails at realistic knowledge work, fully solving just 3 percent of tasks. The article New benchmark exposes how badly AI struggles with real knowledge work appeared first on The Decoder.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from The Decoder
See more →
OpenAI models now available on Amazon Web Services
OpenAI has launched GPT-5.5, GPT-5.4, and Codex on Amazon Bedrock, matching its own pricing. Currently, these models are available only in the US across commercial and government AWS regions, with usage contributing to existing AWS contracts.

