Microsoft AI Introduces MAI-Transcribe-1.5: 2.4% WER on Artificial Analysis, Best-in-Class FLEURS Accuracy, and Up to 5x Faster Long-Audio Transcription
Quick Answer
Microsoft AI has launched MAI-Transcribe-1.5, achieving a 2.4% Word-Error-Rate on the Artificial Analysis leaderboard and offering up to 5x faster long-audio transcription.
Quick Take
Microsoft AI has launched MAI-Transcribe-1.5, achieving a 2.4% Word-Error-Rate on the Artificial Analysis leaderboard and offering up to 5x faster long-audio transcription. This model supports 43 languages and features keyword biasing for domain-specific terms, now available in Azure AI Foundry.
Key Points
- MAI-Transcribe-1.5 supports 43 languages and enhances domain-specific term recognition.
- Achieves a 2.4% Word-Error-Rate on the Artificial Analysis benchmark.
- Transcribes one hour of audio in under 15 seconds, improving efficiency.
- Features keyword biasing to enhance accuracy for specific industries.
- Available for use in Azure AI Foundry, expanding accessibility.
Article Excerpt
From source RSS / original summaryMicrosoft AI has released MAI-Transcribe-1. 5, the second iteration of its in-house speech-to-text family. The model covers 43 languages, adds keyword (entity) biasing for domain-specific terms, posts a 2. 4% Word-Error-Rate on the Artificial Analysis leaderboard, and transcribes an hour of audio in under 15 seconds. It is generally available in Azure AI Foundry. The post Microsoft AI Introduces MAI-Transcribe-1. 5: 2.
4% WER on Artificial Analysis, Best-in-Class FLEURS Accuracy, and Up to 5x Faster Long-Audio Transcription appeared first on MarkTechPost.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from MarkTechPost
See more →NVIDIA garak Tutorial: Build a Complete Defensive LLM Red-Teaming Workflow with Custom Probes and Detectors
The NVIDIA garak tutorial provides a comprehensive framework for defensive LLM red-teaming, detailing setup, plugin discovery, and evaluations using Hugging Face models. It emphasizes analyzing safety scores, attack success rates, and extending functionality with custom probes, concluding with exporting results in AVID format for vulnerability assessment.