
MiniMax M3 on AI Gateway
Quick Take
MiniMax M3, now available on Vercel AI Gateway, features a 1M-token context window and native multimodality using MiniMax Sparse Attention (MSA). It enhances software engineering, terminal-based tool use, and agentic web browsing, optimized for multi-turn collaboration.
Key Points
- M3 supports multimodal input, allowing image and prompt combinations.
- AI Gateway offers a unified API for model calls and usage tracking.
- No markup on provider pricing; no platform fee on inference requests.
- Features include custom reporting and zero data retention support.
- Dynamic provider sorting by latency and cost enhances performance.
Article Excerpt
From source RSS / original summaryMiniMax M3 is now available on . Vercel AI GatewayM3 is MiniMax's first model with a 1M-token context window and native multimodality, built around MiniMax Sparse Attention (MSA). M3 improves on software engineering, terminal-based tool use, and agentic web browsing, and is tuned for multi-turn collaboration. To use MiniMax M3, set model to in the .
minimax/minimax-m3AI SDKPass an image alongside a prompt to use M3's multimodal input:AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in , , , and more. AI Gateway reflects provider pricing with no markup and does not charge a platform fee on inference, including on (BYOK) requests.
custom reportingZero Data Retention supportdynamic provider sorting by latency & costBring Your Own KeyLearn more about , view the or try it in our . AI GatewayAI Gateway model leaderboardmodel playgroundRead more
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from Vercel AI
See more →
Opus 4.8 on AI Gateway
Claude Opus 4.8, now available on Vercel AI Gateway, excels in long-horizon agentic execution and complex coding tasks, producing clearer prose for knowledge work. Users can access it via the .anthropic/claude-opus-4.8 model in the AI SDK, benefiting from a unified API with no markup on provider pricing.

