Marktechpost AI on X: "This is super cool! Just checked NVIDIA Cosmos 3 this week. Here's what's actually interesting if you build physical AI. It's an open family of omnimodal world models. One model does physical reasoning, world generation, and action generation. Earlier Cosmos releases needed ht
Quick Answer
NVIDIA's Cosmos 3 introduces an omnimodal world model with a two-tower architecture: a Reasoner for interpreting inputs and a Generator for producing physics-aware outputs.
Quick Take
NVIDIA's Cosmos 3 introduces an omnimodal world model with a two-tower architecture: a Reasoner for interpreting inputs and a Generator for producing physics-aware outputs. It supports various hardware configurations, including Cosmos3-Nano and Cosmos3-Super, and achieves state-of-the-art benchmarks in multiple AI tasks.
Key Points
- Cosmos 3 features a Reasoner tower for video, image, and text analysis.
- Generator tower produces physics-aware videos and actions, with outputs including synchronized sound.
- Available in Cosmos3-Nano (16B) and Cosmos3-Super (64B) configurations.
- Achieves open-source state-of-the-art results on R-Bench and leads in PAI-Bench and Physics-IQ.
- Future release planned for a 4B Edge model.
Article Excerpt
From source RSS / original summary1) Two towers, one transformer→ Reasoner tower: an autoregressive that reads video, images, and text → Generator tower: a diffusion path for physics-aware video and actions → Information flows one way, reasoner → generator 2) Pick a size for your hardware→ Cosmos3-Nano: 16B total (dense 8B, Qwen3-VL 8B), runs on workstation GPUs like the RTX PRO 6000 → Cosmos3-Super: 64B total (dense 32B, Qwen3-VL 32B), targets Hopper and Blackwell datacenters → A 4B Edge model is planned for a later release 3) What it generates→ In: text, image, video, action arrays → Out: image, video, synchronized sound, action states, text → 256p/480p/720p, 5–300 frames (default 189 ≈ 7.
9s at 24 FPS), stereo AAC at 48 kHz 4) The benchmark claims→ Open-source SOTA on R-Bench; leads PAI-Bench, Physics-IQ, and RoboLab
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from WebSearch (Tavily)
See more →WSJ: OpenAI is considering deep price reductions as competition ...
OpenAI is contemplating significant price cuts in response to competitive pressure from Anthropic, particularly due to the success of Claude Code in developer and coding workflows. This shift could affect pricing strategies in the AI market as companies vie for dominance in coding solutions.