
NVIDIA CUDA 13.3 Enhances GPU Development with Tile Programming in C++, Compiler Autotuning, and Python Updates
Quick Take
NVIDIA CUDA 13.3 introduces Tile programming in C++, compiler autotuning, and Python updates for enhanced GPU development.
Key Points
- Tile programming simplifies high-level kernel development.
- Automatic management of low-level GPU details.
- Supports Compute Capability 9.0 for broader compatibility.
Article Excerpt
From source RSS / original summaryNVIDIA CUDA 13. 3 brings new capabilities and performance optimizations to developers across the CUDA ecosystem. The launch of NVIDIA CUDA Tile programming in... NVIDIA CUDA 13. 3 brings new capabilities and performance optimizations to developers across the CUDA ecosystem. The launch of NVIDIA CUDA Tile programming in C++, enables high-level, tile-based kernel development that automatically manages complex low-level GPU details for optimal performance and portability.
Additionally, CUDA Tile programming is now supported on Compute Capability 9. 0… Source
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from NVIDIA Developer Blog
See more →
Develop High-Performance GPU Kernels in C++ with NVIDIA CUDA Tile
NVIDIA CUDA Tile enables optimized GPU kernel development within existing C++ codebases.

Extract More Kernel Performance with NVIDIA CompileIQ Auto-Tuning
NVIDIA CompileIQ optimizes compiler options for enhanced kernel performance in specific workloads.

