featured
Dec 08, 2025
Optimizing Inference for Long Context and Large Batch Sizes with NVFP4 KV Cache
Quantization is one of the strongest levers for large-scale inference. By reducing the precision of weights, activations, and KV cache, we can reduce the memory...
10 MIN READ
Dec 05, 2025
NVIDIA Kaggle Grandmasters Win Artificial General Intelligence Competition
NVIDIA researchers on Friday won a key Kaggle competition many in the field treat as a real-time pulse check on humanity’s progress toward artificial general...
3 MIN READ
Dec 05, 2025
NVIDIA Grace CPU Delivers High Bandwidth and Efficiency for Modern Data Centers
Since its debut in 2023, the NVIDIA Grace CPU has experienced rapid adoption across data centers, setting new benchmarks for performance efficiency across...
8 MIN READ
Dec 02, 2025
AWS Integrates AI Infrastructure with NVIDIA NVLink Fusion for Trainium4 Deployment
As demand for AI continues to grow, hyperscalers are looking for ways to accelerate deployment of specialized AI infrastructure with the highest performance....
5 MIN READ
Nov 25, 2025
Making Robot Perception More Efficient on NVIDIA Jetson Thor
Building autonomous robots requires robust, low-latency visual perception for depth, obstacle recognition, localization, and navigation in dynamic environments....
15 MIN READ
Nov 24, 2025
Build and Run Secure, Data-Driven AI Agents
As generative AI advances, organizations need AI agents that are accurate, reliable, and informed by data specific to their business. The NVIDIA AI-Q Research...
9 MIN READ
Nov 24, 2025
Model Quantization: Concepts, Methods, and Why It Matters
AI models are becoming increasingly complex, often exceeding the capabilities of available hardware. Quantization has emerged as a crucial technique to address...
12 MIN READ
Nov 19, 2025
Breaking Through Reinforcement Learning Training Limits with Scaling Rollouts in BroRL
When training large language models (LLMs) with reinforcement learning from verifiable rewards (RLVR), one of the most compelling questions is how to overcome...
7 MIN READ
Nov 19, 2025
Building Better Qubits with GPU-Accelerated Computing
Quantum computing promises to revolutionize science and industry, from drug discovery to materials science. But building a useful, large-scale quantum computer...
5 MIN READ
Nov 18, 2025
Building Scalable AI on Enterprise Data with NVIDIA Nemotron RAG and Microsoft SQL Server 2025
At Microsoft Ignite 2025, the vision for an AI-ready enterprise database becomes a reality with the announcement of Microsoft SQL Server 2025, giving developers...
10 MIN READ
Nov 18, 2025
Faster Chemistry and Materials Discovery with AI-Powered Simulations Using NVIDIA ALCHEMI
Almost all manufactured products are enabled by chemistry and materials science. However, new discoveries are costly and time-consuming and often hindered by...
6 MIN READ
Nov 17, 2025
NVIDIA NVQLink Architecture Integrates Accelerated Computing with Quantum Processors
Quantum computing is entering an era where progress will be driven by the integration of accelerated computing with quantum processors. The hardware that...
8 MIN READ
Nov 17, 2025
Pioneering AI Co-Scientists for Fusion Research and Cancer Treatment
AI is reshaping scientific research and innovation. Scientists can leverage AI to generate, summarize, combine, and analyze scientific data. AI models can find...
8 MIN READ
Nov 13, 2025
Achieve CUTLASS C++ Performance with Python APIs Using CuTe DSL
CuTe, a core component of CUTLASS 3.x, provides a unified algebra for describing data layouts and thread mappings, and abstracts complex memory access patterns...
9 MIN READ
Nov 13, 2025
How to Get Started with Neural Shading for Your Game or Application
For the past 25 years, real-time rendering has been driven by continuous hardware improvements. The goal has always been to create the highest fidelity image...
21 MIN READ
Nov 12, 2025
NVIDIA Blackwell Architecture Sweeps MLPerf Training v5.1 Benchmarks
The NVIDIA Blackwell architecture powered the fastest time to train across every MLPerf Training v5.1 benchmark, marking a clean sweep in the latest round of...
10 MIN READ