featured

Dec 08, 2025

Optimizing Inference for Long Context and Large Batch Sizes with NVFP4 KV Cache

Quantization is one of the strongest levers for large-scale inference. By reducing the precision of weights, activations, and KV cache, we can reduce the memory...

10 MIN READ

Dec 05, 2025

NVIDIA Kaggle Grandmasters Win Artificial General Intelligence Competition

NVIDIA researchers on Friday won a key Kaggle competition many in the field treat as a real-time pulse check on humanity’s progress toward artificial general...

3 MIN READ

Dec 05, 2025

NVIDIA Grace CPU Delivers High Bandwidth and Efficiency for Modern Data Centers

Since its debut in 2023, the NVIDIA Grace CPU has experienced rapid adoption across data centers, setting new benchmarks for performance efficiency across...

8 MIN READ

Dec 02, 2025

AWS Integrates AI Infrastructure with NVIDIA NVLink Fusion for Trainium4 Deployment

As demand for AI continues to grow, hyperscalers are looking for ways to accelerate deployment of specialized AI infrastructure with the highest performance....

5 MIN READ

Nov 25, 2025

Making Robot Perception More Efficient on NVIDIA Jetson Thor

Building autonomous robots requires robust, low-latency visual perception for depth, obstacle recognition, localization, and navigation in dynamic environments....

15 MIN READ

Nov 24, 2025

Build and Run Secure, Data-Driven AI Agents

As generative AI advances, organizations need AI agents that are accurate, reliable, and informed by data specific to their business. The NVIDIA AI-Q Research...

9 MIN READ

Nov 24, 2025

Model Quantization: Concepts, Methods, and Why It Matters

AI models are becoming increasingly complex, often exceeding the capabilities of available hardware. Quantization has emerged as a crucial technique to address...

12 MIN READ

Nov 19, 2025

Breaking Through Reinforcement Learning Training Limits with Scaling Rollouts in BroRL

When training large language models (LLMs) with reinforcement learning from verifiable rewards (RLVR), one of the most compelling questions is how to overcome...

7 MIN READ

Nov 19, 2025

Building Better Qubits with GPU-Accelerated Computing

Quantum computing promises to revolutionize science and industry, from drug discovery to materials science. But building a useful, large-scale quantum computer...

5 MIN READ

Nov 18, 2025

Building Scalable AI on Enterprise Data with NVIDIA Nemotron RAG and Microsoft SQL Server 2025

At Microsoft Ignite 2025, the vision for an AI-ready enterprise database becomes a reality with the announcement of Microsoft SQL Server 2025, giving developers...

10 MIN READ

Nov 18, 2025

Faster Chemistry and Materials Discovery with AI-Powered Simulations Using NVIDIA ALCHEMI

Almost all manufactured products are enabled by chemistry and materials science. However, new discoveries are costly and time-consuming and often hindered by...

6 MIN READ

Nov 17, 2025

NVIDIA NVQLink Architecture Integrates Accelerated Computing with Quantum Processors

Quantum computing is entering an era where progress will be driven by the integration of accelerated computing with quantum processors. The hardware that...

8 MIN READ

Nov 17, 2025

Pioneering AI Co-Scientists for Fusion Research and Cancer Treatment

AI is reshaping scientific research and innovation. Scientists can leverage AI to generate, summarize, combine, and analyze scientific data. AI models can find...

8 MIN READ

Nov 13, 2025

Achieve CUTLASS C++ Performance with Python APIs Using CuTe DSL

CuTe, a core component of CUTLASS 3.x, provides a unified algebra for describing data layouts and thread mappings, and abstracts complex memory access patterns...

9 MIN READ

Nov 13, 2025

How to Get Started with Neural Shading for Your Game or Application

For the past 25 years, real-time rendering has been driven by continuous hardware improvements. The goal has always been to create the highest fidelity image...

21 MIN READ

Nov 12, 2025

NVIDIA Blackwell Architecture Sweeps MLPerf Training v5.1 Benchmarks

The NVIDIA Blackwell architecture powered the fastest time to train across every MLPerf Training v5.1 benchmark, marking a clean sweep in the latest round of...

10 MIN READ