Deep dive

Jul 02, 2025
Optimizing FLUX.1 Kontext for Image Editing with Low-Precision Quantization
FLUX.1 Kontext, the recently released model from Black Forest Labs, is a fascinating addition to the repertoire of community image generation models. The open...
10 MIN READ

Jul 01, 2025
Per-Tensor and Per-Block Scaling Strategies for Effective FP8 Training
In this blog post, we’ll break down the main FP8 scaling strategies—per-tensor scaling, delayed and current scaling, and per-block scaling (including the...
10 MIN READ

Jun 30, 2025
Best-in-Class Multimodal RAG: How the Llama 3.2 NeMo Retriever Embedding Model Boosts Pipeline Accuracy
Data goes far beyond text—it is inherently multimodal, encompassing images, video, audio, and more, often in complex and unstructured formats. While the...
7 MIN READ

Jun 25, 2025
Powering the Next Frontier of Networking for AI Platforms with NVIDIA DOCA 3.0
The NVIDIA DOCA framework has evolved to become a vital component of next-generation AI infrastructure. From its initial release to the highly anticipated...
12 MIN READ

Jun 24, 2025
Introducing NVFP4 for Efficient and Accurate Low-Precision Inference
To get the most out of AI, optimizations are critical. When developers think about optimizing AI models for inference, model compression techniques—such as...
11 MIN READ

Jun 18, 2025
Improved Performance and Monitoring Capabilities with NVIDIA Collective Communications Library 2.26
The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode communication primitives optimized for NVIDIA GPUs and networking. NCCL...
11 MIN READ

Jun 17, 2025
Power Real-Time AI Media Effects with New AI Reference Apps on NVIDIA Holoscan for Media
Live media workflows are increasingly using AI microservices to augment production capabilities. However, advanced AI models are mostly hosted in the cloud,...
4 MIN READ

Jun 16, 2025
Enhance Robot Learning with Synthetic Trajectory Data Generated by World Foundation Models
Generalist robotics have arrived, powered by advances in mechatronics and robot AI foundation models. But a key bottleneck remains: robots need vast training...
8 MIN READ

Jun 12, 2025
Accelerated Sequence Alignment for Protein Science with MMseqs2-GPU and NVIDIA NIM
Protein sequence alignment—comparing protein sequences for similarities—is fundamental to modern biology and medicine. It illuminates gene functions by...
9 MIN READ

Jun 12, 2025
Driving Toward Billion-Cell Analysis and Biological Breakthroughs with RAPIDS-singlecell
The future of cell biology and virtual cell models is dependent on measuring and analyzing data at scale. Single-cell experiments have been growing at an...
7 MIN READ

Jun 12, 2025
NVIDIA Holoscan Sensor Bridge Empowers Developers with Real-Time Data Processing
In the rapidly evolving robotics and edge AI landscape, the ability to efficiently process and transfer sensor data is crucial. Many edge applications are...
9 MIN READ

Jun 11, 2025
Develop Custom Physical AI Foundation Models with NVIDIA Cosmos Predict-2
Building smarter robots and autonomous vehicles (AVs) starts with physical AI models that understand real-world dynamics. These models serve two critical roles:...
7 MIN READ

Jun 11, 2025
Simplify End-to-End Autonomous Vehicle Development with New NVIDIA Cosmos World Foundation Models
The shift to end-to-end planning models for powering autonomous vehicles (AVs) is increasing the demand for high-quality, physically-based sensor data. These...
7 MIN READ

Jun 10, 2025
Transforming Quantum Education with AI Supercomputing and NVIDIA CUDA-Q Academic
As quantum computers scale, they will integrate with AI supercomputers to tackle some of the world’s most challenging problems. These accelerated quantum...
8 MIN READ

Jun 09, 2025
A Fine-tuning–Free Approach for Rapidly Recovering LLM Compression Errors with EoRA
Model compression techniques have been extensively explored to reduce the computational resource demands of serving large language models (LLMs) or other...
9 MIN READ

Jun 05, 2025
Analyzing Baseboard Management Controllers to Secure Data Center Infrastructure
Modern data centers depend on Baseboard Management Controllers (BMCs) for remote management. These embedded processors enable administrators to reconfigure...
9 MIN READ