Skip to content
@nvidia-cosmos

NVIDIA Cosmos

NVIDIA Cosmos is a world foundation model platform for accelerating the development of physical AI systems.

NVIDIA Cosmos

NVIDIA Cosmos™ is a platform purpose-built for physical AI, featuring state-of-the-art generative world foundation models (WFMs), robust guardrails, and an accelerated data processing and curation pipeline. Designed specifically for real-world systems, Cosmos enables developers to rapidly advance physical AI applications such as autonomous vehicles (AVs), robots, and video analytics AI agents.

Cosmos World Foundation Models come in three model types which can all be customized in post-training: cosmos-predict, cosmos-transfer, and cosmos-reason:

Predict Transfer Reason
Type World Generation Multi-Controlnet Reasoning VLM
Function Predict novel future frames given initial frames Transfer existing control frames into photoreal frames within a video clip Reason against frames within a video clip
Use Cases Data Generation & Policy Evaluation Data Augmentation Data Curation
Inputs Text, Image, Video Multiple Video Modalities such as RGB, Depth, Segmentation, and more. Video & Text
Outputs Video Video Text

NVIDIA Cosmos Cookbook

The Cosmos Cookbook offers developers step-by-step recipes and post-training scripts to quickly build, customize, and deploy NVIDIA’s Cosmos world foundation models for robotics and autonomous systems.

Use Cases in Physical AI Development

Our world foundation models are purpose-built to accelerate improving performance in downstream model tasks in various stages, as illustrated here in the flywheel.

Cosmos Data Flywheel

Pinned Loading

  1. cosmos-reason1 cosmos-reason1 Public

    Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

    Python 834 74

  2. cosmos-predict2.5 cosmos-predict2.5 Public

    Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

    Python 486 38

  3. cosmos-transfer2.5 cosmos-transfer2.5 Public

    Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control inputs.

    Python 256 33

  4. cosmos-cookbook cosmos-cookbook Public

    Post-training scripts and samples for NVIDIA Cosmos ecosystem

    Python 113 27

Repositories

Showing 10 of 12 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…