NVIDIA Cosmos for Developers

NVIDIA Cosmosâ„¢ is a platform of state-of-the-art generative world foundation models (WFMs), advanced tokenizers, guardrails, and an accelerated data processing and curation pipeline for autonomous vehicles (AVs) and robotics developers.

Build, evaluate, deploy, and simulate physical AI models faster while minimizing testing and validation risks in the real world.

Explore Models Documentation

NVIDIA Cosmos World Foundation Models

A family of pretrained models for world generation as videos for accelerating physical AI development. Available openly to developers on NGC, Hugging Face, and GitHub.

Cosmos Predict-2

Our best-performing world foundation model yetâ€”higher fidelity, flexible frame rates and resolutions, fewer hallucinations, and better text, object, and motion control in the video.

Generate previews from text in under 4s and up to 30s of future world video from reference image, or preview.

Download Cosmos Predict-2 Models

Cosmos Transfer

For controllable and photorealistic synthetic data at scale.

Input: Segmentation maps, depth signals, lidar scans, key points, trajectories, HD maps, and ground-truth simulations from NVIDIA Omniverseâ„¢.

Output: Photorealistic world scenes, conditioned based on inputs, mirroring layout, object placement, and motion.

Download Cosmos Transfer-1 Models

Cosmos Reason

For physical AI reasoning.

Fully customizable, multimodal reasoning model trained using visual-language fine-tuning and reinforcement learning that uses a chain of thoughts to plan responses.

The model enables intelligent decision-making by reasoning and rewarding optimal responses.

Download Cosmos Reason Models

Cosmos Predict-1

For out-of-the box world generation and post-training.

A generalist model that generates world states from text or video prompts and synthesizes continuous motion by predicting frames between a given start and end frame.

These models range from 4 billion to 15 billion parameters and can be used based on inference requirements.

Download Cosmos Predict-1 Models

Cosmos Tokenizers

A suite of image and video tokenizers that advances the state of the art in visual tokenization for world model training.

Download Cosmos Tokenizer Models

Introductory Resources

Develop Custom Physical AI Foundation Models With NVIDIA Cosmos Predict-2

Cosmos Predict-2 is a suite of improved physical AI foundation models designed to generate realistic, physics-aware simulation data for training robots and AVs.

Read Tech Blog

End-to-End AV Development With New Cosmos WFMs

Cosmos Predict-2 and Cosmos Transfer, accelerate end-to-end AV development by enabling high-quality SDG and unlocking new data sources such as generating multi-view videos from single-view footage.

Read Tech Blog

Scale SDG With the NVIDIA NeMo Agent Toolkit

Agent toolkit is built using NVIDIA Omniverse, OpenUSD, Cosmos WFMs, and NVIDIA NIM microservices to automate and scale the generation of high-quality SDG, and accelerate the training and deployment of physical AI systems.

Read Tech Blog

Starter Kits

Start solving physical AI challenges by developing custom world models with Cosmos or using Cosmos WFMs for downstream use cases. Explore implementation scripts, explainer blogs, and more how-to documentation for various stages of physical AI development.

Post-Training Cosmos WFMs

Cosmos WFMs are purpose-built for post-training. Use domain-specific datasets to build world models or post-train for different types of output, such as action generation for policy models.

Synthetic Data Generation

Build and deploy world models for infinite domain-specific synthetic data. Use NVIDIA Omniverse for physics-based conditioning.

Cosmos Learning Library

More Resources

GitHub Forums

Read Cosmos FAQ

Sign Up for the
Developer Newsletter

Ethical Considerations

NVIDIA believes Trustworthy AI is a shared responsibility, and we have established policies and practices to enable development for a wide array of AI applications. When downloading or using this model in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

NVIDIA has collaborated with Google Deepmind to watermark generated videos from the NVIDIA API catalog.

For more detailed information on ethical considerations for this model, please see the System Card, Model Card++ Explainability, Bias, Safety & Security, and Privacy Subcards. Please report security vulnerabilities or NVIDIA AI concerns here.

Get Started With NVIDIA Cosmos Today

Try Now

NVIDIA Cosmos for Developers

How It Works

NVIDIA Cosmos World Foundation Models

Cosmos Predict-2

Cosmos Transfer

Cosmos Reason

Cosmos Predict-1

Cosmos Tokenizers

Cosmos WFM Post-Training Samples

Cosmos Guardrails

Cosmos Prompt Upsampler

Introductory Resources

Develop Custom Physical AI Foundation Models With NVIDIA Cosmos Predict-2

End-to-End AV Development With New Cosmos WFMs

Scale SDG With the NVIDIA NeMo Agent Toolkit

Starter Kits

Post-Training Cosmos WFMs

Synthetic Data Generation

Cosmos Learning Library

More Resources

GitHub Forums

Read Cosmos FAQ

Sign Up for the
Developer Newsletter

Ethical Considerations

NVIDIA Cosmos for Developers

How It Works

NVIDIA Cosmos World Foundation Models

Cosmos Predict-2

Cosmos Transfer

Cosmos Reason

Cosmos Predict-1

Cosmos Tokenizers

Cosmos WFM Post-Training Samples

Cosmos Guardrails

Cosmos Prompt Upsampler

Introductory Resources

Develop Custom Physical AI Foundation Models With NVIDIA Cosmos Predict-2

End-to-End AV Development With New Cosmos WFMs

Scale SDG With the NVIDIA NeMo Agent Toolkit

Starter Kits

Post-Training Cosmos WFMs

Synthetic Data Generation

Cosmos Learning Library

More Resources

GitHub Forums

Read Cosmos FAQ

Sign Up for the Developer Newsletter

Ethical Considerations

Sign Up for the
Developer Newsletter