Nvidia’s Cosmos-Transfer1 is a diffusion-based conditional world model for multimodal controllable world generation.