CogVideoXTransformer3DModel

A Diffusion Transformer model for 3D data, used in CogVideoX, was introduced in CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer by Tsinghua University & ZhipuAI.

The model can be loaded with the following code snippet.

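    import torch
    from diffusers import CogVideoXTransformer3DModel

    # Load only the transformer weights from the "transformer" subfolder of the checkpoint.
    transformer = CogVideoXTransformer3DModel.from_pretrained(
        "THUDM/CogVideoX-2b", subfolder="transformer", torch_dtype=torch.float16
    ).to("cuda")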
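The snippet above assumes a CUDA device is available. As a minimal sketch for machines that may not have one, the loader below falls back to the CPU with `float32`. The helper names `pick_device_and_dtype` and `load_transformer` are hypothetical and not part of diffusers, and the choice of `float32` on CPU is an assumption made here because half precision is often slow or poorly supported there.

```python
from __future__ import annotations


def pick_device_and_dtype(cuda_available: bool) -> tuple[str, str]:
    """Hypothetical helper: return (device, torch dtype name) for loading.

    Assumption: float16 on GPU (as in the snippet above), float32 on CPU.
    """
    if cuda_available:
        return "cuda", "float16"
    return "cpu", "float32"


def load_transformer(repo_id: str = "THUDM/CogVideoX-2b"):
    """Hypothetical wrapper around CogVideoXTransformer3DModel.from_pretrained."""
    # Imported here so pick_device_and_dtype can be used without torch installed.
    import torch
    from diffusers import CogVideoXTransformer3DModel

    device, dtype_name = pick_device_and_dtype(torch.cuda.is_available())
    transformer = CogVideoXTransformer3DModel.from_pretrained(
        repo_id,
        subfolder="transformer",
        torch_dtype=getattr(torch, dtype_name),  # e.g. torch.float16
    )
    return transformer.to(device)
```

Calling `load_transformer()` downloads the checkpoint on first use, so it is best run once and the returned module reused.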

CogVideoXTransformer3DModel

[[autodoc]] CogVideoXTransformer3DModel

Transformer2DModelOutput

[[autodoc]] models.modeling_outputs.Transformer2DModelOutput
