• Mail us
  • Book a Meeting
  • Call us
  • Chat with us

 

WAN 2.1

WAN 2.1

​Wan 2.1 is an advanced AI video generation model developed by Alibaba's Tongyi Lab, designed to create high quality videos from text descriptions and images. It supports various tasks, including text to video, image to video, video editing, text to image and video to audio generation. 

Key Features of Wan 2.1:

  • State of the Art Performance: Wan 2.1 consistently outperforms existing open source models and state of the art commercial solutions across multiple benchmarks. ​

  • Consumer Grade GPU Compatibility: The T2V-1.3B model requires only 8.19 GB of VRAM, making it compatible with most consumer-grade GPUs. It can generate a 5 second 480p video on an RTX 4090 in about 4 minutes without optimization techniques like quantization. ​

  • Visual Text Generation: Wan 2.1 is the first video model capable of generating both Chinese and English text within videos, enhancing its practical applications.

  • Powerful Video VAE: Wan-VAE delivers exceptional efficiency and performance, encoding and decoding 1080p videos of any length while preserving temporal information, making it an ideal foundation for video and image generation.

Wan 2.1 is open source, with code and model weights available on GitHub. It has been integrated into platforms like ComfyUI and Diffusers, facilitating broader accessibility and ease of use.

Related Item