Alibaba Cloud open-sources 4 video-gen AI models

Alibaba Cloud has expanded its support for the open-source community by making four AI models freely available for public download and exploration, fostering further development in video generation technology.

The models belong to the Wan2.1 series of Tongyi Wanxiang (Wan) and come in 14-billion-parameter (14B) and 1.3-billion-parameter (1.3B) versions. Together, they have surpassed one million downloads across ModelScope and Hugging Face Hub.

Designed for generating high-quality images and videos from both text and image inputs, the release comprises two text-to-video models, T2V-14B and T2V-1.3B, and two image-to-video models, I2V-14B-720P and I2V-14B-480P.

Introduced earlier this year, the Wan2.1 series is the first family of video generation models to support text effects in both Chinese and English. It enhances realism by accurately handling complex movements, improving pixel quality, adhering to physical principles, and following instructions more precisely.

The Wan2.1 series has secured the top position on the leaderboard of VBench, a benchmark suite for video generation models. With this release, Alibaba Cloud aims to lower entry barriers for businesses seeking cost-effective AI-driven visual content creation.

Because the series is offered in both 14B and 1.3B variants, users can select the model that best aligns with their hardware capabilities and output requirements.