China unveils Sora challenger able to produce videos from text similar to OpenAI tool, though much shorter

by Ben Jiang at scmp.com

China has come up with its own text-to-video artificial-intelligence (AI) tool similar to OpenAI’s Sora, although the new model can only produce videos no longer than 16 seconds, compared with the US service’s 60 seconds.

Vidu, the country’s best hope so far in catching up with Sora, was launched over the weekend by start-up Shengshu Technology in a joint effort with the prestigious Beijing-based Tsinghua University.

The model is able to produce videos with 1080p resolution based on simple text prompts, the company said.

“Vidu is the latest achievement of self-reliant innovation, with breakthroughs in many areas,” said Zhu Jun, chief scientist at Shengshu who is also deputy dean at Tsinghua’s Institute for AI, announcing the model at the Zhongguancun Forum held in the Chinese capital, according to a report by Beijing News.

Vidu is “imaginative”, “can simulate the physical world” and “produce 16-second videos with consistent characters, scenes and timeline”, Zhu said, adding that the model is also able to comprehend “Chinese elements”.

 

During the model’s unveiling, Shengshu released several demo clips, including one featuring a panda playing the guitar while sitting on grass and another of a puppy swimming in a pool, both showing vivid details.

Vidu’s debut has raised hopes in the country, which is racing to catch up with leading global generative AI players, such as Microsoft-backed OpenAI.

Unlike OpenAI’s ChatGPT, which has inspired a raft of China-based competitors after launching in November 2022, previews of Sora videos released in February have not drawn a similar level of enthusiasm from Chinese Big Tech firms or start-ups.