Runway’s latest AI video generator brings giant cotton candy monsters to life

June 19, 2024

New Gen-3 Alpha AI video generator can create detailed humans and surreal situations.

Screen capture of a Runway Gen-3 Alpha video generated with the prompt “A giant humanoid, made of fluffy blue cotton candy, stomping on the ground, and roaring to the sky, clear blue sky behind them.”

On Sunday, Runway announced a new AI video synthesis model called Gen-3 Alpha that’s still under development, but it appears to create video of similar quality to OpenAI’s Sora, which debuted earlier this year (and has also not yet been released). From text prompts, it can generate novel, high-definition video of subjects that range from realistic humans to surrealistic monsters stomping the countryside.

Unlike Runway’s previous best model from June 2023, which could only create two-second-long clips, Gen-3 Alpha can reportedly create 10-second-long video segments of people, places, and things that have a consistency and coherency that easily surpasses Gen-2. If 10 seconds sounds short compared to Sora’s full minute of video, consider that the company is working with a shoestring budget of compute compared to more lavishly funded OpenAI—and actually has a history of shipping video generation capability to commercial users.

Gen-3 Alpha does not generate audio to accompany the video clips, and it’s highly likely that temporally coherent generations (those that keep a character consistent over time) are dependent on similar high-quality training material. But Runway’s improvement in visual fidelity over the past year is difficult to ignore.

AI video heats up

It’s been a busy couple of weeks for AI video synthesis in the AI research community, including the launch of the Chinese model Kling, created by Beijing-based Kuaishou Technology (sometimes called “Kwai”). Kling can generate two minutes of 1080p HD video at 30 frames per second with a level of detail and coherency that reportedly matches Sora.

Gen-3 Alpha prompt: “Subtle reflections of a woman on the window of a train moving at hyper-speed in a Japanese city.”

Not long after Kling debuted, people on social media began creating surreal AI videos using Luma AI’s Luma Dream Machine. These videos were novel and weird but generally lacked coherency; we tested out Dream Machine and were not impressed by anything we saw.

Meanwhile, one of the original text-to-video pioneers, New York City-based Runway—founded in 2018—recently found itself the butt of memes that showed its Gen-2 tech falling out of favor compared to newer video synthesis models. That may have spurred the announcement of Gen-3 Alpha.

Gen-3 Alpha prompt: “An astronaut running through an alley in Rio de Janeiro.”

Generating realistic humans has always been tricky for video synthesis models, so Runway specifically shows off Gen-3 Alpha’s ability to create what its developers call “expressive” human characters with a range of actions, gestures, and emotions. The company’s provided examples aren’t particularly expressive (mostly people just slowly staring and blinking), but they do look realistic.

Provided human examples include generated videos of a woman on a train, an astronaut running through a street, a man with his face lit by the glow of a TV set, a woman driving a car, and a woman running, among others.

Gen-3 Alpha prompt: “A close-up shot of a young woman driving a car, looking thoughtful, blurred green forest visible through the rainy car window.”

The generated demo videos also include more surreal video synthesis examples, including a giant creature walking in a rundown city, a man made of rocks walking in a forest, and the giant cotton candy monster seen below, which is probably the best video on the entire page.

Gen-3 Alpha prompt: “A giant humanoid, made of fluffy blue cotton candy, stomping on the ground, and roaring to the sky, clear blue sky behind them.”

Gen-3 will power various Runway AI editing tools (one of the company’s most notable claims to fame), including Multi Motion Brush, Advanced Camera Controls, and Director Mode. It can create videos from text or image prompts.

Runway says that Gen-3 Alpha is the first in a series of models trained on a new infrastructure designed for large-scale multimodal training, taking a step toward the development of what it calls “General World Models,” which are hypothetical AI systems that build internal representations of environments and use them to simulate future events within those environments.

A few limitations

While these demos look fun at first glance, it’s worth mentioning a few drawbacks of an announcement like this. Since Gen-3 is not yet public and we do not have access to it, we have not had the chance to evaluate it. That means that even if you take Runway’s stated claim (“All of the videos on this page were generated with Gen-3 Alpha with no modifications”) at face value, the videos were very likely cherry-picked to show the model’s best results.

A recap of AI video synthesis on Ars Technica

Since 2022, we’ve covered a number of AI video synthesis models. We’ve also missed a few notable projects, such as Phenaki (mentioned briefly in one piece), Runway’s Gen-1, Pika (mentioned in a roundup syndicated from FT), Luma Dream Machine, and Kling (both mentioned above). To provide a brief rundown of where the technology has been so far, here’s a list of related Ars Technica articles. This is as much for our benefit as it is for yours because it’s sometimes difficult to keep all of these AI video models straight.

9/9/2022 – Runway teases AI-powered text-to-video editing using written prompts
9/29/2022 – Meta announces Make-A-Video, which generates video from text [Make-A-Video]
10/5/2022 – Google’s newest AI generator creates HD video from text prompts [Imagen Video]
3/30/2023 – AI-generated video of Will Smith eating spaghetti astounds with terrible beauty [ModelScope]
4/17/2023 – Adobe teases generative AI video tools [Firefly Video]
5/2/2023 – AI-generated beer commercial contains joyful monstrosities, goes viral [Gen-2]
11/27/2023 – New “Stable Video Diffusion” AI model can animate any still image [Stable Video Diffusion]
12/15/2023 – These AI-generated news anchors are freaking me out [Channel 1]
1/24/2024 – Google’s latest AI video generator can render cute animals in implausible situations [Lumiere]
2/16/2024 – OpenAI collapses media reality with Sora, a photorealistic AI video generator [Sora]
2/20/2024 – Will Smith parodies viral AI-generated video by actually eating spaghetti
2/23/2024 – Tyler Perry puts $800 million studio expansion on hold because of OpenAI’s Sora
5/15/2024 – Google unveils Veo, a high-definition AI video generator that may rival Sora [Veo]

Even a cursory look at the progress since the earliest models above shows that AI video synthesis technology is steadily on the move, and further gains in capability are likely limited only by available compute and the supply of high-quality training data. We’ll keep you posted.
