OpenAI Introduces New Tool to Create Video From Text

OpenAI, the company behind ChatGPT, has introduced Sora, a new tool that uses generative AI to create videos from text.

According to OpenAI, Sora generates a video by starting with one that looks like static noise and gradually transforming it, removing the noise over many steps.
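
The step-by-step noise removal described above can be illustrated with a toy sketch. This is not OpenAI's method or code: the function name `toy_denoise`, the `target` list and the step schedule are all hypothetical, and the "denoiser" here simply cheats by knowing the clean signal, whereas a real diffusion model uses a trained network to estimate the noise.

```python
import random


def toy_denoise(target, steps=50, seed=0):
    """Start from pure noise and strip away a share of it at every step.

    Assumption: `target` stands in for what a trained model would predict.
    A real diffusion model never sees the clean video at generation time.
    """
    rng = random.Random(seed)
    # Begin with a "video" that is nothing but Gaussian static.
    x = [rng.gauss(0.0, 1.0) for _ in target]
    for step in range(steps):
        remaining = steps - step
        # Hypothetical denoiser: the noise is the gap to the clean signal.
        predicted_noise = [xi - ti for xi, ti in zip(x, target)]
        # Remove 1/remaining of that noise, so the final step removes the rest.
        x = [xi - ni / remaining for xi, ni in zip(x, predicted_noise)]
    return x


if __name__ == "__main__":
    clean = [0.0, 0.5, 1.0, 0.5]
    print(toy_denoise(clean))
```

Each pass leaves the sample slightly less noisy than before, which is the general idea behind the "many steps" OpenAI refers to.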

The tool is capable of generating entire videos all at once or extending generated videos to make them longer, said the company. By giving the model foresight of many frames at a time, OpenAI claims to have solved the problem of keeping a subject consistent even when it temporarily goes out of view.

So far, Sora is available only to a few researchers and video creators, but the company has showcased its capabilities on X, formerly known as Twitter.

According to a blog post from the company, Sora takes inspiration from large language models (LLMs), which acquire generalist capabilities by training on internet-scale data.

The success of the LLM paradigm is enabled in part by the use of tokens that unify diverse modalities of text: code, math and various natural languages, said the post.

Instead of using text tokens, Sora uses visual patches, said OpenAI. “We find that patches are a highly-scalable and effective representation for training generative models on diverse types of videos and images,” it added.

The technology turns videos into patches by first compressing them into a lower-dimensional latent space, and subsequently decomposing the representation into spacetime patches, said the post.

OpenAI has trained a network that reduces the dimensionality of visual data. It takes raw video as input and outputs a “latent representation that is compressed both temporally and spatially”.

Sora is trained on and subsequently generates videos within this compressed latent space.
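
The two stages described above, compressing a video and then cutting the result into spacetime patches, can be sketched in a few lines. Everything here is an illustrative assumption rather than OpenAI's implementation: the function names `compress` and `to_spacetime_patches` are hypothetical, the video is a single-channel grid of numbers, and simple averaging stands in for the trained compression network.

```python
def compress(video, ft=2, fs=2):
    """Shrink a [time][row][col] grid by averaging ft x fs x fs blocks.

    Assumption: average pooling is only a placeholder for the learned
    network that compresses video temporally and spatially.
    """
    T, H, W = len(video), len(video[0]), len(video[0][0])
    return [
        [
            [
                sum(
                    video[t][y][x]
                    for t in range(t0, t0 + ft)
                    for y in range(y0, y0 + fs)
                    for x in range(x0, x0 + fs)
                ) / (ft * fs * fs)
                for x0 in range(0, W - fs + 1, fs)
            ]
            for y0 in range(0, H - fs + 1, fs)
        ]
        for t0 in range(0, T - ft + 1, ft)
    ]


def to_spacetime_patches(latent, pt=2, ph=2, pw=2):
    """Cut the compressed grid into flat patches spanning time and space."""
    T, H, W = len(latent), len(latent[0]), len(latent[0][0])
    patches = []
    for t0 in range(0, T - pt + 1, pt):
        for y0 in range(0, H - ph + 1, ph):
            for x0 in range(0, W - pw + 1, pw):
                # One patch covers pt frames and a ph x pw region of each.
                patches.append([
                    latent[t][y][x]
                    for t in range(t0, t0 + pt)
                    for y in range(y0, y0 + ph)
                    for x in range(x0, x0 + pw)
                ])
    return patches


if __name__ == "__main__":
    # Eight 8x8 frames whose pixel value is simply the frame index.
    video = [[[float(t) for _ in range(8)] for _ in range(8)] for t in range(8)]
    latent = compress(video)
    patches = to_spacetime_patches(latent)
    print(len(latent), len(latent[0]), len(latent[0][0]))
    print(len(patches), len(patches[0]))
```

Each patch then plays a role comparable to a token in a language model: a small, fixed-size unit that the model can process in sequence.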

This article originally appeared on TV Tech sister brand TVBEurope

Jenny Priestley

Jenny has worked in the media throughout her career, joining TVBEurope as editor in 2017. She has also been an entertainment reporter, interviewing everyone from Kylie Minogue to Tom Hanks, and spent a number of years working in radio. She continues to appear on radio every week and occasionally pops up on TV.