Google Releases its Latest AI Media Production Models, Veo and Imagen 3

By Media Infotainment Team | Thursday, 16 May 2024

Google unveiled its newest AI media production models today: Imagen 3, its latest text-to-image model, and Veo, which can create "high-quality" 1080p videos. Neither seems especially groundbreaking, but Google is positioning them against OpenAI's Sora video model and DALL-E 3, a tool that has become almost synonymous with AI-generated imagery.

According to Google, Veo can produce any kind of video you can imagine, since it has "an advanced understanding of natural language and visual semantics". The AI-generated videos can run "beyond a minute." Veo can also understand visual and cinematic techniques, such as the concept of a timelapse. Then again, that ought to be the bare minimum for an AI video generation model.

To demonstrate that Veo isn't out to take away creative jobs, Google has also teamed up with Donald Glover and Gilga, his creative agency, to showcase the model's potential. Glover and his team use text prompts to produce a short promotional film that shows a yacht sailing across the ocean and a convertible arriving at a residence in Europe.

According to Google, Veo renders high-definition imagery better and simulates real-world physics more accurately than the company's previous models.

"Everyone should aspire to be a director, and everybody will become one," Glover declares in the video, fully earning his Google paycheck. "Simply said, narrative is at the core of everything. We shall understand one another better the closer we may get to sharing our tales with one another."

Aside from the macabre curiosity of seeing a computer attempt to algorithmically imitate the work of human artists, it is unclear whether anybody will truly want to watch AI-generated video. That isn't stopping Google or OpenAI from promoting these tools in the hope that they will be useful, or at the very least profitable. Veo will be available to some creators through Google's VideoFX tool starting today. According to the company, Veo will also be added to YouTube Shorts and other products. If Veo does end up becoming a feature of YouTube Shorts, Google could gain an edge over TikTok in at least one area.

As for Imagen 3, Google is making the usual promises: It is said to be the company's "highest quality" text-to-image model, with fewer artifacts and an "incredible level of detail" for "photorealistic, lifelike images". Naturally, the true test will be how it handles prompts in comparison to DALL-E 3. Google claims that Imagen 3 is better at picking up details from lengthy prompts and that it renders text better than Imagen 2.

Google is also collaborating with recording artists like Wyclef Jean and Bjorn to try out its Music AI Sandbox, a collection of tools that can help with developing songs and beats. Though we only got a quick peek, the collaboration has resulted in some fascinating demos.

The sun rises and sets. We are all slowly losing our minds. And AI is becoming more intelligent every day. That appears to be the main lesson from Google's most recent media production tools. Naturally, they're improving! In an attempt to control the next major advancement in computing, Google is investing billions to bring the idea of artificial intelligence to life. Will any of this genuinely improve our quality of life? Can these tools ever create work that truly resonates with people? Tune in to Google I/O each year until either AGI materializes or our society implodes.
