Google takes on OpenAI with flashy text-to-image generator

May 25, 2022

The AI imagery competition is getting personal. Google this week unveiled a new challenger to OpenAI’s vaunted DALL-E 2 text-to-image generator, and took shots at its rival’s efforts. Both models convert text prompts into pictures. But Google’s researchers claim their system provides “unprecedented photorealism and deep language understanding.”

Human raters preferred Imagen over DALL-E 2 for both sample quality and image-text alignment. Credit: Saharia et al.

The cringingly named Imagen system uses a large pre-trained language model as a text encoder. A cascade of diffusion models then turns the user’s words into pictures. In tests, the Google team said Imagen “significantly outperformed” DALL-E 2. Imagen particularly…

This story continues at The Next Web
