Google Imagen Is The Latest AI That Can Create Images From Text

 Google Imagen: an artificial intelligence technology that turns text into images

Google has unveiled a new Al brand called Google Imagen. This AI technology from Google is the text-to-image generator.

Google Imagen Is The Latest AI That Can Create Images From Text

Although it is not available for use to the general public yet. But the images that you can generate via simple texts are absolutely amazing.

This technology is based on taking text-to-image text-input models such as "a cat on a skateboard" and producing a relevant image. It's something that's been done over the years but has recently improved in terms of quality and accessibility.

Imagine "a robot couple feeding with the Eiffel Tower in the background"? For us humans, it's very easy to visualize this in our heads. Of course, the most creative people among us can easily enliven these words in their artwork.

Now Google's AI model called Imagen is able to do something similar. In a new ad, Google demonstrated how Imagen, a text-to-image publishing model, is able to create images based on typed text

How does Google imagen work?

Firstly, Google Imagen uses different diffusion techniques, which basically start with a pure noise image. It slowly refines it bit by bit until the model thinks it can’t make it look any more like a cat on a skateboard than it already does.

This is an improvement over top-to-bottom generators that sometimes get it tremendously wrong on their first guess. The other element is improved language understanding through large language models. It’s done by using the transformer approach. A few other recent advances have led to convincing language models like GPT-3 and others.

The technical aspects work something like this:

Google Imagen starts by generating a small (64×64 pixels) image and then does two “super-resolution” passes on it to bring it up to 1024×1024. This isn’t like normal upscaling, though, as AI creates new details with the smaller image, using the original as a base.

The AI has an understanding of simpler objects and how do they look like. For instance, generating details in a cat’s eye is going to be an easy feat as the model has been trained to fill in tiny details.


turn any text to an image with google's latest AI tool 'imagen'

وضع القراءة :
حجم الخط
تباعد السطور