Vietnam.vn - Nền tảng quảng bá Việt Nam

Experience Google's impressive AI image generation model

Gemini 2.0 Flash's photo editing capabilities allow users to edit photos faster and more efficiently than some free models.

Zing NewsZing News19/03/2025

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 1

Google has just widely released the feature of generating original images from the Gemini 2.0 Flash language model. With this ability, the model can compose and edit existing images based on text input. Photo: Google .

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 2

Unlike other imaging tools that combine diffusion models with large language models (LLM), Gemini 2.0 Flash is multimodal, able to process input and output in multiple formats (text, audio, images, etc.). In theory, this technique improves image quality, allowing the tool to understand context and continue editing within the same conversation. Photo: Google .

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 3

To try out the tool, users and developers need to go to Google AI Studio, switch to Gemini 2.0 Flash (Image Generation) Experimental mode. In the Output format section, select Images and text . The interactive area with the tool is in the middle of the screen. Below are some of the main uses of Gemini 2.0 Flash.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 4

Create images and text at the same time. According to Google, Gemini 2.0 Flash supports creating text and images at the same time. For example, you can ask the model to tell a story and draw illustrations. In my experience, the speed of creating images and text is quite fast, averaging 5-10 seconds/image (depending on length and complexity).

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 5

Edit photos in the same conversation . Thanks to its contextual understanding, Gemini 2.0 Flash supports feedback and photo editing. If you are not satisfied with the color, object or any detail in the photo, just enter a command and the tool will change it without affecting other elements.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 6

Edit an existing photo . Similarly, simply upload any photo and ask the tool to edit details in the photo, such as changing the color, adding objects, or adjusting the background. Users can give continuous feedback until the tool produces satisfactory results.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 7Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 8

Object separation . Gemini 2.0 Flash's object separation ability is quite good, but there are still weaknesses related to human hands. The tool understands Vietnamese meaning, allowing separation and replacement of backgrounds according to many different topics.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 9Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 10

Expand/change scene . In this case, the user can ask the tool to shrink the existing image, filling in the gap with a new scene based on the description. Since it is still in the testing phase, the tool sometimes crashes or does not create the desired image.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 11

Create images with a lot of text . According to Google, Gemini 2.0 Flash can create images with long text without spelling mistakes or strange words. This is one of the many weaknesses of other image generation models. However, experience shows that Vietnamese language text is still difficult to read in some places. The tool also cannot translate text in the text without specific suggestions.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 12Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 13

Add people to photos . The tool supports adding people to existing photos, with the correct appearance as described. Since it is currently in free trial release, each conversation thread has a limit of about 30,000 tokens. However, users do not have to worry because a question/answer only costs about 300-500 tokens, not too much if you only edit and create basic photos.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 14Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 15

Change shooting angle . Users can request to change different shooting angles of the same photo. Of course, the tool supports adjusting different details until creating a satisfactory photo.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 16

Knowledge mining . According to Google, Gemini 2.0 Flash is trained on a large amount of knowledge with reasoning capabilities. For example, you can ask the tool to create a recipe based on existing knowledge, then draw an illustration for easy understanding. Similar to other tools, Google notes that Gemini 2.0 Flash only has general knowledge, not too in-depth or absolutely accurate.

Google Gemini 2.0 Flash,  trai nghiem Google Gemini,  tri tue nhan tao,  cong cu AI Google anh 17

Controversial feature . After its widespread release, many people discovered that Gemini 2.0 Flash can remove watermarks from photos. This ability is not accepted by AI tools like GPT-4o. Since it is still experimental, Google is likely to fix this in the near future. Photo: @deedydas/X .

Source: https://znews.vn/gemini-20-flash-lam-duoc-gi-post1539018.html


Comment (0)

No data
No data

Heritage

Figure

Business

No videos available

News

Political System

Local

Product