ChatGPT's New Image Generator Shattered My Expectations

chatgpt-image-dog-in-a-suit — Prompt: Can you generate a realistic colorful image of dog wearing a suit on the street in 16:9 ratio

Screenshot by Sabrina Ortiz

OpenAI may have kicked off the text-to-image generation craze with its DALL-E model, but since those earlier glory days, the AI company's offering has been lapped by much more capable image models. As a result, when OpenAI released its latest and greatest GPT-4o image generation model, I was skeptical. After testing it, I have changed my mind entirely.

Getting started

When DALL-E first launched, it lived on its own standalone website; since then, it has moved into ChatGPT. The move came with many benefits, including the ability to ask the AI chatbot for an image in the same interface where you are already chatting about something else, which eliminates constant context switching.

With the release of GPT-4o image generation, OpenAI kept this convenient format, switching the default image generator from DALL-E to GPT-4o for paid subscribers. As a result, it was super easy to start creating new images from my ChatGPT Plus account. All I had to do was enter a prompt describing what I wanted to see, and ChatGPT generated the image. Users can also access the model from the Sora interface.

Also: How to use OpenAI's Sora to create stunning AI-generated videos

You can also generate images if you are a free user. At launch, OpenAI said the model would be coming to all users, including free ones, but a day later CEO Sam Altman announced that the rollout to the free tier would be "delayed for awhile." The company then made it available to free users a week later.

However, if you are unimpressed when you try it in the free version, it is because the only way to invoke GPT-4o there is to type the shortcut "/create image." If you simply type a request such as "Create an image of XYZ," ChatGPT will default to the DALL-E model, which renders significantly lower-quality images. OpenAI does not explicitly state limits, but after generating three images from my free account, I hit my daily limit. ChatGPT Plus is therefore still a good option if you want more image generations.

The images

The moment you have been waiting for -- the images. After you insert a prompt, the AI outputs the generation in under a minute. The process does take a bit longer than it used to, but the images are worth the wait, delivering lots of details, texture, realism, and even text accuracy. Instead of describing it, I will include examples below so you can see for yourself.

Prompt: Can you generate a realistic image of a chameleon, up close, shot as if it were in National Geographic in 16:9 ratio?

chatgpt-image-lizard — Sabrina Ortiz/ via ChatGPT

Prompt: Can you generate an image of a laptop open on a desk that says, "This model is so good that it can even get text and hands right, which are usually major challenges for AI models," with hands typing on a keyboard in 16:9 ratio?

chatgpt-laptop-with-hands — Sabrina Ortiz/ via ChatGPT

Prompt: Can you generate a realistic photo of a close-up of a woman in a crowd in Times Square looking at the camera and smiling, with the quality of one taken on a DSLR?

chatgpt-woman smiling — Sabrina Ortiz/ via ChatGPT

As seen above, the image generator does a great job of adhering to the prompt and delivering high-quality, realistic images. However, one of the truest measures of an AI model's performance is how it compares to competitors on the market. To give you a good indicator of this, I gave it the same prompt I used to test all of the major AI image generators, including Midjourney, Google's Imagen 3, Adobe Firefly, and more.

I am attaching GPT-4o's rendition below. You can see how it fares against all of the other AI image generators in this article, including DALL-E's rendition, which is clearly far behind what the new model can do.

Prompt: Can you generate an image of a vibrant, realistic hummingbird perched on a tree?

chatgpt-image-hummingbird — Sabrina Ortiz/ via ChatGPT

Other notable features

Even though image quality is perhaps the model's biggest win, there are other benefits as well. One of the biggest is that it lives in the chatbot's interface, which makes it easy to tweak generations with simple natural language prompts. Also, because the chatbot has the context of what you just asked it, it can take that into account when building the image.

For example, if you are chatting with it about throwing a birthday party, you may be able to say, "Can you now create an invite that has the information above on it?" instead of having to retype everything. When I tried this myself, I started chatting with ChatGPT about throwing a housewarming, and when I asked it to create an invite, I did not have to repeat the information I had already provided.

Housewarming Party Invite- ChatGPT — Screenshot by Sabrina Ortiz/

You can also upload reference images and then ask ChatGPT to create a different version of them or use them as elements of a new image. For example, you can upload a selfie and have it regenerated in anime style, as seen in Altman's new X post.

changed my pfp but maybe someone will make me a better one
