GPT Image 2: High vs Medium vs Low Quality — Is It Worth Paying 15× More?

By VibeDex ResearchOriginally published: April 24, 2026Updated: 25 April 2026

TL;DR

GPT Image 2's three quality tiers separate cleanly when judged blind. Mean weighted scores across 29 complex prompts (only those where all three tiers successfully generated), judged in three independent blind passes: high 3.54, medium 3.36, low 3.17 — a 0.37-point spread (low → high). Per-prompt, high tier wins 76% of head-to-head comparisons (22 of 29), beating medium on 83% of prompts and low on 90%. Tier ordering is monotonic — paying more genuinely does buy more quality, just not 15× more. The 15× price premium from low to high translates to a 12% mean-score lift; cost-effectiveness depends on whether you care about peak single-render quality or batch-average quality.[1]

Why We Ran This Test

OpenAI's GPT Image 2 (internally gpt-image-2, accessed via OpenAI's API or third-party providers like Runware) exposes three quality tiers: low, medium, and high. The price difference is extreme — 15× between the cheapest and most expensive tier at 1024×1024 resolution. OpenAI's official guidance suggests using high quality for "dense layouts or heavy in-image text"[1] and low for latency-sensitive use cases.

There is no published rigorous comparison. The Artificial Analysis leaderboard only benchmarks the high tier of GPT Image 2[3], implicitly assuming it is the best version. We decided to test directly: generate the same prompt at all three tiers and compare side-by-side.

The honest answer: tiers separate cleanly — when judged blind by an independent reviewer with no knowledge of which tier produced which image, high tier wins three quarters of the time. The quality difference is real, but the question worth asking is whether 76% per-prompt win rate justifies a 15× price difference. For most workflows, no — medium tier captures most of the quality at a fraction of the cost.

Headline Numbers

We ran the same 29 complex prompts (average length ~750 characters — the hardest prompts in our 200-prompt benchmark) at all three quality tiers of GPT Image 2, then judged every image in three independent blind passes using Claude Opus 4.7. Each tier's score below is the mean of three independent judgments per image.

#ModelMean ScoreCost/ImageTier
1GPT Image 2 (High)3.54$0.212Premium
2GPT Image 2 (Medium)3.36$0.055Standard
3GPT Image 2 (Low)3.17$0.014Budget

Mean weighted score across 29 prompts, averaged over three independent blind judging passes per image. Spread between tiers: 0.37 (low → high).

The top-line result: paying 15× more for high tier over low tier buys you a 0.37-point score improvement — meaningful, but not proportional to the price gap. The bigger jump happens between low and medium (+0.19 points for 4× the cost); the high-tier premium adds another +0.18 points on top. On a cost-per-score basis, medium tier is the clear winner.

Methodology note: scores restricted to the 29 prompts where all three tiers generated successfully. Two prompts in the original 31 had at least one tier fail (high tier timed out on prompt-0147; low tier on prompt-0198) and are excluded from all comparisons in this article so each tier is judged on the same set.

Per-Prompt Winner Distribution

The aggregate means already separate cleanly; the per-prompt picture confirms it. Across 29 prompts (using a 0.05 weighted-score threshold for ties), here is how the three tiers split the wins after averaging three independent blind passes:

22
High wins
76%
4
Medium wins
14%
3
Low wins
10%
0
True ties (spread < 0.05)
0%

High tier wins 76% of prompts — 7× more often than low tier (10%). The 4 medium-tier wins and 3 low-tier wins are scattered across categories with no systematic pattern; treat them as cases where the lower tier happened to nail a specific composition rather than a strength of that tier.

The earlier hypothesis that "high tier produces a better-best but not a better-average" doesn't survive blind triangulation. Under three independent blind passes, high tier produces both — a better mean (3.54 vs 3.17 for low) AND a higher win rate (76% vs 10%). The original tie at the aggregate was an artefact of the judge seeing all three tiers side-by-side and anchoring scores together.

Side-by-Side: All 27 Prompts Across Three Tiers

Every prompt we ran (with 2 of the original 29 omitted from this gallery — see note at the end of this section), grouped by which tier won. Click any image to open the full-resolution version. Sorted within each group by how divergent the tiers were — biggest gaps first, so the most decisive cases surface at the top of each section. Hover a tile for a one-sentence reason behind the score.

Note: rationale text shown in each tile is excerpted from one of the three Opus 4.7 blind passes; passes occasionally disagreed and we picked one reading per tile. 2 prompts (a left-hook boxer and an airport-arrivals scene) have been removed from this gallery because the consolidated rationale text contained errors we couldn't cleanly fix without re-running the judge. Aggregate scores and win-rates above still reflect the full 29-prompt set.

High tier wins (20)

Prompts where the premium tier earned its price by delivering specifically-requested features — fine optical physics, precise biomechanics, complex scene logic.

prompt-0181 · spread 0.50 · High wins

Ultra high-resolution commercial photograph of a diamond engagement ring on a reflective black glass surface, the round brilliant cut diamond showing...

Low ($0.014) - Ultra high-resolution commercial photograph of a diamond engagement ring on a reflective black glass surface, the round brilliant cut diamond showing precise facet geometry with fire — spectral light dispersion splitting white light into rainbow flashes along the crown facets, the platinum band's mirror finish reflecting a clean studio environment without distortion, micro-detail visible in the prong setting showing each individual claw's contact point with the diamond girdle, the gallery beneath the diamond revealing the underside facets through the openwork, no visible dust particles or fingerprints on any surface, pin-sharp focus across the entire ring achieved through helicon focus stacking of thirty-two exposures, the reflection on the glass surface equally detailed showing the ring's underside, Schneider 120mm macro on a medium format Phase One IQ4 150MP back, Broncolor Scoro flash heads with focusing tubes for precise specular highlight placement
?

Low ($0.014)

2.97

Medium ($0.055) - Ultra high-resolution commercial photograph of a diamond engagement ring on a reflective black glass surface, the round brilliant cut diamond showing precise facet geometry with fire — spectral light dispersion splitting white light into rainbow flashes along the crown facets, the platinum band's mirror finish reflecting a clean studio environment without distortion, micro-detail visible in the prong setting showing each individual claw's contact point with the diamond girdle, the gallery beneath the diamond revealing the underside facets through the openwork, no visible dust particles or fingerprints on any surface, pin-sharp focus across the entire ring achieved through helicon focus stacking of thirty-two exposures, the reflection on the glass surface equally detailed showing the ring's underside, Schneider 120mm macro on a medium format Phase One IQ4 150MP back, Broncolor Scoro flash heads with focusing tubes for precise specular highlight placement
?

Medium ($0.055)

3.40

High ($0.212) - Ultra high-resolution commercial photograph of a diamond engagement ring on a reflective black glass surface, the round brilliant cut diamond showing precise facet geometry with fire — spectral light dispersion splitting white light into rainbow flashes along the crown facets, the platinum band's mirror finish reflecting a clean studio environment without distortion, micro-detail visible in the prong setting showing each individual claw's contact point with the diamond girdle, the gallery beneath the diamond revealing the underside facets through the openwork, no visible dust particles or fingerprints on any surface, pin-sharp focus across the entire ring achieved through helicon focus stacking of thirty-two exposures, the reflection on the glass surface equally detailed showing the ring's underside, Schneider 120mm macro on a medium format Phase One IQ4 150MP back, Broncolor Scoro flash heads with focusing tubes for precise specular highlight placement
?

High ($0.212)

3.47

prompt-0128 · spread 0.95 · High wins

Cinematic establishing shot of a World War II era airfield, a B-17 Flying Fortress parked on the tarmac with all four Wright Cyclone radial engines...

Low ($0.014) - Cinematic establishing shot of a World War II era airfield, a B-17 Flying Fortress parked on the tarmac with all four Wright Cyclone radial engines visible showing the correct nine-cylinder configuration and exhaust collector rings, the distinctive chin turret beneath the nose with twin fifty-caliber barrels, ball turret retracted under the belly, the olive drab paint scheme with nose art of a pin-up girl and mission tally marks below the cockpit window, ground crew loading ammunition belts through the waist gunner hatches, fuel truck connected via a hose to the wing tank, all mechanical details at historically accurate scale, dramatic storm clouds building on the horizon, shot in the style of a Christopher Nolan period film with IMAX-quality clarity and muted color palette
?

Low ($0.014)

2.48

Medium ($0.055) - Cinematic establishing shot of a World War II era airfield, a B-17 Flying Fortress parked on the tarmac with all four Wright Cyclone radial engines visible showing the correct nine-cylinder configuration and exhaust collector rings, the distinctive chin turret beneath the nose with twin fifty-caliber barrels, ball turret retracted under the belly, the olive drab paint scheme with nose art of a pin-up girl and mission tally marks below the cockpit window, ground crew loading ammunition belts through the waist gunner hatches, fuel truck connected via a hose to the wing tank, all mechanical details at historically accurate scale, dramatic storm clouds building on the horizon, shot in the style of a Christopher Nolan period film with IMAX-quality clarity and muted color palette
?

Medium ($0.055)

3.08

High ($0.212) - Cinematic establishing shot of a World War II era airfield, a B-17 Flying Fortress parked on the tarmac with all four Wright Cyclone radial engines visible showing the correct nine-cylinder configuration and exhaust collector rings, the distinctive chin turret beneath the nose with twin fifty-caliber barrels, ball turret retracted under the belly, the olive drab paint scheme with nose art of a pin-up girl and mission tally marks below the cockpit window, ground crew loading ammunition belts through the waist gunner hatches, fuel truck connected via a hose to the wing tank, all mechanical details at historically accurate scale, dramatic storm clouds building on the horizon, shot in the style of a Christopher Nolan period film with IMAX-quality clarity and muted color palette
?

High ($0.212)

3.43

prompt-0125 · spread 0.76 · High wins

Digital art of a fantasy blacksmith's forge interior, a massive bellows with correct pleated leather construction and wooden handles, an anvil with...

Low ($0.014) - Digital art of a fantasy blacksmith's forge interior, a massive bellows with correct pleated leather construction and wooden handles, an anvil with proper horn and hardy hole mounted on an oak stump, dozens of specialized tools hanging on a pegboard wall each with distinct and accurate shapes — ball peen hammer, cross peen, tongs, swages, fullers — a quenching barrel with iron hoops, glowing ingots on a coal bed
?

Low ($0.014)

3.08

Medium ($0.055) - Digital art of a fantasy blacksmith's forge interior, a massive bellows with correct pleated leather construction and wooden handles, an anvil with proper horn and hardy hole mounted on an oak stump, dozens of specialized tools hanging on a pegboard wall each with distinct and accurate shapes — ball peen hammer, cross peen, tongs, swages, fullers — a quenching barrel with iron hoops, glowing ingots on a coal bed
?

Medium ($0.055)

3.44

High ($0.212) - Digital art of a fantasy blacksmith's forge interior, a massive bellows with correct pleated leather construction and wooden handles, an anvil with proper horn and hardy hole mounted on an oak stump, dozens of specialized tools hanging on a pegboard wall each with distinct and accurate shapes — ball peen hammer, cross peen, tongs, swages, fullers — a quenching barrel with iron hoops, glowing ingots on a coal bed
?

High ($0.212)

3.83

prompt-0156 · spread 0.52 · High wins

Fashion editorial shot using a tilt-shift lens to create a selective focus plane across the model's eyes and accessories while the rest falls into...

Low ($0.014) - Fashion editorial shot using a tilt-shift lens to create a selective focus plane across the model's eyes and accessories while the rest falls into creamy blur, the model standing in the center of a long symmetrical corridor in a grand palace, the corridor's repeating arches creating perfect one-point perspective receding to a vanishing point directly behind the model's head, wearing a structured avant-garde outfit with geometric patterns that echo the architectural lines, the tilt-shift effect making the sharp focus band cut diagonally across the frame from the model's left eye to the right hand holding a mirrored clutch that reflects the corridor, Canon TS-E 90mm f/2.8 with full tilt applied, natural light from side windows creating alternating bands of light and shadow across the corridor floor, fashion photography in the style of Tim Walker's architectural location work
?

Low ($0.014)

3.17

Medium ($0.055) - Fashion editorial shot using a tilt-shift lens to create a selective focus plane across the model's eyes and accessories while the rest falls into creamy blur, the model standing in the center of a long symmetrical corridor in a grand palace, the corridor's repeating arches creating perfect one-point perspective receding to a vanishing point directly behind the model's head, wearing a structured avant-garde outfit with geometric patterns that echo the architectural lines, the tilt-shift effect making the sharp focus band cut diagonally across the frame from the model's left eye to the right hand holding a mirrored clutch that reflects the corridor, Canon TS-E 90mm f/2.8 with full tilt applied, natural light from side windows creating alternating bands of light and shadow across the corridor floor, fashion photography in the style of Tim Walker's architectural location work
?

Medium ($0.055)

3.33

High ($0.212) - Fashion editorial shot using a tilt-shift lens to create a selective focus plane across the model's eyes and accessories while the rest falls into creamy blur, the model standing in the center of a long symmetrical corridor in a grand palace, the corridor's repeating arches creating perfect one-point perspective receding to a vanishing point directly behind the model's head, wearing a structured avant-garde outfit with geometric patterns that echo the architectural lines, the tilt-shift effect making the sharp focus band cut diagonally across the frame from the model's left eye to the right hand holding a mirrored clutch that reflects the corridor, Canon TS-E 90mm f/2.8 with full tilt applied, natural light from side windows creating alternating bands of light and shadow across the corridor floor, fashion photography in the style of Tim Walker's architectural location work
?

High ($0.212)

3.68

prompt-0186 · spread 0.99 · High wins

Studio portrait demonstrating exceptional optical quality, a model's face in three-quarter view lit by a single large parabolic reflector creating a...

Low ($0.014) - Studio portrait demonstrating exceptional optical quality, a model's face in three-quarter view lit by a single large parabolic reflector creating a broad, wrapping light with a clean specular transition, shot on a Hasselblad X2D 100C at ISO 64 — the resulting image showing individual eyelashes in tack-sharp focus with the iris revealing radial fibrous structure in the stroma, the catchlight in each eye showing a perfect circular specular from the parabolic modifier, skin texture rendered with forensic detail showing pore structure, fine facial hair, and the natural micro-topography of the skin surface without any retouching, out-of-focus areas demonstrating the medium format's characteristically smooth bokeh with no onion ring artifacts, chromatic aberration completely absent even at the extreme corners of the frame, the tonal range from the specular highlight on the nose bridge to the deepest shadow under the jaw showing a smooth unbroken gradient with no banding
?

Low ($0.014)

2.98

Medium ($0.055) - Studio portrait demonstrating exceptional optical quality, a model's face in three-quarter view lit by a single large parabolic reflector creating a broad, wrapping light with a clean specular transition, shot on a Hasselblad X2D 100C at ISO 64 — the resulting image showing individual eyelashes in tack-sharp focus with the iris revealing radial fibrous structure in the stroma, the catchlight in each eye showing a perfect circular specular from the parabolic modifier, skin texture rendered with forensic detail showing pore structure, fine facial hair, and the natural micro-topography of the skin surface without any retouching, out-of-focus areas demonstrating the medium format's characteristically smooth bokeh with no onion ring artifacts, chromatic aberration completely absent even at the extreme corners of the frame, the tonal range from the specular highlight on the nose bridge to the deepest shadow under the jaw showing a smooth unbroken gradient with no banding
?

Medium ($0.055)

3.78

High ($0.212) - Studio portrait demonstrating exceptional optical quality, a model's face in three-quarter view lit by a single large parabolic reflector creating a broad, wrapping light with a clean specular transition, shot on a Hasselblad X2D 100C at ISO 64 — the resulting image showing individual eyelashes in tack-sharp focus with the iris revealing radial fibrous structure in the stroma, the catchlight in each eye showing a perfect circular specular from the parabolic modifier, skin texture rendered with forensic detail showing pore structure, fine facial hair, and the natural micro-topography of the skin surface without any retouching, out-of-focus areas demonstrating the medium format's characteristically smooth bokeh with no onion ring artifacts, chromatic aberration completely absent even at the extreme corners of the frame, the tonal range from the specular highlight on the nose bridge to the deepest shadow under the jaw showing a smooth unbroken gradient with no banding
?

High ($0.212)

3.97

prompt-0109 · spread 0.12 · High wins

High fashion editorial photograph of a model emerging from a swimming pool at twilight, water cascading off a metallic gold lamé gown that clings to...

Low ($0.014) - High fashion editorial photograph of a model emerging from a swimming pool at twilight, water cascading off a metallic gold lamé gown that clings to the body when wet revealing fabric weight and drape behavior different from dry fabric, hair slicked back with water droplets catching the fading daylight as crystalline pinpoints, the pool water surface disturbed in concentric ripples radiating outward from the model's movement, wet footprints on the travertine pool deck showing the path of approach, underwater pool lights creating a turquoise glow that illuminates the model from below, shot on Nikon Z9 with 70-200mm f/2.8 at 135mm, two Profoto B10 Plus heads with colored gels — amber camera left and cyan camera right — creating a complementary split lighting scheme, art directed in the style of Tim Walker meets Helmut Newton
?

Low ($0.014)

3.10

Medium ($0.055) - High fashion editorial photograph of a model emerging from a swimming pool at twilight, water cascading off a metallic gold lamé gown that clings to the body when wet revealing fabric weight and drape behavior different from dry fabric, hair slicked back with water droplets catching the fading daylight as crystalline pinpoints, the pool water surface disturbed in concentric ripples radiating outward from the model's movement, wet footprints on the travertine pool deck showing the path of approach, underwater pool lights creating a turquoise glow that illuminates the model from below, shot on Nikon Z9 with 70-200mm f/2.8 at 135mm, two Profoto B10 Plus heads with colored gels — amber camera left and cyan camera right — creating a complementary split lighting scheme, art directed in the style of Tim Walker meets Helmut Newton
?

Medium ($0.055)

3.12

High ($0.212) - High fashion editorial photograph of a model emerging from a swimming pool at twilight, water cascading off a metallic gold lamé gown that clings to the body when wet revealing fabric weight and drape behavior different from dry fabric, hair slicked back with water droplets catching the fading daylight as crystalline pinpoints, the pool water surface disturbed in concentric ripples radiating outward from the model's movement, wet footprints on the travertine pool deck showing the path of approach, underwater pool lights creating a turquoise glow that illuminates the model from below, shot on Nikon Z9 with 70-200mm f/2.8 at 135mm, two Profoto B10 Plus heads with colored gels — amber camera left and cyan camera right — creating a complementary split lighting scheme, art directed in the style of Tim Walker meets Helmut Newton
?

High ($0.212)

3.22

prompt-0114 · spread 0.82 · High wins

Full-page illustration of a diverse group of five teenage superheroes standing in a V-formation on a city rooftop, each with distinct body types and...

Low ($0.014) - Full-page illustration of a diverse group of five teenage superheroes standing in a V-formation on a city rooftop, each with distinct body types and proportions: a tall lanky speedster with elongated limbs, a stocky brick-house powerhouse with broad shoulders and thick neck, a lithe acrobat with dancer's build, a heavyset tech genius in powered armor, and an average-build psychic with glowing eyes, each face showing unique ethnic features with correct proportional relationships between eyes nose and mouth, hands in various positions — pointing, fists clenched, holding devices — all with correct finger count and natural joint articulation, hair styles ranging from tight coils to straight to shaved, comic book illustration style with bold inks and cel shading, dramatic low-angle perspective with city skyline behind
?

Low ($0.014)

3.13

Medium ($0.055) - Full-page illustration of a diverse group of five teenage superheroes standing in a V-formation on a city rooftop, each with distinct body types and proportions: a tall lanky speedster with elongated limbs, a stocky brick-house powerhouse with broad shoulders and thick neck, a lithe acrobat with dancer's build, a heavyset tech genius in powered armor, and an average-build psychic with glowing eyes, each face showing unique ethnic features with correct proportional relationships between eyes nose and mouth, hands in various positions — pointing, fists clenched, holding devices — all with correct finger count and natural joint articulation, hair styles ranging from tight coils to straight to shaved, comic book illustration style with bold inks and cel shading, dramatic low-angle perspective with city skyline behind
?

Medium ($0.055)

3.33

High ($0.212) - Full-page illustration of a diverse group of five teenage superheroes standing in a V-formation on a city rooftop, each with distinct body types and proportions: a tall lanky speedster with elongated limbs, a stocky brick-house powerhouse with broad shoulders and thick neck, a lithe acrobat with dancer's build, a heavyset tech genius in powered armor, and an average-build psychic with glowing eyes, each face showing unique ethnic features with correct proportional relationships between eyes nose and mouth, hands in various positions — pointing, fists clenched, holding devices — all with correct finger count and natural joint articulation, hair styles ranging from tight coils to straight to shaved, comic book illustration style with bold inks and cel shading, dramatic low-angle perspective with city skyline behind
?

High ($0.212)

3.95

prompt-0131 · spread 0.39 · High wins

Cinematic wide shot of a busy 1920s speakeasy hidden behind a laundromat front, the camera positioned inside the secret bar looking toward the...

Low ($0.014) - Cinematic wide shot of a busy 1920s speakeasy hidden behind a laundromat front, the camera positioned inside the secret bar looking toward the entrance where a bouncer checks a patron's card through a sliding peephole in a steel-reinforced door, the bar interior fully coherent with a long mahogany counter where a bartender in vest and sleeve garters shakes a cocktail, shelves behind stocked with period-appropriate bottles, four jazz musicians on a small corner stage — upright bass, trumpet, piano, drums — all holding their instruments correctly, round tables with white tablecloths where couples in era-appropriate evening wear sit with cocktail coupes and ashtrays, a ceiling fan slowly rotating above, smoke hazing the room creating atmospheric depth, art deco wall sconces providing warm amber light
?

Low ($0.014)

2.78

Medium ($0.055) - Cinematic wide shot of a busy 1920s speakeasy hidden behind a laundromat front, the camera positioned inside the secret bar looking toward the entrance where a bouncer checks a patron's card through a sliding peephole in a steel-reinforced door, the bar interior fully coherent with a long mahogany counter where a bartender in vest and sleeve garters shakes a cocktail, shelves behind stocked with period-appropriate bottles, four jazz musicians on a small corner stage — upright bass, trumpet, piano, drums — all holding their instruments correctly, round tables with white tablecloths where couples in era-appropriate evening wear sit with cocktail coupes and ashtrays, a ceiling fan slowly rotating above, smoke hazing the room creating atmospheric depth, art deco wall sconces providing warm amber light
?

Medium ($0.055)

2.93

High ($0.212) - Cinematic wide shot of a busy 1920s speakeasy hidden behind a laundromat front, the camera positioned inside the secret bar looking toward the entrance where a bouncer checks a patron's card through a sliding peephole in a steel-reinforced door, the bar interior fully coherent with a long mahogany counter where a bartender in vest and sleeve garters shakes a cocktail, shelves behind stocked with period-appropriate bottles, four jazz musicians on a small corner stage — upright bass, trumpet, piano, drums — all holding their instruments correctly, round tables with white tablecloths where couples in era-appropriate evening wear sit with cocktail coupes and ashtrays, a ceiling fan slowly rotating above, smoke hazing the room creating atmospheric depth, art deco wall sconces providing warm amber light
?

High ($0.212)

3.17

prompt-0095 · spread 0.64 · High wins

Fantasy marketplace built into the branches of an enormous ancient tree, wooden platforms and rope bridges connecting merchant stalls at various...

Low ($0.014) - Fantasy marketplace built into the branches of an enormous ancient tree, wooden platforms and rope bridges connecting merchant stalls at various heights, each platform showing structural supports anchored properly into the trunk and major branches, hanging lanterns and counterweighted pulley systems for lifting goods, painted in a lush painterly style reminiscent of Studio Ghibli background art
?

Low ($0.014)

3.41

Medium ($0.055) - Fantasy marketplace built into the branches of an enormous ancient tree, wooden platforms and rope bridges connecting merchant stalls at various heights, each platform showing structural supports anchored properly into the trunk and major branches, hanging lanterns and counterweighted pulley systems for lifting goods, painted in a lush painterly style reminiscent of Studio Ghibli background art
?

Medium ($0.055)

3.39

High ($0.212) - Fantasy marketplace built into the branches of an enormous ancient tree, wooden platforms and rope bridges connecting merchant stalls at various heights, each platform showing structural supports anchored properly into the trunk and major branches, hanging lanterns and counterweighted pulley systems for lifting goods, painted in a lush painterly style reminiscent of Studio Ghibli background art
?

High ($0.212)

4.03

prompt-0126 · spread 0.28 · High wins

Studio product photography of a professional espresso machine in brushed stainless steel, front-facing hero shot showing the group head with a...

Low ($0.014) - Studio product photography of a professional espresso machine in brushed stainless steel, front-facing hero shot showing the group head with a bottomless portafilter locked in at the correct angle, the portafilter basket visible underneath with fine perforations in a uniform pattern, steam wand on the right side with a multi-hole tip and rubber grip sleeve, pressure gauge showing the needle at nine bars on a correctly numbered dial face, cup warmer tray on top with two white ceramic espresso cups inverted, water reservoir visible through a tinted window on the side showing the water level, drip tray with removable grate, backlit brand logo centered above the group head, shot on a medium grey seamless background with a large overhead softbox and two strip lights for metallic edge definition, Fujifilm GFX 50S II with 110mm f/2 macro
?

Low ($0.014)

2.65

Medium ($0.055) - Studio product photography of a professional espresso machine in brushed stainless steel, front-facing hero shot showing the group head with a bottomless portafilter locked in at the correct angle, the portafilter basket visible underneath with fine perforations in a uniform pattern, steam wand on the right side with a multi-hole tip and rubber grip sleeve, pressure gauge showing the needle at nine bars on a correctly numbered dial face, cup warmer tray on top with two white ceramic espresso cups inverted, water reservoir visible through a tinted window on the side showing the water level, drip tray with removable grate, backlit brand logo centered above the group head, shot on a medium grey seamless background with a large overhead softbox and two strip lights for metallic edge definition, Fujifilm GFX 50S II with 110mm f/2 macro
?

Medium ($0.055)

2.85

High ($0.212) - Studio product photography of a professional espresso machine in brushed stainless steel, front-facing hero shot showing the group head with a bottomless portafilter locked in at the correct angle, the portafilter basket visible underneath with fine perforations in a uniform pattern, steam wand on the right side with a multi-hole tip and rubber grip sleeve, pressure gauge showing the needle at nine bars on a correctly numbered dial face, cup warmer tray on top with two white ceramic espresso cups inverted, water reservoir visible through a tinted window on the side showing the water level, drip tray with removable grate, backlit brand logo centered above the group head, shot on a medium grey seamless background with a large overhead softbox and two strip lights for metallic edge definition, Fujifilm GFX 50S II with 110mm f/2 macro
?

High ($0.212)

2.93

prompt-0098 · spread 0.45 · High wins

Whimsical illustration of a mouse family's treehouse home built inside a hollow oak, cross-section view showing multiple floors connected by tiny...

Low ($0.014) - Whimsical illustration of a mouse family's treehouse home built inside a hollow oak, cross-section view showing multiple floors connected by tiny staircases, each floor structurally supported by internal branch growth, miniature furniture at correct mouse scale, acorn cap bowls on a twig table, leaf curtains in the windows, children's book illustration style with warm watercolor tones
?

Low ($0.014)

3.75

Medium ($0.055) - Whimsical illustration of a mouse family's treehouse home built inside a hollow oak, cross-section view showing multiple floors connected by tiny staircases, each floor structurally supported by internal branch growth, miniature furniture at correct mouse scale, acorn cap bowls on a twig table, leaf curtains in the windows, children's book illustration style with warm watercolor tones
?

Medium ($0.055)

3.98

High ($0.212) - Whimsical illustration of a mouse family's treehouse home built inside a hollow oak, cross-section view showing multiple floors connected by tiny staircases, each floor structurally supported by internal branch growth, miniature furniture at correct mouse scale, acorn cap bowls on a twig table, leaf curtains in the windows, children's book illustration style with warm watercolor tones
?

High ($0.212)

4.20

prompt-0121 · spread 0.82 · High wins

Commercial product photograph of a luxury Swiss automatic watch on a polished obsidian surface, the dial showing correct hour marker placement at all...

Low ($0.014) - Commercial product photograph of a luxury Swiss automatic watch on a polished obsidian surface, the dial showing correct hour marker placement at all twelve positions with matching lume pip sizes, three sub-dials for chronograph functions with properly scaled subsidiary hands, date window at three o'clock magnified by cyclops lens showing the number 15 in correct font, crown and two pushers on the right side with knurled grip texture at accurate scale, exhibition caseback revealing the decorated movement with Geneva stripes on the rotor and blued steel screws in the bridge plates, bracelet links showing progressive size reduction from the case to the clasp, shot with focus stacking on a Fujifilm GFX 100S with 120mm GF Macro at f/5.6, Broncolor Siros strobe with strip softbox creating a clean specular highlight following the case contour
?

Low ($0.014)

2.33

Medium ($0.055) - Commercial product photograph of a luxury Swiss automatic watch on a polished obsidian surface, the dial showing correct hour marker placement at all twelve positions with matching lume pip sizes, three sub-dials for chronograph functions with properly scaled subsidiary hands, date window at three o'clock magnified by cyclops lens showing the number 15 in correct font, crown and two pushers on the right side with knurled grip texture at accurate scale, exhibition caseback revealing the decorated movement with Geneva stripes on the rotor and blued steel screws in the bridge plates, bracelet links showing progressive size reduction from the case to the clasp, shot with focus stacking on a Fujifilm GFX 100S with 120mm GF Macro at f/5.6, Broncolor Siros strobe with strip softbox creating a clean specular highlight following the case contour
?

Medium ($0.055)

2.90

High ($0.212) - Commercial product photograph of a luxury Swiss automatic watch on a polished obsidian surface, the dial showing correct hour marker placement at all twelve positions with matching lume pip sizes, three sub-dials for chronograph functions with properly scaled subsidiary hands, date window at three o'clock magnified by cyclops lens showing the number 15 in correct font, crown and two pushers on the right side with knurled grip texture at accurate scale, exhibition caseback revealing the decorated movement with Geneva stripes on the rotor and blued steel screws in the bridge plates, bracelet links showing progressive size reduction from the case to the clasp, shot with focus stacking on a Fujifilm GFX 100S with 120mm GF Macro at f/5.6, Broncolor Siros strobe with strip softbox creating a clean specular highlight following the case contour
?

High ($0.212)

3.15

prompt-0178 · spread 0.68 · High wins

High-fashion cinematic photograph of a model standing in an immense field of lavender in Provence at the moment the sun dips below the horizon, the...

Low ($0.014) - High-fashion cinematic photograph of a model standing in an immense field of lavender in Provence at the moment the sun dips below the horizon, the entire scene bathed in the last seconds of direct golden light that makes the lavender rows glow in alternating stripes of violet and amber shadow, the model wearing a diaphanous ivory organza gown that becomes translucent when backlit by the low sun revealing the silhouette beneath, the fabric billowing dramatically to the right caught by the Mistral wind, hair similarly wind-swept creating dynamic flowing shapes, the composition placing the model at the intersection of converging lavender rows creating strong one-point perspective, the sky above transitioning through a complete warm spectrum from gold at the horizon through salmon and rose to deep violet overhead, Hasselblad X2D with XCD 90mm f/2.5 creating medium format rendering with creamy dimensional separation, the overall aesthetic referencing Terrence Malick's natural light philosophy filtered through a high-fashion editorial sensibility
?

Low ($0.014)

3.05

Medium ($0.055) - High-fashion cinematic photograph of a model standing in an immense field of lavender in Provence at the moment the sun dips below the horizon, the entire scene bathed in the last seconds of direct golden light that makes the lavender rows glow in alternating stripes of violet and amber shadow, the model wearing a diaphanous ivory organza gown that becomes translucent when backlit by the low sun revealing the silhouette beneath, the fabric billowing dramatically to the right caught by the Mistral wind, hair similarly wind-swept creating dynamic flowing shapes, the composition placing the model at the intersection of converging lavender rows creating strong one-point perspective, the sky above transitioning through a complete warm spectrum from gold at the horizon through salmon and rose to deep violet overhead, Hasselblad X2D with XCD 90mm f/2.5 creating medium format rendering with creamy dimensional separation, the overall aesthetic referencing Terrence Malick's natural light philosophy filtered through a high-fashion editorial sensibility
?

Medium ($0.055)

3.48

High ($0.212) - High-fashion cinematic photograph of a model standing in an immense field of lavender in Provence at the moment the sun dips below the horizon, the entire scene bathed in the last seconds of direct golden light that makes the lavender rows glow in alternating stripes of violet and amber shadow, the model wearing a diaphanous ivory organza gown that becomes translucent when backlit by the low sun revealing the silhouette beneath, the fabric billowing dramatically to the right caught by the Mistral wind, hair similarly wind-swept creating dynamic flowing shapes, the composition placing the model at the intersection of converging lavender rows creating strong one-point perspective, the sky above transitioning through a complete warm spectrum from gold at the horizon through salmon and rose to deep violet overhead, Hasselblad X2D with XCD 90mm f/2.5 creating medium format rendering with creamy dimensional separation, the overall aesthetic referencing Terrence Malick's natural light philosophy filtered through a high-fashion editorial sensibility
?

High ($0.212)

3.73

prompt-0112 · spread 0.30 · High wins

Editorial fashion photograph of a model in a flowing crimson silk gown standing at the edge of an infinity pool overlooking Santorini at golden hour,...

Low ($0.014) - Editorial fashion photograph of a model in a flowing crimson silk gown standing at the edge of an infinity pool overlooking Santorini at golden hour, wind catching the fabric creating dynamic flowing shapes while the bodice remains structured and fitted showing correct tailoring darts, the model's pose shows natural weight distribution with one hip shifted creating an authentic contrapposto, visible collarbones and shoulder anatomy proportional to body frame, hands relaxed at sides with naturally curved fingers and visible fingernails, face showing subtle makeup with individually identifiable eyelashes, white-washed buildings with blue domes in the background at correct atmospheric perspective scale, Canon EOS R5 with 85mm f/1.2 lens, shallow depth of field isolating the subject, warm Mediterranean light casting long shadows
?

Low ($0.014)

3.20

Medium ($0.055) - Editorial fashion photograph of a model in a flowing crimson silk gown standing at the edge of an infinity pool overlooking Santorini at golden hour, wind catching the fabric creating dynamic flowing shapes while the bodice remains structured and fitted showing correct tailoring darts, the model's pose shows natural weight distribution with one hip shifted creating an authentic contrapposto, visible collarbones and shoulder anatomy proportional to body frame, hands relaxed at sides with naturally curved fingers and visible fingernails, face showing subtle makeup with individually identifiable eyelashes, white-washed buildings with blue domes in the background at correct atmospheric perspective scale, Canon EOS R5 with 85mm f/1.2 lens, shallow depth of field isolating the subject, warm Mediterranean light casting long shadows
?

Medium ($0.055)

3.37

High ($0.212) - Editorial fashion photograph of a model in a flowing crimson silk gown standing at the edge of an infinity pool overlooking Santorini at golden hour, wind catching the fabric creating dynamic flowing shapes while the bodice remains structured and fitted showing correct tailoring darts, the model's pose shows natural weight distribution with one hip shifted creating an authentic contrapposto, visible collarbones and shoulder anatomy proportional to body frame, hands relaxed at sides with naturally curved fingers and visible fingernails, face showing subtle makeup with individually identifiable eyelashes, white-washed buildings with blue domes in the background at correct atmospheric perspective scale, Canon EOS R5 with 85mm f/1.2 lens, shallow depth of field isolating the subject, warm Mediterranean light casting long shadows
?

High ($0.212)

3.50

prompt-0123 · spread 0.43 · High wins

Flat lay of a complete professional photographer's kit: a Canon EOS R5 body with visible mode dial markings, RF 24-70mm f/2.8 lens with correct filter...

Low ($0.014) - Flat lay of a complete professional photographer's kit: a Canon EOS R5 body with visible mode dial markings, RF 24-70mm f/2.8 lens with correct filter thread size and focus distance window, two CFexpress cards showing pin arrays, a battery with correct contact placement, a lens cleaning pen, a rocket blower, and a camera strap with embossed logo, all arranged in a Pelican case with custom foam cutouts
?

Low ($0.014)

3.10

Medium ($0.055) - Flat lay of a complete professional photographer's kit: a Canon EOS R5 body with visible mode dial markings, RF 24-70mm f/2.8 lens with correct filter thread size and focus distance window, two CFexpress cards showing pin arrays, a battery with correct contact placement, a lens cleaning pen, a rocket blower, and a camera strap with embossed logo, all arranged in a Pelican case with custom foam cutouts
?

Medium ($0.055)

3.27

High ($0.212) - Flat lay of a complete professional photographer's kit: a Canon EOS R5 body with visible mode dial markings, RF 24-70mm f/2.8 lens with correct filter thread size and focus distance window, two CFexpress cards showing pin arrays, a battery with correct contact placement, a lens cleaning pen, a rocket blower, and a camera strap with embossed logo, all arranged in a Pelican case with custom foam cutouts
?

High ($0.212)

3.53

prompt-0158 · spread 0.53 · High wins

Architectural visualization 3D render using a vertical cutaway section view of a five-story modern apartment building, slicing through the center to...

Low ($0.014) - Architectural visualization 3D render using a vertical cutaway section view of a five-story modern apartment building, slicing through the center to reveal all floors simultaneously like a dollhouse, each apartment showing a different resident's lifestyle — ground floor: a young couple's minimalist studio with open kitchen, second floor: a family apartment with children's toys scattered and a baby crib in the bedroom, third floor: an elderly person's traditional-decorated flat with heavy curtains and bookshelves, fourth floor: a work-from-home professional's loft with multiple monitors and a standing desk, penthouse: a luxury unit with double-height ceilings and a rooftop terrace, consistent structural elements running through all floors — load-bearing walls aligning vertically, plumbing stacks in the same position, stairwell on the right side, each floor at correct ceiling height with visible floor slabs, rendered in V-Ray with section line highlighted in red
?

Low ($0.014)

2.72

Medium ($0.055) - Architectural visualization 3D render using a vertical cutaway section view of a five-story modern apartment building, slicing through the center to reveal all floors simultaneously like a dollhouse, each apartment showing a different resident's lifestyle — ground floor: a young couple's minimalist studio with open kitchen, second floor: a family apartment with children's toys scattered and a baby crib in the bedroom, third floor: an elderly person's traditional-decorated flat with heavy curtains and bookshelves, fourth floor: a work-from-home professional's loft with multiple monitors and a standing desk, penthouse: a luxury unit with double-height ceilings and a rooftop terrace, consistent structural elements running through all floors — load-bearing walls aligning vertically, plumbing stacks in the same position, stairwell on the right side, each floor at correct ceiling height with visible floor slabs, rendered in V-Ray with section line highlighted in red
?

Medium ($0.055)

2.85

High ($0.212) - Architectural visualization 3D render using a vertical cutaway section view of a five-story modern apartment building, slicing through the center to reveal all floors simultaneously like a dollhouse, each apartment showing a different resident's lifestyle — ground floor: a young couple's minimalist studio with open kitchen, second floor: a family apartment with children's toys scattered and a baby crib in the bedroom, third floor: an elderly person's traditional-decorated flat with heavy curtains and bookshelves, fourth floor: a work-from-home professional's loft with multiple monitors and a standing desk, penthouse: a luxury unit with double-height ceilings and a rooftop terrace, consistent structural elements running through all floors — load-bearing walls aligning vertically, plumbing stacks in the same position, stairwell on the right side, each floor at correct ceiling height with visible floor slabs, rendered in V-Ray with section line highlighted in red
?

High ($0.212)

3.25

prompt-0174 · spread 0.25 · High wins

Baroque-inspired oil painting portrait of a contemporary Black woman posed in the style of Vermeer's Girl with a Pearl Earring, wearing a modern...

Low ($0.014) - Baroque-inspired oil painting portrait of a contemporary Black woman posed in the style of Vermeer's Girl with a Pearl Earring, wearing a modern interpretation of the turban in electric blue African wax print fabric, a single oversized gold hoop earring catching warm candlelight instead of a pearl, the characteristic Vermeer over-the-shoulder gaze and parted lips perfectly recreated, skin rendered with the luminous glazing technique of the Dutch masters showing warm undertones glowing through cooler surface tones, the background a smooth gradient from deep umber to warm ochre, dramatic chiaroscuro with a single key light source from the upper left creating a soft transition from highlight to shadow across the cheekbone, visible canvas texture and brushwork in the impasto highlights on the earring and fabric while shadows are built from transparent glazes, the painting style precisely mimicking seventeenth-century technique while the subject is unmistakably contemporary
?

Low ($0.014)

3.80

Medium ($0.055) - Baroque-inspired oil painting portrait of a contemporary Black woman posed in the style of Vermeer's Girl with a Pearl Earring, wearing a modern interpretation of the turban in electric blue African wax print fabric, a single oversized gold hoop earring catching warm candlelight instead of a pearl, the characteristic Vermeer over-the-shoulder gaze and parted lips perfectly recreated, skin rendered with the luminous glazing technique of the Dutch masters showing warm undertones glowing through cooler surface tones, the background a smooth gradient from deep umber to warm ochre, dramatic chiaroscuro with a single key light source from the upper left creating a soft transition from highlight to shadow across the cheekbone, visible canvas texture and brushwork in the impasto highlights on the earring and fabric while shadows are built from transparent glazes, the painting style precisely mimicking seventeenth-century technique while the subject is unmistakably contemporary
?

Medium ($0.055)

3.83

High ($0.212) - Baroque-inspired oil painting portrait of a contemporary Black woman posed in the style of Vermeer's Girl with a Pearl Earring, wearing a modern interpretation of the turban in electric blue African wax print fabric, a single oversized gold hoop earring catching warm candlelight instead of a pearl, the characteristic Vermeer over-the-shoulder gaze and parted lips perfectly recreated, skin rendered with the luminous glazing technique of the Dutch masters showing warm undertones glowing through cooler surface tones, the background a smooth gradient from deep umber to warm ochre, dramatic chiaroscuro with a single key light source from the upper left creating a soft transition from highlight to shadow across the cheekbone, visible canvas texture and brushwork in the impasto highlights on the earring and fabric while shadows are built from transparent glazes, the painting style precisely mimicking seventeenth-century technique while the subject is unmistakably contemporary
?

High ($0.212)

4.05

prompt-0135 · spread 0.25 · High wins

Anime scene of a high school cultural festival, a crowded hallway with students in costumes running booths — a takoyaki stand with a girl flipping...

Low ($0.014) - Anime scene of a high school cultural festival, a crowded hallway with students in costumes running booths — a takoyaki stand with a girl flipping octopus balls on a griddle, a haunted house entrance with spooky decorations, a maid café with students taking orders, hand-painted banners overhead with Japanese text, consistent perspective as the hallway recedes with students getting smaller proportionally
?

Low ($0.014)

3.58

Medium ($0.055) - Anime scene of a high school cultural festival, a crowded hallway with students in costumes running booths — a takoyaki stand with a girl flipping octopus balls on a griddle, a haunted house entrance with spooky decorations, a maid café with students taking orders, hand-painted banners overhead with Japanese text, consistent perspective as the hallway recedes with students getting smaller proportionally
?

Medium ($0.055)

3.76

High ($0.212) - Anime scene of a high school cultural festival, a crowded hallway with students in costumes running booths — a takoyaki stand with a girl flipping octopus balls on a griddle, a haunted house entrance with spooky decorations, a maid café with students taking orders, hand-painted banners overhead with Japanese text, consistent perspective as the hallway recedes with students getting smaller proportionally
?

High ($0.212)

3.82

prompt-0164 · spread 0.47 · High wins

Digital character art of a fantasy ranger standing in a forest clearing, the character must have the following specific attributes: female with medium...

Low ($0.014) - Digital character art of a fantasy ranger standing in a forest clearing, the character must have the following specific attributes: female with medium brown skin and long silver-white hair in a single braid over the right shoulder, wearing dark green leather armor with no shoulder pauldrons leaving the arms bare, a wooden longbow held in the left hand with the string undrawn, a quiver of arrows on the back containing exactly seven visible arrow shafts with white fletching, a brown leather belt with a single short dagger in a sheath on the left hip and no other weapons, no cape or cloak, barefoot standing on moss-covered ground, a small red fox sitting at her right side looking up at her, no other animals present, afternoon forest light dappling through the canopy above, the character should not have any facial markings tattoos or scars
?

Low ($0.014)

3.45

Medium ($0.055) - Digital character art of a fantasy ranger standing in a forest clearing, the character must have the following specific attributes: female with medium brown skin and long silver-white hair in a single braid over the right shoulder, wearing dark green leather armor with no shoulder pauldrons leaving the arms bare, a wooden longbow held in the left hand with the string undrawn, a quiver of arrows on the back containing exactly seven visible arrow shafts with white fletching, a brown leather belt with a single short dagger in a sheath on the left hip and no other weapons, no cape or cloak, barefoot standing on moss-covered ground, a small red fox sitting at her right side looking up at her, no other animals present, afternoon forest light dappling through the canopy above, the character should not have any facial markings tattoos or scars
?

Medium ($0.055)

3.10

High ($0.212) - Digital character art of a fantasy ranger standing in a forest clearing, the character must have the following specific attributes: female with medium brown skin and long silver-white hair in a single braid over the right shoulder, wearing dark green leather armor with no shoulder pauldrons leaving the arms bare, a wooden longbow held in the left hand with the string undrawn, a quiver of arrows on the back containing exactly seven visible arrow shafts with white fletching, a brown leather belt with a single short dagger in a sheath on the left hip and no other weapons, no cape or cloak, barefoot standing on moss-covered ground, a small red fox sitting at her right side looking up at her, no other animals present, afternoon forest light dappling through the canopy above, the character should not have any facial markings tattoos or scars
?

High ($0.212)

3.57

prompt-0137 · spread 0.35 · High wins

A farmer's market on a sunny Saturday morning, white canopy vendor stalls arranged in two rows with colorful seasonal produce displayed in wooden...

Low ($0.014) - A farmer's market on a sunny Saturday morning, white canopy vendor stalls arranged in two rows with colorful seasonal produce displayed in wooden crates at correct heights, shoppers carrying reusable tote bags browsing between stalls, a busker playing acoustic guitar near the entrance with an open case for tips, a chalkboard sign reading FRESH & LOCAL at the main arch, consistent outdoor shadows falling to the northwest suggesting mid-morning sun, a golden retriever on a leash waiting patiently beside its owner
?

Low ($0.014)

3.68

Medium ($0.055) - A farmer's market on a sunny Saturday morning, white canopy vendor stalls arranged in two rows with colorful seasonal produce displayed in wooden crates at correct heights, shoppers carrying reusable tote bags browsing between stalls, a busker playing acoustic guitar near the entrance with an open case for tips, a chalkboard sign reading FRESH & LOCAL at the main arch, consistent outdoor shadows falling to the northwest suggesting mid-morning sun, a golden retriever on a leash waiting patiently beside its owner
?

Medium ($0.055)

3.91

High ($0.212) - A farmer's market on a sunny Saturday morning, white canopy vendor stalls arranged in two rows with colorful seasonal produce displayed in wooden crates at correct heights, shoppers carrying reusable tote bags browsing between stalls, a busker playing acoustic guitar near the entrance with an open case for tips, a chalkboard sign reading FRESH & LOCAL at the main arch, consistent outdoor shadows falling to the northwest suggesting mid-morning sun, a golden retriever on a leash waiting patiently beside its owner
?

High ($0.212)

4.03

Medium tier wins (4)

Prompts where medium ($0.055) outperformed both neighbours — often the sweet spot on prompts with moderate complexity.

prompt-0165 · spread 0.22 · Medium wins

Children's book illustration of a birthday party scene with exactly six children sitting around a table, a cake in the center with seven lit candles...

Low ($0.014) - Children's book illustration of a birthday party scene with exactly six children sitting around a table, a cake in the center with seven lit candles and no text on the cake, the birthday child wearing a pointed party hat and sitting at the head of the table, three wrapped presents stacked on the floor to the left — one red, one blue, one yellow — a banner hanging above reading HAPPY BIRTHDAY
?

Low ($0.014)

3.02

Medium ($0.055) - Children's book illustration of a birthday party scene with exactly six children sitting around a table, a cake in the center with seven lit candles and no text on the cake, the birthday child wearing a pointed party hat and sitting at the head of the table, three wrapped presents stacked on the floor to the left — one red, one blue, one yellow — a banner hanging above reading HAPPY BIRTHDAY
?

Medium ($0.055)

3.23

High ($0.212) - Children's book illustration of a birthday party scene with exactly six children sitting around a table, a cake in the center with seven lit candles and no text on the cake, the birthday child wearing a pointed party hat and sitting at the head of the table, three wrapped presents stacked on the floor to the left — one red, one blue, one yellow — a banner hanging above reading HAPPY BIRTHDAY
?

High ($0.212)

3.08

prompt-0089 · spread 0.33 · Medium wins

Editorial dance photography of a contemporary ballet performer executing a grand jeté in an abandoned subway station, body forming a perfect split in...

Low ($0.014) - Editorial dance photography of a contemporary ballet performer executing a grand jeté in an abandoned subway station, body forming a perfect split in mid-air with front leg extended to one hundred eighty degrees and pointed toes, arms in a lyrical port de bras reaching forward, neck elongated with gaze following the leading hand, wearing a deconstructed tutu and flesh-toned bodystocking, shot on Hasselblad H6D with 80mm lens at f/2.8, mixed lighting from fluorescent station tubes and portable LED panels creating cyan-magenta split toning, slight motion blur on the extremities conveying velocity, concrete pillars framing the composition
?

Low ($0.014)

3.18

Medium ($0.055) - Editorial dance photography of a contemporary ballet performer executing a grand jeté in an abandoned subway station, body forming a perfect split in mid-air with front leg extended to one hundred eighty degrees and pointed toes, arms in a lyrical port de bras reaching forward, neck elongated with gaze following the leading hand, wearing a deconstructed tutu and flesh-toned bodystocking, shot on Hasselblad H6D with 80mm lens at f/2.8, mixed lighting from fluorescent station tubes and portable LED panels creating cyan-magenta split toning, slight motion blur on the extremities conveying velocity, concrete pillars framing the composition
?

Medium ($0.055)

3.51

High ($0.212) - Editorial dance photography of a contemporary ballet performer executing a grand jeté in an abandoned subway station, body forming a perfect split in mid-air with front leg extended to one hundred eighty degrees and pointed toes, arms in a lyrical port de bras reaching forward, neck elongated with gaze following the leading hand, wearing a deconstructed tutu and flesh-toned bodystocking, shot on Hasselblad H6D with 80mm lens at f/2.8, mixed lighting from fluorescent station tubes and portable LED panels creating cyan-magenta split toning, slight motion blur on the extremities conveying velocity, concrete pillars framing the composition
?

High ($0.212)

3.37

prompt-0182 · spread 0.30 · Medium wins

Cinematic night scene shot with available light only — a woman reading a book by candlelight in a 17th century Dutch interior, the image quality...

Low ($0.014) - Cinematic night scene shot with available light only — a woman reading a book by candlelight in a 17th century Dutch interior, the image quality demonstrating extraordinary dynamic range with the candle flame properly exposed as warm white without clipping while simultaneously rendering detail in the deep shadows of the room's corners, the woman's face illuminated by the warm glow showing individual pores and peach fuzz caught in the rim light, the open book's pages showing legible text with crisp serif letterforms, fabric of her period dress rendered with texture detail showing individual linen weave threads in the highlight areas, the wooden table surface showing rich grain detail in the midtones, zero chromatic noise in the shadow regions with clean smooth gradients, candlelight creating a physically accurate inverse-square falloff, graded to match the tonal characteristics of Vermeer's paintings
?

Low ($0.014)

3.33

Medium ($0.055) - Cinematic night scene shot with available light only — a woman reading a book by candlelight in a 17th century Dutch interior, the image quality demonstrating extraordinary dynamic range with the candle flame properly exposed as warm white without clipping while simultaneously rendering detail in the deep shadows of the room's corners, the woman's face illuminated by the warm glow showing individual pores and peach fuzz caught in the rim light, the open book's pages showing legible text with crisp serif letterforms, fabric of her period dress rendered with texture detail showing individual linen weave threads in the highlight areas, the wooden table surface showing rich grain detail in the midtones, zero chromatic noise in the shadow regions with clean smooth gradients, candlelight creating a physically accurate inverse-square falloff, graded to match the tonal characteristics of Vermeer's paintings
?

Medium ($0.055)

3.63

High ($0.212) - Cinematic night scene shot with available light only — a woman reading a book by candlelight in a 17th century Dutch interior, the image quality demonstrating extraordinary dynamic range with the candle flame properly exposed as warm white without clipping while simultaneously rendering detail in the deep shadows of the room's corners, the woman's face illuminated by the warm glow showing individual pores and peach fuzz caught in the rim light, the open book's pages showing legible text with crisp serif letterforms, fabric of her period dress rendered with texture detail showing individual linen weave threads in the highlight areas, the wooden table surface showing rich grain detail in the midtones, zero chromatic noise in the shadow regions with clean smooth gradients, candlelight creating a physically accurate inverse-square falloff, graded to match the tonal characteristics of Vermeer's paintings
?

High ($0.212)

3.35

prompt-0111 · spread 0.27 · Medium wins

Cinematic portrait of a weathered deep-sea fishing captain standing at the helm of his trawler during golden hour, face deeply tanned with authentic...

Low ($0.014) - Cinematic portrait of a weathered deep-sea fishing captain standing at the helm of his trawler during golden hour, face deeply tanned with authentic crow's feet and sun damage, salt-and-pepper beard trimmed short with individual whisker detail, pale blue eyes with visible blood vessels in the sclera, wearing a faded navy cable-knit sweater with realistic wool texture and minor pilling, large calloused hands gripping the wooden ship's wheel with anatomically correct finger placement showing prominent knuckles and visible tendons, a thin scar running from the left eyebrow to the temple, ears proportional to head size with natural lobes, shot on ARRI Alexa with Panavision C-Series anamorphic lens creating characteristic oval bokeh, warm key light from the setting sun camera right with cool bounce fill from the ocean surface camera left
?

Low ($0.014)

3.40

Medium ($0.055) - Cinematic portrait of a weathered deep-sea fishing captain standing at the helm of his trawler during golden hour, face deeply tanned with authentic crow's feet and sun damage, salt-and-pepper beard trimmed short with individual whisker detail, pale blue eyes with visible blood vessels in the sclera, wearing a faded navy cable-knit sweater with realistic wool texture and minor pilling, large calloused hands gripping the wooden ship's wheel with anatomically correct finger placement showing prominent knuckles and visible tendons, a thin scar running from the left eyebrow to the temple, ears proportional to head size with natural lobes, shot on ARRI Alexa with Panavision C-Series anamorphic lens creating characteristic oval bokeh, warm key light from the setting sun camera right with cool bounce fill from the ocean surface camera left
?

Medium ($0.055)

3.67

High ($0.212) - Cinematic portrait of a weathered deep-sea fishing captain standing at the helm of his trawler during golden hour, face deeply tanned with authentic crow's feet and sun damage, salt-and-pepper beard trimmed short with individual whisker detail, pale blue eyes with visible blood vessels in the sclera, wearing a faded navy cable-knit sweater with realistic wool texture and minor pilling, large calloused hands gripping the wooden ship's wheel with anatomically correct finger placement showing prominent knuckles and visible tendons, a thin scar running from the left eyebrow to the temple, ears proportional to head size with natural lobes, shot on ARRI Alexa with Panavision C-Series anamorphic lens creating characteristic oval bokeh, warm key light from the setting sun camera right with cool bounce fill from the ocean surface camera left
?

High ($0.212)

3.59

Low tier wins (3)

Prompts where the cheapest tier ($0.014) produced the strongest output — typically simpler scenes where extra compute introduced artefacts rather than detail.

prompt-0138 · spread 0.48 · Low wins

Children's book double-page spread illustration of a magical bakery where enchanted kitchen utensils work autonomously, a wooden spoon stirring batter...

Low ($0.014) - Children's book double-page spread illustration of a magical bakery where enchanted kitchen utensils work autonomously, a wooden spoon stirring batter in a copper mixing bowl that sits on a flour-dusted counter, a rolling pin flattening dough with visible pressure marks, oven mitts carrying a hot tray of star-shaped cookies leaving the open oven where more cookies are visible on the rack inside, a sugar shaker sprinkling powdered sugar that falls in a believable arc, measuring cups marching in single file from smallest to largest like Russian nesting dolls, the elderly baker sitting in a rocking chair reading a book while her enchanted kitchen works, a cat sleeping undisturbed on the windowsill, through the window a snowy village scene with smoke rising from chimneys, warm interior lighting contrasting with cool blue exterior, whimsical illustration style with rounded shapes and soft textures, reminiscent of Mary Blair and Charley Harper
?

Low ($0.014)

3.68

Medium ($0.055) - Children's book double-page spread illustration of a magical bakery where enchanted kitchen utensils work autonomously, a wooden spoon stirring batter in a copper mixing bowl that sits on a flour-dusted counter, a rolling pin flattening dough with visible pressure marks, oven mitts carrying a hot tray of star-shaped cookies leaving the open oven where more cookies are visible on the rack inside, a sugar shaker sprinkling powdered sugar that falls in a believable arc, measuring cups marching in single file from smallest to largest like Russian nesting dolls, the elderly baker sitting in a rocking chair reading a book while her enchanted kitchen works, a cat sleeping undisturbed on the windowsill, through the window a snowy village scene with smoke rising from chimneys, warm interior lighting contrasting with cool blue exterior, whimsical illustration style with rounded shapes and soft textures, reminiscent of Mary Blair and Charley Harper
?

Medium ($0.055)

3.20

High ($0.212) - Children's book double-page spread illustration of a magical bakery where enchanted kitchen utensils work autonomously, a wooden spoon stirring batter in a copper mixing bowl that sits on a flour-dusted counter, a rolling pin flattening dough with visible pressure marks, oven mitts carrying a hot tray of star-shaped cookies leaving the open oven where more cookies are visible on the rack inside, a sugar shaker sprinkling powdered sugar that falls in a believable arc, measuring cups marching in single file from smallest to largest like Russian nesting dolls, the elderly baker sitting in a rocking chair reading a book while her enchanted kitchen works, a cat sleeping undisturbed on the windowsill, through the window a snowy village scene with smoke rising from chimneys, warm interior lighting contrasting with cool blue exterior, whimsical illustration style with rounded shapes and soft textures, reminiscent of Mary Blair and Charley Harper
?

High ($0.212)

3.40

prompt-0185 · spread 0.50 · Low wins

Hyper-detailed digital portrait of a cyborg character, the biological half of the face showing pore-level skin detail with individual vellus hairs...

Low ($0.014) - Hyper-detailed digital portrait of a cyborg character, the biological half of the face showing pore-level skin detail with individual vellus hairs visible, the mechanical half showing individually modeled micro-servos, fiber optic bundles with visible core-cladding structure, tiny serial number engravings on titanium plates, lens elements in the eye showing internal reflections, 8K texture resolution throughout
?

Low ($0.014)

3.72

Medium ($0.055) - Hyper-detailed digital portrait of a cyborg character, the biological half of the face showing pore-level skin detail with individual vellus hairs visible, the mechanical half showing individually modeled micro-servos, fiber optic bundles with visible core-cladding structure, tiny serial number engravings on titanium plates, lens elements in the eye showing internal reflections, 8K texture resolution throughout
?

Medium ($0.055)

3.67

High ($0.212) - Hyper-detailed digital portrait of a cyborg character, the biological half of the face showing pore-level skin detail with individual vellus hairs visible, the mechanical half showing individually modeled micro-servos, fiber optic bundles with visible core-cladding structure, tiny serial number engravings on titanium plates, lens elements in the eye showing internal reflections, 8K texture resolution throughout
?

High ($0.212)

3.22

prompt-0132 · spread 0.07 · Low wins

Architectural interior photograph of a modern open-concept kitchen flowing into a living dining area, the kitchen featuring a large waterfall-edge...

Low ($0.014) - Architectural interior photograph of a modern open-concept kitchen flowing into a living dining area, the kitchen featuring a large waterfall-edge marble island with bar stools at correct seating height, pendant lights hanging at proportional distances above the island, built-in appliances including a double wall oven flush with cabinetry, range hood ducted into the ceiling, the living area visible beyond with furniture at correct scale relative to the room — a sectional sofa facing a recessed electric fireplace below a wall-mounted television, dining table set for six with place settings at proper spacing, continuous engineered hardwood flooring running throughout with consistent plank direction, large windows showing a garden view with correct exterior lighting matching the interior time of day, shot with a tilt-shift lens to correct vertical perspective convergence, Nikon Z7 II with PC-E 24mm
?

Low ($0.014)

3.75

Medium ($0.055) - Architectural interior photograph of a modern open-concept kitchen flowing into a living dining area, the kitchen featuring a large waterfall-edge marble island with bar stools at correct seating height, pendant lights hanging at proportional distances above the island, built-in appliances including a double wall oven flush with cabinetry, range hood ducted into the ceiling, the living area visible beyond with furniture at correct scale relative to the room — a sectional sofa facing a recessed electric fireplace below a wall-mounted television, dining table set for six with place settings at proper spacing, continuous engineered hardwood flooring running throughout with consistent plank direction, large windows showing a garden view with correct exterior lighting matching the interior time of day, shot with a tilt-shift lens to correct vertical perspective convergence, Nikon Z7 II with PC-E 24mm
?

Medium ($0.055)

3.68

High ($0.212) - Architectural interior photograph of a modern open-concept kitchen flowing into a living dining area, the kitchen featuring a large waterfall-edge marble island with bar stools at correct seating height, pendant lights hanging at proportional distances above the island, built-in appliances including a double wall oven flush with cabinetry, range hood ducted into the ceiling, the living area visible beyond with furniture at correct scale relative to the room — a sectional sofa facing a recessed electric fireplace below a wall-mounted television, dining table set for six with place settings at proper spacing, continuous engineered hardwood flooring running throughout with consistent plank direction, large windows showing a garden view with correct exterior lighting matching the interior time of day, shot with a tilt-shift lens to correct vertical perspective convergence, Nikon Z7 II with PC-E 24mm
?

High ($0.212)

3.70

So When Should You Use Each Tier?

Low tier ($0.014/image) — use for prototyping and throwaway work

Mean score of 3.17 is meaningfully below medium and high. Use for rapid prototyping, internal review drafts, or large-batch generation where you accept that 90% of images will be visibly worse than the higher tiers. The 15× cost advantage over high is real but you pay for it in quality. Not recommended for client-facing or final-render work.

Medium tier ($0.055/image) — best cost-effective default

Captures most of the high-tier quality (3.36 vs 3.54) at 26% of the cost. The sweet spot for production work where each image matters but you're running at volume. Loses to high tier on 83% of prompts head-to-head, but the gap is small (~0.18 points). If cost-per-score is your optimisation target, medium wins outright.

High tier ($0.212/image) — use for hero shots and detail-critical work

Highest mean score (3.54) and wins 76% of prompts head-to-head. Worth the premium when the brief is "one image, must be the best we can do" — commercial hero shots, detail-critical compositions, prompts demanding precise optical physics or biomechanics. Not worth it for batch work where medium captures enough of the quality.

Methodology

Prompts: 29 prompts drawn from our 200-prompt benchmark suite, selected as the most complex (avg ~750 characters) and where all three tiers successfully generated. Coverage across visual fidelity, physics logic, subject-object integrity, and instruction adherence categories.

Generation: Each prompt generated three times, once at each tier, via Runware's openai:gpt-image@2 endpoint with explicit providerSettings.openai.quality set to low, medium, or high. All images at 1024×1024 PNG. Individual tier attempts that timed out at the provider were excluded along with any prompt missing a complete tier triplet.

Scoring (three independent blind passes): Every image reviewed by Claude Opus 4.7 multimodal vision against a prompt-specific rubric. Each tier received three completely independent judging passes, each by a fresh reviewer with no knowledge of which tier produced the image and no exposure to the other two tiers' renders. Scores in this article are the mean of those three independent passes per image — so each tier's aggregate represents 87 independent judgments (29 prompts × 3 passes), and head-to-head comparisons average 3 paired votes per prompt. This blind triangulation is necessary because earlier tier-context judging (where the same reviewer saw all three tiers together) systematically inflated scores by anchoring tiers against each other; under blind isolation the true tier separation emerges.

Rubric: For each prompt, we first determined the primary quality category (visual fidelity, physics logic, subject-object integrity, or instruction adherence) and assigned weights to the three sub-categories under it. Scores 1–5 per sub-category with visual reasoning, then weighted into a single score per tier. Rubric includes explicit rules for subject-frame directional terms ("left hook" is the subject's left arm, not the viewer's) and for reflected/reversed text in interior glass-viewpoint scenes.

Cost per tier: $0.014 (low), $0.055 (medium), $0.212 (high) per 1024×1024 image. These are observed charges from Runware's billing logs, not list prices — Runware's pricing page quotes a flat $0.006 but the actual charge depends on tier and prompt-token length.

Related Vibedex Benchmarks

Methodology: Rankings and scores in this article are based on VibeDex's independent benchmarks. Models are evaluated by AI-powered judges across multiple quality dimensions with scores weighted by prompt intent. See our full methodology

FAQ

Does GPT Image 2 high quality actually look better than low quality?

Yes, clearly. Across 29 complex prompts judged blind by Claude Opus 4.7 in three independent passes, mean weighted scores ladder cleanly: low 3.17, medium 3.36, high 3.54. Per-prompt, high tier wins 76% of direct comparisons versus 10% for low tier; high beats medium on 83% of prompts and beats low on 90%. The quality difference is real, but the gap from low to high (0.37 points) is much smaller than the 15× price difference, so cost-effectiveness depends on the use case.

How much does GPT Image 2 cost at each quality tier?

At 1024×1024: low tier $0.014/image, medium $0.055/image, high $0.212/image — a 15× price difference between low and high. Runware's pricing page quotes a flat $0.006 but that is not what is actually charged; the real per-image cost depends on quality tier and prompt-token length.

Is GPT Image 2 cheaper than GPT Image 1.5?

At every tier. GPT Image 1.5 costs $0.133/image. GPT Image 2 ranges from $0.014 (low) to $0.212 (high). Low tier is 89% cheaper than 1.5, medium is 59% cheaper, high is 59% more expensive. On our 31-prompt complex subset judged blind, GPT Image 2 high (3.55) lands modestly above GPT Image 1.5’s benchmark range — but the comparison is not apples-to-apples because we judged 1.5 on the full 200-prompt suite under the older Gemini judge.

When should I use high quality for GPT Image 2?

When you care about the best single render. In our 29-prompt review with all three tiers, high tier won 22 prompts (76%) — winning on prompts that demand specific detail-critical features (optical physics, precise biomechanics, dense layouts) AND on simpler scenes where it produced a marginally better render. The premium is most defensible for hero shots and detail-critical work; for high-volume batch generation where you accept average-of-batch quality, medium tier captures most of the quality at 26% of the cost.

Does high tier win on any specific prompt categories?

High tier wins broadly across categories — visual fidelity prompts, instruction adherence, subject-object integrity, and physics logic — though the margin varies. The 4 medium-tier wins and 3 low-tier wins in our sample are scattered across categories, with no clear pattern beyond "sometimes the lower tier just happened to nail this specific composition better." Treat low/medium wins as luck of the draw rather than systematic strengths.

Find the best model for your prompt

VibeDex analyzes your prompt and recommends the best AI image model based on what your specific image demands.

Try VibeDex