Flat multi-object scene (sun, hills, hot-air balloon)
The same task, run on 28 models. Compare the outputs side by side, or open any one in a popup to inspect it.
Top result: claude-opus-4-8 (low reasoning) at 100.0% composite. Lowest: gemini-3.5-flash at 90.0%.
How it ran
Each model was given the brief below in a fresh, isolated session with no access to our tools, and returned its answer from scratch.
The rendered output was scored from 1 to 5 on each of four criteria (brief fidelity, visual design, craft, and impact) by a four-family vision panel - Anthropic (Claude Opus 4.8), OpenAI (GPT-5.5), Google (Gemini 3.1 Pro), and xAI (Grok 4.3) - using one identical prompt so that the scores are comparable. The published judge score is leave-one-family-out: a model is never scored by a judge from its own family, which removes same-family self-preference.
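The leave-one-family-out rule can be sketched in a few lines of Python. This is only an illustration, not the scoring code used here. The judge identifiers, the `leave_one_family_out` helper, and the choice of a plain mean across the remaining judges are all assumptions, and the example scores are invented. How the per-criterion scores are combined into the composite percentage is not stated, so the sketch stops at per-criterion means.

```python
from statistics import mean

# Hypothetical identifiers for the four judges named above, mapped to family.
JUDGE_FAMILY = {
    "claude-opus-4.8": "anthropic",
    "gpt-5.5": "openai",
    "gemini-3.1-pro": "google",
    "grok-4.3": "xai",
}

CRITERIA = ("brief_fidelity", "visual_design", "craft", "impact")


def leave_one_family_out(model_family, judge_scores):
    """Average each criterion over the judges outside `model_family`.

    `judge_scores` maps a judge identifier to a dict of 1-5 scores.
    Using a plain mean is an assumption made for this sketch.
    """
    kept = [
        scores
        for judge, scores in judge_scores.items()
        if JUDGE_FAMILY[judge] != model_family
    ]
    if not kept:
        raise ValueError("no judge left after excluding the model's family")
    return {c: mean(scores[c] for scores in kept) for c in CRITERIA}


# Invented scores for a hypothetical Anthropic-family model.
example_scores = {
    "claude-opus-4.8": dict(brief_fidelity=5, visual_design=5, craft=5, impact=5),
    "gpt-5.5": dict(brief_fidelity=5, visual_design=4, craft=4, impact=4),
    "gemini-3.1-pro": dict(brief_fidelity=5, visual_design=4, craft=5, impact=4),
    "grok-4.3": dict(brief_fidelity=5, visual_design=4, craft=3, impact=4),
}

published = leave_one_family_out("anthropic", example_scores)
```

In this invented example the same-family judge's row of 5s is dropped, so only the three other judges contribute to `published`.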
The brief
Create a flat, multi-object SVG scene with these elements: a yellow sun in the top-left
corner, three green hills along the bottom, and a red hot-air balloon. Use a 200x120
viewBox. Use vector primitives only (paths, rects, circles, ellipses, lines) - no raster
images and no text. Write the scene to `scene.svg`.
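For orientation, here is a minimal sketch of one answer that would satisfy the brief, written as a Python script that builds the SVG and checks it against the stated constraints. It is not any model's actual output. The coordinates, colours, sky background, and the `build_scene` and `used_tags` helpers are all illustrative assumptions.

```python
import xml.etree.ElementTree as ET
from pathlib import Path

SVG_NS = "http://www.w3.org/2000/svg"
# The root element plus the primitives the brief allows.
ALLOWED = {"svg", "path", "rect", "circle", "ellipse", "line"}


def build_scene() -> str:
    """Return one possible flat scene: sun, three hills, red balloon."""
    return f'''<svg xmlns="{SVG_NS}" viewBox="0 0 200 120">
  <rect width="200" height="120" fill="#bfe6ff"/>
  <circle cx="22" cy="20" r="13" fill="#ffd23f"/>
  <ellipse cx="35" cy="122" rx="60" ry="34" fill="#4caf50"/>
  <ellipse cx="105" cy="126" rx="65" ry="42" fill="#388e3c"/>
  <ellipse cx="172" cy="122" rx="55" ry="32" fill="#66bb6a"/>
  <line x1="122" y1="52" x2="126" y2="66" stroke="#5d4037" stroke-width="1"/>
  <line x1="142" y1="52" x2="138" y2="66" stroke="#5d4037" stroke-width="1"/>
  <ellipse cx="132" cy="38" rx="16" ry="19" fill="#e53935"/>
  <rect x="126" y="66" width="12" height="8" fill="#8d6e63"/>
</svg>
'''


def used_tags(svg_text: str) -> set:
    """Return the local (namespace-free) names of every element in the SVG."""
    root = ET.fromstring(svg_text)
    return {el.tag.rsplit("}", 1)[-1] for el in root.iter()}


if __name__ == "__main__":
    svg = build_scene()
    # Fail before writing if anything other than the allowed primitives appears.
    assert used_tags(svg) <= ALLOWED
    Path("scene.svg").write_text(svg, encoding="utf-8")
```

The ropes are drawn before the balloon envelope so that their upper ends are hidden behind it. The hills are ellipses centred just below the bottom edge, so the viewBox clips them into rounded mounds.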