grok-composer-2.5-fast scored 85.0% composite across 87 tasks - code, UI, full websites, SVG, marketing pages, dashboards, animations, and Australian legal and accounting. Graded by execution, and the visual builds by a cross-family vision panel (leave-one-family-out). Run on 2026-06-25.
Composite score per domain, weakest first. Judge is the vision model’s read, shown for the visual domains.
The actual rendered output. Open any tile to view it in a popup, or compare the same task across every model.
Programming, Australian legal and accounting, graded by execution. 34 of 34 scored a perfect 100.0%. Open the answer in a popup, or compare it across every model.