The same task, run on 28 models. Compare the outputs side by side, or open any one in a popup to inspect it.
Top result: claude-opus-4-8 (low reasoning) at 100.0% composite. Lowest: claude-haiku-4-5 at 0.0%.
Draw a vertical traffic light in SVG. Show three lamps stacked top to bottom (red, amber, green). Only the green lamp at the bottom is lit (a bright green); the red and amber lamps are dark / off. Use a 40x100 viewBox. Build it from rectangles and circles only - do not use paths, raster images, or text. Write the result to `light.svg`.
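For orientation, here is a minimal sketch of one output that would satisfy the stated constraints, written as a small Python script that generates `light.svg`. It is not the benchmark's reference answer or its grader. The housing size, lamp coordinates, radii, `id` attributes, and exact hex colors are all assumptions, since the task only specifies the viewBox, the lamp order, which lamp is lit, and the allowed shapes. The helper `allowed_elements_only` is likewise a hypothetical check for the "rectangles and circles only" rule.

```python
import xml.etree.ElementTree as ET

SVG_NS = "http://www.w3.org/2000/svg"

# (id, fill, cy) for each lamp, top to bottom.
# Colors and positions are assumptions: the task only asks for a bright
# green bottom lamp and dark / off red and amber lamps.
LAMPS = [
    ("red", "#3a0d0d", 20),    # dark red, off
    ("amber", "#3a2a0d", 50),  # dark amber, off
    ("green", "#00e64d", 80),  # bright green, lit
]


def build_light() -> str:
    """Return the SVG markup for a traffic light with only green lit."""
    ET.register_namespace("", SVG_NS)  # serialize with a default xmlns
    svg = ET.Element(f"{{{SVG_NS}}}svg", {"viewBox": "0 0 40 100"})
    # Housing: one rounded rectangle behind the lamps.
    ET.SubElement(
        svg,
        f"{{{SVG_NS}}}rect",
        {"x": "4", "y": "4", "width": "32", "height": "92",
         "rx": "6", "fill": "#222222"},
    )
    # Lamps: three circles stacked vertically inside the housing.
    for lamp_id, fill, cy in LAMPS:
        ET.SubElement(
            svg,
            f"{{{SVG_NS}}}circle",
            {"id": lamp_id, "cx": "20", "cy": str(cy), "r": "11", "fill": fill},
        )
    return ET.tostring(svg, encoding="unicode")


def allowed_elements_only(svg_text: str) -> bool:
    """Check that the document uses only <svg>, <rect>, and <circle>."""
    allowed = {f"{{{SVG_NS}}}{name}" for name in ("svg", "rect", "circle")}
    return all(el.tag in allowed for el in ET.fromstring(svg_text).iter())


if __name__ == "__main__":
    with open("light.svg", "w", encoding="utf-8") as f:
        f.write(build_light())
```

Running the script writes `light.svg` to the current directory. Calling `allowed_elements_only` on any model's output gives a quick structural check for disallowed elements such as `<path>`, `<image>`, or `<text>`. It does not check the colors or the layout.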