The same task, run on 28 models. Compare the outputs side by side, or open any one in a popup to inspect it.
Top result: claude-opus-4-8 (low reasoning) at 100.0% composite. Lowest: claude-haiku-4-5 at 44.8%.
Build a self-contained pricing section as a single HTML file (`index.html`) that renders with no build step and no network calls (inline all CSS and JS).

Requirements:
- three pricing tiers, with the middle tier visibly marked as the most popular,
- a monthly/annual billing toggle implemented as an accessible switch (role="switch", with aria state), and
- clicking the toggle must change every displayed price between its monthly and annual value.

Use plain, readable, accessible markup. No external fonts or scripts.
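
As a rough illustration of the toggle requirement, the inline script could look something like the sketch below. It is not taken from any of the compared model outputs. The tier names, the prices, the `data-price` attribute, and the helper names `nextAriaChecked` and `priceLabels` are all hypothetical. The sketch assumes the switch is a `<button role="switch" aria-checked="false">`, so the browser's native button behavior fires `click` on Enter and Space as well, and that the page has one `[data-price]` element per tier, in tier order.

```javascript
// Hypothetical tier data; names and prices are illustrative only.
const TIERS = [
  { id: "starter", monthly: 9, annual: 90 },
  { id: "pro", monthly: 29, annual: 290, popular: true }, // middle tier
  { id: "team", monthly: 79, annual: 790 },
];

// Flip the switch state. aria-checked is stored as the string "true" or "false".
function nextAriaChecked(current) {
  return current === "true" ? "false" : "true";
}

// Build the display text for every tier under the given billing mode.
function priceLabels(tiers, annual) {
  return tiers.map((t) => (annual ? `$${t.annual}/yr` : `$${t.monthly}/mo`));
}

// DOM wiring; skipped when no document exists (for example under Node).
if (typeof document !== "undefined") {
  const toggle = document.querySelector('[role="switch"]');
  const prices = document.querySelectorAll("[data-price]");
  if (toggle) {
    toggle.addEventListener("click", () => {
      // Update the aria state first, then rewrite every displayed price.
      const state = nextAriaChecked(toggle.getAttribute("aria-checked"));
      toggle.setAttribute("aria-checked", state);
      const labels = priceLabels(TIERS, state === "true");
      prices.forEach((el, i) => {
        if (labels[i] !== undefined) el.textContent = labels[i];
      });
    });
  }
}
```

Keeping the state flip and the label computation in plain functions means both can be checked without a browser, while the click handler stays a thin layer that only reads and writes the DOM.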