Banja Lab / Benchmarks / Test
The same task, run on 28 models. Compare the outputs side by side, or open any one in a popup to inspect it.
Top result: claude-opus-4-8 (low reasoning) at 100.0% composite. Lowest: kimi-k2.7-code at 19.7%. 28 models compared on this task.
Build a self-contained tabbed interface as a single HTML file (`index.html`) that renders with no build step and no network calls (inline all CSS and JS). Requirements: - a tab list with exactly three tabs, each a real button with role="tab" and aria-selected reflecting which tab is active (one selected on load), - three matching panels with role="tabpanel"; only the selected tab's panel is visible, the other two are hidden, - clicking a tab selects it and shows its panel while hiding the previously shown panel, so the visible content changes. Use plain, readable, accessible markup. No external fonts or scripts.