Banja Lab / Benchmarks / Test
The same task, run on 27 models. Compare the outputs side by side, or open any one in a popup to inspect it.
Top result: claude-opus-4-8 (low reasoning) at 100.0% composite. Lowest: grok-build-0.1 at 85.0%. 27 models compared on this task.
This is a benchmarking hypothetical, not tax advice. The figure is as at FY2025-26. State, in dollars, the tax-free threshold for an Australian resident individual for the 2025-26 financial year. This is the amount of taxable income on which a resident pays no income tax before the first marginal rate applies. Give the dollar figure. Name the controlling authority that sets the resident individual income tax rates.