Banja Lab / Benchmarks / Test
The same task, run on 28 models. Compare the outputs side by side, or open any one in a popup to inspect it.
Top result: claude-opus-4-8 (low reasoning) at 100.0% composite. Lowest: deepseek-v4-flash, also at 100.0%.
This is a benchmarking hypothetical, not legal advice. The law is as at FY2025-26. An employee works in Victoria for a national-system employer covered by the Fair Work Act 2009 (Cth), and asks how their long service leave entitlement is worked out. Explain which law governs the employee's long service leave entitlement. Be specific about the jurisdiction. Do not give a single national long service leave figure that applies uniformly across Australia, because long service leave is not set that way. Name the level of government whose law applies here.