The same task, run on 28 models, with the outputs compared side by side.
Top result: claude-opus-4-8 (low reasoning) at 100.0% composite. Lowest: deepseek-v4-flash, also at 100.0%, so the reported scores do not separate the models on this task.
This is a benchmarking hypothetical, not tax advice. Figures are as at FY2025-26.

A business reports quarterly on its activity statement (BAS). All sales and purchases are GST-taxable at the standard rate of 10%, so each GST-inclusive amount contains one eleventh of GST.

For the quarter, the given figures are:
- Total sales including GST: $110,000
- Total purchases including GST: $66,000
- Gross wages paid to employees: $40,000
- PAYG withholding rate applied to those wages: 20%

There are no PAYG instalments, fuel tax credits, or other amounts this quarter.

Using the standard BAS labels, where:
- G1 is total sales (GST-inclusive),
- 1A is GST on sales (one eleventh of GST-inclusive sales),
- 1B is GST on purchases (one eleventh of GST-inclusive purchases),
- W2 is amounts withheld from wages (the withholding rate times gross wages),
- and the net amount payable to the ATO is (1A - 1B) + W2,

state the dollar amount at each of G1, 1A, 1B, W2, and the net amount payable.
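
The arithmetic the task asks for can be checked with a minimal sketch like the one below. It applies only the definitions given in the task (one eleventh of each GST-inclusive amount, the withholding rate times gross wages, and (1A - 1B) + W2). The function name `bas_figures` and its parameter names are hypothetical, chosen for illustration, and `Decimal` is used here simply to avoid floating-point rounding. Like the task itself, this is a worked hypothetical, not a tax calculation tool.

```python
from decimal import Decimal


def bas_figures(sales_incl_gst, purchases_incl_gst, gross_wages, withholding_rate):
    """Compute the BAS labels using the definitions stated in the task.

    All names here are illustrative assumptions, not part of any ATO API.
    Amounts are passed as strings so Decimal parses them exactly.
    """
    sales = Decimal(sales_incl_gst)
    purchases = Decimal(purchases_incl_gst)
    wages = Decimal(gross_wages)
    rate = Decimal(withholding_rate)

    g1 = sales                      # G1: total sales, GST-inclusive
    label_1a = sales / 11           # 1A: one eleventh of GST-inclusive sales
    label_1b = purchases / 11       # 1B: one eleventh of GST-inclusive purchases
    w2 = wages * rate               # W2: withholding rate times gross wages
    net = (label_1a - label_1b) + w2  # net amount payable: (1A - 1B) + W2

    return {"G1": g1, "1A": label_1a, "1B": label_1b, "W2": w2, "net": net}


# Figures given in the task for the quarter.
figures = bas_figures("110000", "66000", "40000", "0.20")
for label, amount in figures.items():
    print(f"{label}: ${amount:,.0f}")
```

With the given figures this yields G1 = $110,000, 1A = $10,000, 1B = $6,000, W2 = $8,000, and a net amount payable of $12,000.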