© 2026 Banja Labs. All rights reserved.

Banja Lab / Benchmarks

Google

gemini-3.1-flash-lite

default reasoning, API single-shot

gemini-3.1-flash-lite (Google) scored 69.5% composite across 87 tasks: code, UI, full websites, SVG, marketing pages, dashboards, animations, and Australian legal and accounting. Tasks are graded by execution, with the visual builds graded by a cross-family vision panel (leave-one-family-out). Run on 2026-06-23.

Composite: 69.5% (95% CI 60.7% - 77.7%)
Est. cost: $0.0715 USD
Tokens: 79,278 (generation)
Wall clock: 6 min (end to end)
Gate pass: - (validated)
Separation: - (good vs bad)
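
The headline cost and token figures can be turned into rough per-task numbers. The sketch below is plain arithmetic on the values reported above; the derived metrics are not reported by the page, and the per-million-token rate assumes, purely for illustration, that the whole estimated cost is attributable to the generation tokens.

```python
# Headline figures copied from the summary above.
EST_COST_USD = 0.0715
GENERATION_TOKENS = 79_278
TASKS = 87

# Derived metrics (hypothetical; the page does not report these).
cost_per_task = EST_COST_USD / TASKS
tokens_per_task = GENERATION_TOKENS / TASKS

# Assumption: all estimated cost is spent on generation tokens.
# If input tokens are also billed, the true output rate is lower.
usd_per_million_tokens = EST_COST_USD / GENERATION_TOKENS * 1_000_000

print(f"cost per task:        ${cost_per_task:.5f}")
print(f"tokens per task:      {tokens_per_task:.0f}")
print(f"USD per 1M tokens:    ${usd_per_million_tokens:.2f}")
```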

By domain

Composite score per domain, weakest first. Judge is the vision model’s read, shown for the visual domains.

Domain | Tasks | Judge | Composite
svg-scene | 1 | 14.6% | 14.6%
Auth screen | 1 | 41.7% | 41.7%
Marketing page | 1 | 41.7% | 41.7%
Dashboard | 1 | 47.9% | 47.9%
Animation | 2 | 52.1% | 52.1%
Websites | 15 | - | 54.0%
Australian accounting | 23 | - | 61.3%
UI components | 19 | - | 76.3%
Australian law | 5 | - | 80.0%
SVG and graphics | 13 | - | 86.6%
Programming | 6 | - | 100.0%
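
The per-domain figures appear consistent with the headline composite being a task-weighted mean of the domain composites. The page does not state how the composite is aggregated, so the sketch below is only a consistency check under that assumption, using the task counts and percentages listed above.

```python
# (domain, task count, composite %) copied from the By domain figures.
DOMAINS = [
    ("svg-scene", 1, 14.6),
    ("Auth screen", 1, 41.7),
    ("Marketing page", 1, 41.7),
    ("Dashboard", 1, 47.9),
    ("Animation", 2, 52.1),
    ("Websites", 15, 54.0),
    ("Australian accounting", 23, 61.3),
    ("UI components", 19, 76.3),
    ("Australian law", 5, 80.0),
    ("SVG and graphics", 13, 86.6),
    ("Programming", 6, 100.0),
]


def weighted_composite(rows):
    """Return (total tasks, task-weighted mean of the domain composites)."""
    total_tasks = sum(count for _, count, _ in rows)
    weighted_sum = sum(count * score for _, count, score in rows)
    return total_tasks, weighted_sum / total_tasks


total_tasks, composite = weighted_composite(DOMAINS)
print(total_tasks, round(composite, 1))  # 87 69.5
```

Because large domains such as Australian accounting (23 tasks) and UI components (19 tasks) carry most of the weight under this assumption, the single-task visual domains move the headline number very little.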

Outputs

The rendered output for each task. Visual tasks show the vision judge's score out of 5; all other tasks show a percentage score.

Task | Domain | Description | Score
ANIM-0001 | Animation | Simple UI animation | judge 2.6/5
ANIM-0002 | Animation | Complex animation scene | judge 3.6/5
AUTH-0001 | Auth screen | Register / login screen | judge 2.7/5
AYGAT-0001 | UI components | Keyboard-operable modal dialog with focus trap and return | 0.0%
AYGAT-0002 | UI components | Keyboard-operable select-only combobox with a listbox popup | 0.0%
AYGAT-0003 | UI components | Keyboard-operable menu button with arrow-key navigation | 0.0%
AYGAT-0004 | UI components | Keyboard-operable tabs with roving tabindex and arrow-key activation | 97.2%
AYGAT-0005 | UI components | Keyboard-operable accordion with disclosure buttons and arrow navigation | 98.1%
AYGAT-0006 | UI components | Keyboard-operable ARIA radiogroup with roving tabindex | 97.2%
DASH-0001 | Dashboard | SaaS analytics dashboard | judge 2.9/5
DIAGR-0001 | SVG and graphics | Bar chart with exact proportional heights for [4, 10, 6, 8, 12] | 25.0%
DIAGR-0002 | SVG and graphics | Four-node pipeline flow graph with exactly four edges | 100.0%
DIAGR-0003 | SVG and graphics | Horizontal stacked bar with proportional segments and a legend | 100.0%
DIAGR-0004 | SVG and graphics | Org chart with three reporting edges and a fixed hierarchy | 100.0%
DIAGR-0005 | SVG and graphics | Gantt chart with proportional start offsets and durations | 100.0%
DIAGR-0006 | SVG and graphics | Grouped two-series bar chart across three categories with a legend | 0.4%
DIAGR-0007 | SVG and graphics | Five-node hub-and-satellite network with exactly six edges | 100.0%
MKT-0001 | Marketing page | Marketing landing page | judge 2.7/5
PIXEL-0001 | UI components | Stat card built to a frozen token sheet | 100.0%
PIXEL-0002 | UI components | Three overlapping layers in an exact z-order and position | 0.0%
PIXEL-0003 | UI components | Flex toolbar with an exact gap and exact button geometry | 100.0%
PIXEL-0004 | UI components | Two-column grid split with exact column widths and gutter | 100.0%
PIXEL-0005 | UI components | Pill CTA button built to an exact box and token sheet | 100.0%
PIXEL-0006 | UI components | Progress bar with an exact fill width and track tokens | 100.0%
PIXEL-0007 | UI components | Segmented pill nav with an exact active pill and gap | 100.0%
PIXEL-0008 | UI components | Notification badge pinned to an avatar at an exact offset and z-order | 100.0%
RESPO-0001 | Websites | Header nav that collapses to a menu button below a breakpoint | 0.0%
RESPO-0002 | Websites | Feature grid that reflows from one to two to three columns | 100.0%
RESPO-0003 | Websites | Sidebar and content layout that never overflows, including at the tablet width | 100.0%
RESPO-0004 | Websites | Responsive nav that both collapses at the breakpoint and toggles open on mobile | 100.0%
RESPO-0005 | Websites | Gallery grid that widens from one to four columns as the viewport grows | 0.0%
RESPO-0006 | Websites | Headline type scale that steps up at the tablet and desktop breakpoints | 100.0%
RESPO-0007 | Websites | CTA buttons that stack full-width on mobile and sit in a row on desktop | 0.0%
SCENE-0001 | svg-scene | Sydney Harbour at golden hour (SVG scene) | judge 1.6/5
SCREE-0001 | Websites | Reproduce a three-tier SaaS pricing page from its screenshot | 97.6%
SCREE-0002 | Websites | Reproduce a dark analytics dashboard overview from its screenshot | 0.0%
SCREE-0003 | Websites | Reproduce an editorial blog article page from its screenshot | 0.0%
SCREE-0004 | Websites | Reproduce a split product-feature section from its screenshot | 0.0%
SCREE-0005 | Websites | Reproduce the Banja home page hero from its screenshot | 93.4%
SVG-0001 | SVG and graphics | Single-concept flat icon (coffee cup with steam) | 100.0%
SVG-0002 | SVG and graphics | Simple house icon (square, triangle roof, door) | 100.0%
SVG-0003 | SVG and graphics | Flat multi-object scene (sun, hills, hot-air balloon) | 100.0%
SVG-0004 | SVG and graphics | Geometric logo mark (overlapping circles, abstract monogram) | 100.0%
SVG-0005 | SVG and graphics | Tiny bar chart for values [3, 7, 5, 9] | 100.0%
SVG-0006 | SVG and graphics | Traffic light with only the green light lit | 100.0%
UI-0001 | UI components | Accessible pricing card with a monthly/annual toggle | 98.1%
UI-0002 | UI components | Accessible accordion FAQ that expands on click | 99.5%
UI-0003 | UI components | Tabbed interface that switches panels on click | 99.3%
UI-0004 | UI components | Signup form with inline email validation | 61.4%
UI-0005 | UI components | Stat and testimonial card grid | 98.8%
WEB-0001 | Websites | Landing page with a working mobile nav toggle | 99.6%
WEB-0002 | Websites | Dashboard with a sortable data table | 24.2%
WEB-0003 | Websites | Pricing page with a working billing toggle and FAQ accordion | 95.8%
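
The judge scores out of 5 listed here line up with the judge percentages in the By domain figures if the 1-5 scale is rescaled linearly to 0-100%. The page does not document this mapping, so the sketch below is an inference from the published numbers, not the benchmark's stated method; `judge_to_percent` and the tolerance are illustrative names and values.

```python
def judge_to_percent(score, low=1.0, high=5.0):
    """Linearly rescale a judge score on [low, high] to a 0-100 percentage."""
    return (score - low) / (high - low) * 100.0


# (label, judge score out of 5, reported domain judge %) copied from the page.
# Animation uses the mean of its two task scores, 2.6 and 3.6.
JUDGED = [
    ("svg-scene", 1.6, 14.6),
    ("Auth screen", 2.7, 41.7),
    ("Marketing page", 2.7, 41.7),
    ("Dashboard", 2.9, 47.9),
    ("Animation", (2.6 + 3.6) / 2, 52.1),
]

# Judge scores are shown to one decimal, so each may be off by up to 0.05,
# which is 1.25 percentage points after rescaling; add 0.05 for the
# rounding of the reported percentage itself.
TOLERANCE = 0.05 / 4 * 100 + 0.05

results = []
for label, judge, reported in JUDGED:
    estimate = judge_to_percent(judge)
    ok = abs(estimate - reported) <= TOLERANCE
    results.append(ok)
    print(f"{label:15s} estimate {estimate:5.1f}%  reported {reported:5.1f}%  match={ok}")
```

The small gaps between estimate and reported value are what one would expect if the page computes the percentage from unrounded judge scores and then rounds both figures for display.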

Objective tasks

Programming, Australian legal and accounting tasks, graded by execution. 21 of 34 scored a perfect 100.0%; the remaining 13 are listed below.

Task | Domain | Difficulty | Objective | pass@1
ACC-0002 | Australian accounting | medium | 10.0% | 0.0%
ACC-0003 | Australian accounting | easy | 90.0% | 100.0%
AUSFA-0001 | Australian accounting | easy | 10.0% | 0.0%
AUSFA-0002 | Australian accounting | medium | 85.0% | 100.0%
AUSFA-0004 | Australian accounting | medium | 0.0% | 0.0%
AUSFA-0005 | Australian accounting | medium | 0.0% | 0.0%
AUSFA-0006 | Australian accounting | medium | 15.0% | 0.0%
AUSFA-0009 | Australian accounting | hard | 0.0% | 0.0%
AUSFA-0012 | Australian accounting | hard | 33.3% | 0.0%
AUSFA-0013 | Australian accounting | hard | 66.7% | 0.0%
AUSFA-0014 | Australian accounting | hard | 0.0% | 0.0%
BASQU-0003 | Australian accounting | hard | 0.0% | 0.0%
LAW-0003 | Australian law | hard | 0.0% | 0.0%