ok.proof

Hire developers who
get things done with AI.

Give them a real problem and the latest AI model.
See exactly how they work.

See how it works
Templates

Tests for every role

Developers

Can they ship, or just talk about shipping?

Fix a broken API with 3 failing endpointsBuild a dashboard from an empty projectRefactor a 400-line component into clean modules

Product

Can they turn messy data into a clear plan?

Prioritize 40 feature requests by revenue impactWrite a positioning doc from competitor researchCreate a launch plan from a spec and a deadline

Designers

Can they spot what's broken and make it better?

Redesign a checkout flow with 68% drop-offBuild a design system from inconsistent componentsFix a page that scores 34 on accessibility

Leaders

Can they find the signal in the noise?

Build a board update from 12 months of raw metricsPropose a reorg to cut cycle time by 40%Evaluate 4 vendor proposals and write the recommendation

Or bring your own product.

Upload your actual work as the starting point. The closer the test is to the job, the better the signal.

api/routes/checkout.ts
14 async function processCheckout(req) {
15 const cart = await getCart(req.id)
16 // BUG: returns null for guest users
17 const total = cart.items.reduce(...)
18 return charge(total)
19 }

Your codebase

A real bug from your issue tracker. A feature from your roadmap.

Checkout — Redesign v2
Pay now
68% drop-off here
Layers
Properties

Your designs

A screen that needs rethinking. A flow with real drop-off data.

Q1 Revenue by Segment.csv
SegmentARRAcctsRenewal
Enterprise$48k12Q2
Growth$124k38Q1
Starter$8k204Q3
Agency$31k7Q2

Your data

A messy spreadsheet that needs a story. A dashboard that needs building.

Vendor Evaluation — Draft
This section needs a clearer recommendation

Your documents

A brief that needs sharpening. A proposal that needs structure.

How it works

Set up in minutes, not days

01

Describe the task

Write what you want candidates to build. Set a time limit. Send them an invite link. No scheduling, no setup on their end.

New test
Title

Build a dashboard

Time

45 min

Model

Claude

02

They build with AI

Candidates open the link and start working. They prompt an AI agent to write code, see live results, and iterate until they're happy.

Live session32:15

Add a chart component

Creating Chart.js line chart...

03

You see everything

Replay the full session. Read every prompt they wrote. See the final code and a working demo. Review on your schedule.

Review
PreviewCodeChat
Why it works

Built for how hiring actually happens

Real work, real signal

They build something that matters, not solve a puzzle. You see what they'd actually deliver on the job.

See the thinking, not just the output

Watch how they break down the problem, prompt the AI, and iterate. The process tells you more than the result.

45 minutes, not a week

Candidates finish in one sitting. No calendar coordination, no follow-ups, no ghosting.

Candidates actually want to do it

They build with modern AI tools on a real problem. Top candidates see this as a chance to show off, not a chore.

Same conditions, every candidate

Same task, same tools, same time limit. No advantage from local setup, prior context, or outside help.

Review whenever you want

Full replay, final output, and a working demo — all waiting for you. No scheduling a live session.

Works beyond engineering

Designers, product managers, analysts, leaders. Anyone who works with AI can be evaluated this way.

Send a link, that's it

No accounts for candidates, no IDE installs, no environment setup. They click and start building.

FAQ

Common questions

Take-homes take days and you only see the end result. Here, candidates finish in one sitting and you see how they got there.

No. You send them a link. They open it in a browser and start building. No account, no IDE, no setup.

Every candidate works in an isolated environment. The only way to write code is through the AI chat. Every action is logged and replayable.

Most teams run 30–60 minute sessions. You can set anywhere from 5 minutes to 8 hours depending on the role.

Claude, GPT, or Gemini. You pick the model when you create the test.

Free while we’re in beta. Sign up and create your first test in a few minutes.