Updated May 23, 2026
Key Takeaways
- A paid test task is the most reliable vetting tool - it reveals real work quality, communication under pressure, and how the VA handles ambiguity.
- Design test tasks that mirror actual work: the same complexity, same tools, and a realistic (not extreme) time constraint.
- Evaluate the process, not just the output - how a VA handles questions and incomplete information tells you more than the finished product.
- Red flags during testing: questions that are already answered in the brief, missed deadline without proactive communication, and output that interprets the brief generously to cover gaps.
- Stealth Agents pre-vets all VAs before placement - you skip the cold screening and start with candidates who have already passed the baseline quality bar.
Most VA hiring mistakes happen because business owners rely on resumes and interviews to evaluate candidates. Resumes describe past roles. Interviews measure articulation. Neither predicts whether someone will complete your actual tasks correctly and on time.
A structured test process fixes this by evaluating real work rather than claimed experience.
Why Test Before Hiring
The cost of a bad VA hire is higher than most people expect. A month of onboarding investment, documentation of your processes, access to your tools and systems, and time spent giving feedback - all of it is wasted if the VA does not work out.
A one to two hour test task costing $20 to $50 in paid time is the cheapest insurance against that outcome. It is also the most respectful filter - you are asking candidates to demonstrate capability, not just claim it.
Designing the Test Task
The test task should mirror the actual work you will delegate. Design it with these parameters:
Realistic complexity. Do not make the test trivial (sorting a list) or unrealistically hard (complex multi-step research under a two-hour constraint). The task should be representatively difficult.
Actual tools. If the job requires working in Asana, give the test candidate access to Asana. If it requires researching LinkedIn, include a LinkedIn research component. Testing in a vacuum does not predict on-the-job performance.
Partial ambiguity. Include one or two elements where the brief is slightly underspecified. How the VA handles unclear instructions is diagnostic - do they ask clarifying questions before starting, or do they guess and not mention it?
Reasonable time frame. Give 24 to 48 hours for a task that would take 1 to 2 hours to complete. This respects the candidate's time while testing time management and prioritization.
Pay for it. Paying for the test task - at the candidate's stated rate - filters for serious candidates and is fair to the VA's time. Unpaid test tasks attract fewer quality candidates and create a bad first impression.
Sample Test Tasks by Role Type
Administrative VA: "Review the attached inbox export (50 emails) and create a categorized summary: action required, follow-up needed, informational. Flag any items where a response is overdue based on the email date. Use this template: [attach a simple template]. Deliver in the linked Google Sheet."
Research VA: "Research five US-based companies that fit these criteria: [ICP description]. For each, find the company name, CEO name, LinkedIn URL, company website, approximate employee count (from LinkedIn), and the best email format based on their website contact page. Deliver as a structured spreadsheet."
Customer Support VA: "Here are three customer inquiry emails. Write a response to each using the attached brand voice guidelines. Do not send the responses - just draft them in the attached Google Doc with one tab per inquiry."
What to Evaluate
Output quality: Does the work meet the stated standard? Is it accurate? Is the format correct?
Communication quality: Did the VA ask clarifying questions before starting? Were questions clear and specific? Did they confirm receipt of the task?
Process transparency: Did they mention any issues or blockers? If something was unclear, did they note how they interpreted it?
Timeliness: Did they deliver within the stated window? If they were going to miss it, did they communicate proactively?
Attention to detail: Did they follow the format instructions exactly? Small deviations from the brief often predict larger deviations in ongoing work.
Red Flags During Testing
Asking questions already answered in the brief. If you specified the output format and they ask about the output format, they are not reading carefully.
Missing the deadline without communication. A VA who misses a test task deadline and says nothing is demonstrating exactly what they will do on real deadlines.
Overly generous self-interpretation. If the brief was ambiguous and the VA completed the task in a way that happens to showcase their strengths but does not address the ambiguity, they are covering a gap rather than flagging it.
Polished output, wrong brief. A beautifully formatted deliverable that answers a slightly different question than what was asked. Looks impressive but signals they are not closely following instructions.
No follow-up after submission. Good VAs typically confirm delivery and offer to revise if needed. Silence after submission is a minor flag worth noting.
Running Multiple Candidates
For important roles, test two to three candidates simultaneously with the same task. The comparison is often illuminating - quality differences that seem minor in isolation become obvious when you see three responses to the same brief.
This costs more upfront but produces a clear hiring decision based on evidence rather than gut feel.
When Testing is Not Necessary
If you hire through a VA agency that pre-vets candidates, the agency's screening process substitutes for much of the cold testing. Stealth Agents, for example, evaluates candidates before placement - you receive candidates who have already passed the agency's quality bar. An onboarding period still serves as a live test, but you are not starting from zero.
The test task approach is most valuable for direct hires from freelance platforms or referrals, where you are the only filter.
Testing before hiring is not about distrust - it is about efficiency. A structured test task produces better hiring decisions faster, with less wasted onboarding investment on candidates who looked good but could not deliver.

