New AI-agent eval cases, as they ship

Vendor-agnostic test cases for tool-using LLM agents — accuracy, safety, edge cases, prompt injection (OWASP LLM01), hallucination, cost. Drop your email and I'll send new cases + the 6-dimension cheatsheet. No spam.

Launch offer: the full AI Agent QA Eval Pack for ~~$49~~ $9
23 ready-to-run cases · 6 dimensions · vendor-agnostic. Code TRY9 applied at checkout.
Get the pack — $9

Free 5-case starter: GitHub · Full pack ($49): Gumroad