New AI-agent eval cases, as they ship

Vendor-agnostic test cases for tool-using LLM agents — accuracy, safety, edge cases, prompt injection (OWASP LLM01), hallucination, cost. Drop your email and I'll send new cases + the 6-dimension cheatsheet. No spam.

Free 5-case starter: GitHub  ·  Full pack ($49): Gumroad