/ blog / tag
1 post
Five dimensions for evaluating agent output, a checklist you can walk in fifteen minutes, and situations where you just send the result back without a lengthy debate.