Claw-Eval: A Framework That Tests Whether AI Agents Are Safe, Not Just Successful

Open evaluation framework for autonomous AI agents that scores side effects and safety alongside task completion. 326 GitHub stars in its first day.
artificial-intelligence
software-engineering
Author

Kabui, Charles

Published

2026-04-19

Keywords

ai-agents, agent-evaluation, benchmarks, trustworthy-ai, autonomous-systems