Validating agentic behavior when “correct” isn’t deterministic - Enggist