CLI Evaluator Setup
Set up comprehensive AI evaluation in minutes using the Handit CLI. This guide covers the complete CLI workflow for connecting evaluators, managing model tokens, and configuring evaluation settings.
Prerequisites: You need the Handit CLI installed and a Handit.ai account. If you haven’t installed the CLI yet, run npm install -g @handit.ai/cli.
Quick Setup
Step 1: Run the Evaluators Setup Command
handit-cli evaluators-setup
The CLI will walk you through an interactive setup process:
Connect Evaluation Models
The CLI will prompt you to add model tokens for evaluation:
- OpenAI: GPT-4, GPT-3.5-turbo for high-quality evaluation
- Together AI: Llama models for cost-effective evaluation
- Other providers: Anthropic, Cohere, and more
Configure Model Tokens
For each model you want to use:
- Enter your API key when prompted
- Set usage preferences (evaluation vs optimization)
- Configure rate limits and budgets
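If you prefer to stage keys in your shell before running the interactive setup, a minimal sketch looks like the following. The CLI normally writes these variables for you, and the HANDIT_EVALUATION_* names are taken from the generated configuration shown later in this guide, so treat this as optional and verify the names against your own setup.
# Optional: stage evaluation keys in the current shell before running setup
export HANDIT_EVALUATION_OPENAI_KEY=your-openai-key
export HANDIT_EVALUATION_TOGETHER_KEY=your-together-key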
Connect Existing Evaluators
The CLI will show your existing evaluators and help you:
- Associate them with specific LLM nodes
- Set evaluation percentages (how often to run each evaluator)
- Configure priority levels
Review and Apply Configuration
The CLI will show a summary of your setup and apply the configuration to your Handit.ai account.
Advanced Configuration
Managing Multiple Evaluators
You can run the CLI setup multiple times to add or modify evaluators:
# Run evaluator setup to modify configuration
handit-cli evaluators-setup
The CLI will guide you through:
- Adding new evaluation models (OpenAI, Together AI, etc.)
- Connecting evaluators to LLM nodes in your project
- Setting evaluation percentages for each evaluator
- Configuring model tokens for evaluation
Example Configuration
After setup, your evaluators might look like this:
Evaluator: "Response Completeness"
├── Evaluation Percentage: 10% (recommended for general quality)
├── Priority: Normal
└── Associated Nodes: customer-service-response
Evaluator: "Hallucination Detection"
├── Evaluation Percentage: 20% (higher for critical accuracy)
├── Priority: High
└── Associated Nodes: customer-service-response, technical-support-response
CLI-Generated Configuration
After running the setup, the CLI creates or updates configuration files in your project:
Environment Variables
# Evaluation model tokens (automatically configured)
HANDIT_EVALUATION_OPENAI_KEY=your-openai-key
HANDIT_EVALUATION_TOGETHER_KEY=your-together-key
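To confirm the keys are visible in your current shell (assuming a Unix-like environment), a quick check is:
# List any evaluation keys currently set in this shell
printenv | grep HANDIT_EVALUATION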
Configuration File
{
  "evaluators": [
    {
      "id": "completeness-check",
      "name": "Response Completeness",
      "associatedNodes": ["customer-service-response"],
      "evaluationPercentage": 10,
      "priority": "normal",
      "modelId": "gpt-4"
    }
  ],
  "models": [
    {
      "id": "gpt-4",
      "provider": "openai",
      "model": "gpt-4",
      "usage": "evaluation"
    }
  ]
}
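As a point of reference, the "Hallucination Detection" evaluator from the example above would appear as a second entry in the evaluators array. The field names mirror the generated file; the id value and the lowercase "high" priority string are illustrative assumptions.
{
  "id": "hallucination-detection",
  "name": "Hallucination Detection",
  "associatedNodes": ["customer-service-response", "technical-support-response"],
  "evaluationPercentage": 20,
  "priority": "high",
  "modelId": "gpt-4"
}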
Best Practices
Evaluation Percentage Guidelines
✅ Recommended Percentages
- Critical quality aspects: 15-25%
- General quality checks: 5-15%
- Format/compliance checks: 5-10%
- Experimental evaluators: 1-5%
⚠️ Consider Your Traffic
- High traffic: Start with lower percentages
- Low traffic: Use higher percentages for faster insights
- Cost-sensitive: Balance evaluation frequency with budget (see the sizing sketch after this list)
- Critical applications: Prioritize accuracy over cost
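As a quick sizing sketch (the traffic figure is hypothetical), the evaluation percentage maps directly to the number of evaluator calls, which is what drives evaluation cost:
# Hypothetical: 10,000 LLM calls per day with one evaluator at 10%
# results in roughly 1,000 evaluation calls per day to your evaluation model.
echo $(( 10000 * 10 / 100 ))   # 1000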
Model Selection Strategy
For High Accuracy Needs:
- Primary: GPT-4 (highest quality evaluations)
- Backup: Claude-3 Opus (alternative high-quality option)
For Cost-Effective Evaluation:
- Primary: GPT-3.5-turbo (good balance of cost/quality)
- Backup: Llama-2-70B (open source alternative)
For Specific Use Cases:
- Code evaluation: GPT-4 or Claude-3
- Creative content: GPT-4 or Claude-3
- Factual accuracy: GPT-4 with web search
- Multilingual: GPT-4 or Claude-3
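In the generated configuration file, a models section that pairs a high-accuracy primary with a cost-effective fallback might look like the sketch below. It reuses the field names shown earlier; the "together" provider identifier and the llama-2-70b model string are assumptions, so check the values the CLI actually writes for your account.
"models": [
  {
    "id": "gpt-4",
    "provider": "openai",
    "model": "gpt-4",
    "usage": "evaluation"
  },
  {
    "id": "llama-2-70b",
    "provider": "together",
    "model": "llama-2-70b",
    "usage": "evaluation"
  }
]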
Troubleshooting
Common CLI Issues
Command not found:
# Reinstall the CLI
npm uninstall -g @handit.ai/cli
npm install -g @handit.ai/cli
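If the command is still missing after reinstalling, confirm the package is installed globally and that npm's global bin directory is on your PATH (these are standard npm commands, not Handit-specific):
# Check that the package is present in npm's global tree
npm list -g @handit.ai/cli
# Global binaries live under this prefix; make sure its bin directory is on your PATH
npm config get prefix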
Authentication errors:
- Ensure you’re logged into your Handit.ai account
- Check that your account has evaluation permissions
- Verify your integration token is valid
Model token validation fails:
- Double-check your API keys are correct
- Ensure the API keys have sufficient permissions
- Verify the model provider is supported
Configuration Issues
Evaluators not running:
- Check that LLM nodes are receiving traffic
- Verify evaluation percentages are > 0%
- Ensure model tokens have sufficient credits
Inconsistent results:
- Review evaluator prompts for clarity
- Consider if criteria are too subjective
- Test with known good/bad examples
Next Steps
- Monitor results in your Agent Performance dashboard
- Create Custom Evaluators for specific needs
- Set up Optimization to automatically improve based on evaluation results
- Explore Advanced Evaluation Features
Ready for optimization? Once your evaluations are running, set up automated optimization to improve your AI based on the evaluation results.