Evaluator

Test, compare and iterate on LLM prompts and models fast.

Prompt experiments

Create datasets and compare prompts across models.

Scoring

Automate evals with metrics and model-graded rubrics.
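
As a rough illustration of what pairing a metric with a model-graded rubric can look like, here is a minimal Python sketch. It is not Evaluator's API: the names exact_match, build_rubric_prompt, parse_grade and evaluate, the 1-5 scale, and the "Score: N" reply format are all assumptions made for this example, and the judge is any callable you supply that sends a prompt to a grading model and returns its text reply.

```python
import re
from statistics import mean
from typing import Callable


def exact_match(output: str, expected: str) -> float:
    """Metric: 1.0 if the strings match after trimming and lowercasing, else 0.0."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def build_rubric_prompt(question: str, output: str, rubric: str, scale: int = 5) -> str:
    """Build the grading prompt for a judge model (hypothetical format)."""
    return (
        f"Rubric: {rubric}\n"
        f"Question: {question}\n"
        f"Answer: {output}\n"
        f"Grade the answer from 1 to {scale}. Reply as 'Score: N'."
    )


def parse_grade(reply: str, scale: int = 5) -> float:
    """Turn a reply such as 'Score: 4' into a value between 0.0 and 1.0."""
    match = re.search(r"score:\s*(\d+)", reply, re.IGNORECASE)
    if match is None:
        raise ValueError(f"no score found in judge reply: {reply!r}")
    score = int(match.group(1))
    if not 1 <= score <= scale:
        raise ValueError(f"score {score} is outside 1..{scale}")
    return (score - 1) / (scale - 1)


def evaluate(rows: list[dict], rubric: str, judge: Callable[[str], str]) -> dict:
    """Average both scores over rows with 'question', 'output' and 'expected' keys."""
    metric_scores = [exact_match(r["output"], r["expected"]) for r in rows]
    rubric_scores = [
        parse_grade(judge(build_rubric_prompt(r["question"], r["output"], rubric)))
        for r in rows
    ]
    return {"exact_match": mean(metric_scores), "rubric": mean(rubric_scores)}


if __name__ == "__main__":
    rows = [
        {"question": "Capital of France?", "output": " Paris ", "expected": "paris"},
        {"question": "Capital of Spain?", "output": "Lisbon", "expected": "Madrid"},
    ]
    # Stand-in judge for the demo; a real one would call a grading model.
    print(evaluate(rows, "Answer is factually correct.", lambda prompt: "Score: 3"))
```

Keeping the judge as a plain callable means the same loop can be run against different grading models, or against a fixed stub while testing.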

Shareable reports

Visualize results and collaborate with your team.