Evaluator
Test, compare, and iterate on LLM prompts and models quickly.
Prompt experiments
Create datasets and compare prompts across models.
Scoring
Automate evals with metrics and model-graded rubrics.
Shareable reports
Visualize results and collaborate with your team.
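A prompt experiment scored with a metric can be sketched roughly as follows. This is an illustration only, not Evaluator's actual API: `exact_match`, `run_experiment`, and `toy_model` are hypothetical names, and `toy_model` is a stand-in for a real LLM call.

```python
from typing import Callable, Dict, List, Tuple

# A dataset is a list of (input, expected output) pairs.
Dataset = List[Tuple[str, str]]


def exact_match(output: str, expected: str) -> float:
    """Metric: 1.0 if the output equals the expected text, ignoring case and outer whitespace."""
    return 1.0 if output.strip().lower() == expected.strip().lower() else 0.0


def run_experiment(
    prompts: Dict[str, str],
    model: Callable[[str], str],
    dataset: Dataset,
    metric: Callable[[str, str], float] = exact_match,
) -> Dict[str, float]:
    """Return the mean metric score for each prompt template over the dataset."""
    scores = {}
    for name, template in prompts.items():
        total = sum(metric(model(template.format(input=x)), y) for x, y in dataset)
        scores[name] = total / len(dataset)
    return scores


def toy_model(prompt: str) -> str:
    """Hypothetical stand-in for an LLM: answers only when asked for a capital."""
    answers = {"France": "Paris", "Japan": "Tokyo"}
    if "capital" not in prompt.lower():
        return "unknown"
    for country, capital in answers.items():
        if country in prompt:
            return capital
    return "unknown"


dataset = [("France", "Paris"), ("Japan", "Tokyo")]
prompts = {
    "vague": "Tell me about {input}.",
    "specific": "What is the capital of {input}? Answer with one word.",
}
scores = run_experiment(prompts, toy_model, dataset)
print(scores)
```

Swapping `toy_model` for a function that calls a real model, or passing a different `metric`, would let the same loop compare prompts across models.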