Evaluations
evaluations
Methods
Create Evaluation
List Evaluations
Get Evaluation
Archive Evaluation
Update or Restore Evaluation
Get schema information for evaluation item data, including field names, types, and occurrence counts.
Include archived items in schema analysis
Filter evaluations using metadata and other criteria. Supports up to 10 filters with AND logic.
Get taxonomy JSON for contributor evaluation question tasks.
Domain types
Schema information for an evaluation's item data structure
Tasks
evaluations.tasks
Methods
Add a new test criteria (LLM judge, contributor question, etc.) to an existing evaluation. Gated: rejected if any contributor annotation task has been claimed or completed. Kicks off the evaluation workflow so the new task runs against existing items.
Replace a single test criteria's configuration, identified by its alias. Gated: rejected if any contributor annotation task for the evaluation has been claimed or completed.