Datasets
Datasets are the source inputs for evaluation runs.Typical dataset flow
- Create/import dataset samples.
- Attach datasets to an evaluation suite.
- Execute runs and inspect sample-level outputs.
Good dataset practices
- Keep schema consistent for stable comparisons.
- Version datasets when changing prompts/tasks.
- Start small for fast iteration, then scale.