Datasets

Datasets are the source inputs for evaluation runs.

Typical dataset flow

  1. Create or import dataset samples.
  2. Attach datasets to an evaluation suite.
  3. Execute runs and inspect sample-level outputs.
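
The three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `Sample`, `EvaluationSuite`, `attach`, and `run` are hypothetical names invented for this sketch, not the product's API, and the exact-match pass check stands in for whatever scoring a real suite applies.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Sample:
    """One dataset sample (hypothetical shape: an input and an expected output)."""
    sample_id: str
    input: str
    expected: str


@dataclass
class EvaluationSuite:
    """Hypothetical in-memory suite that holds the datasets attached to it."""
    name: str
    datasets: dict[str, list[Sample]] = field(default_factory=dict)

    def attach(self, dataset_name: str, samples: list[Sample]) -> None:
        # Step 2: attach a dataset to the suite under a name.
        self.datasets[dataset_name] = samples

    def run(self, system: Callable[[str], str]) -> list[dict]:
        # Step 3: execute the system on every sample and keep one record per sample.
        results = []
        for dataset_name, samples in self.datasets.items():
            for sample in samples:
                output = system(sample.input)
                results.append({
                    "dataset": dataset_name,
                    "sample_id": sample.sample_id,
                    "output": output,
                    # Assumption: exact match decides pass/fail in this sketch.
                    "passed": output == sample.expected,
                })
        return results


# Step 1: create dataset samples.
samples = [
    Sample("s1", "hello", "HELLO"),
    Sample("s2", "world", "World"),
]

suite = EvaluationSuite("uppercase-check")
suite.attach("greetings", samples)

# The "system under test" here is just str.upper, chosen to keep the sketch runnable.
results = suite.run(str.upper)
for row in results:
    print(row["sample_id"], row["output"], row["passed"])
```

Keeping one record per sample, rather than only an aggregate score, is what makes it possible to inspect sample-level outputs after the run.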

Good dataset practices

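  • Keep the schema consistent across samples and dataset versions so that results stay comparable between runs.
  • Version a dataset whenever its prompts or tasks change, so each run can be traced to the exact data it used.
  • Start with a small dataset for fast iteration, then scale up.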
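
One possible way to apply these practices in code is sketched below. The helper names `check_schema`, `dataset_version`, and `small_subset` are hypothetical, and deriving a version tag from a content hash is an assumption made for this sketch, not a documented feature. Samples are assumed to be plain dictionaries with JSON-serializable values.

```python
import hashlib
import json


def check_schema(samples: list[dict]) -> set[str]:
    """Return the shared set of keys, or raise if any sample deviates from it."""
    if not samples:
        raise ValueError("dataset is empty")
    expected = set(samples[0])
    for index, sample in enumerate(samples):
        if set(sample) != expected:
            raise ValueError(
                f"sample {index} has keys {sorted(sample)}, expected {sorted(expected)}"
            )
    return expected


def dataset_version(samples: list[dict]) -> str:
    """Derive a short version tag from the dataset content.

    Any change to a prompt or task changes the tag, so results from
    different dataset versions are not compared by accident.
    """
    canonical = json.dumps(samples, sort_keys=True, ensure_ascii=False)
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:12]


def small_subset(samples: list[dict], n: int) -> list[dict]:
    """Take the first n samples for a fast iteration pass."""
    return samples[:n]


samples = [
    {"input": "2+2", "expected": "4"},
    {"input": "3+3", "expected": "6"},
]

print(sorted(check_schema(samples)))
print(dataset_version(samples))
print(len(small_subset(samples, 1)))
```

A content hash is only one option; a manually assigned version number works as well, provided it is bumped on every prompt or task change.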