Skip to main contentUse run artifacts to understand quality and failure modes.
What to inspect first
- Overall run status (
queued, running, completed, failed)
- Sample-level output quality and scorer deltas
- Event timeline and trace artifacts
Common debugging flow
- Identify failing or low-score samples.
- Open sample artifacts and event logs.
- Adjust solver/scorer config.
- Re-run and compare results.
Reliability tip
Keep one stable reference suite/dataset for regression checks while iterating on new solver changes.