Observe and debug runs
Use run artifacts to understand quality and failure modes.
What to inspect first
- Overall run status (queued, running, completed, failed)
- Sample-level output quality and scorer deltas
- Event timeline and trace artifacts
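As a rough illustration of this first pass, the sketch below checks the run status and then orders samples by scorer delta so the largest regressions surface first. It assumes a "scorer delta" means the difference between a sample's score and its score in a baseline run. The `SampleResult` class, its fields, and the `triage` function are hypothetical names invented for this example; they are not part of any documented API.

```python
from __future__ import annotations

from dataclasses import dataclass

# The four run statuses listed above.
VALID_STATUSES = {"queued", "running", "completed", "failed"}


@dataclass
class SampleResult:
    """Hypothetical per-sample record; field names are assumptions."""
    sample_id: str
    score: float
    baseline_score: float

    @property
    def delta(self) -> float:
        # Assumed definition: current score minus baseline score.
        return self.score - self.baseline_score


def triage(status: str, samples: list[SampleResult]) -> list[SampleResult]:
    """Return samples ordered from largest regression to largest improvement.

    Returns an empty list when the run has not completed, since scores
    from a queued, running, or failed run may be partial.
    """
    if status not in VALID_STATUSES:
        raise ValueError(f"unknown run status: {status!r}")
    if status != "completed":
        return []
    return sorted(samples, key=lambda s: s.delta)
```

When `triage` returns an empty list for a failed run, the event timeline and trace artifacts are usually the more useful place to look next.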
Common debugging flow
- Identify failing or low-score samples.
- Open sample artifacts and event logs.
- Adjust solver/scorer config.
- Re-run and compare results.
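The first and last steps of this flow can be sketched as plain functions over per-sample scores. This is a minimal sketch that assumes each run's scores have already been exported as a mapping from sample ID to score; the function names, the threshold, and the data layout are assumptions made for illustration, not a documented interface.

```python
from __future__ import annotations


def low_score_ids(scores: dict[str, float], threshold: float) -> list[str]:
    """Return the IDs of samples scoring below the threshold, sorted by ID."""
    return sorted(sid for sid, score in scores.items() if score < threshold)


def compare_runs(before: dict[str, float], after: dict[str, float]) -> dict[str, float]:
    """Return the per-sample score change for samples present in both runs.

    Positive values mean the re-run improved; negative values mean it regressed.
    """
    shared = before.keys() & after.keys()
    return {sid: after[sid] - before[sid] for sid in shared}


def regressions(before: dict[str, float], after: dict[str, float]) -> list[str]:
    """Return the IDs of samples whose score dropped in the re-run, sorted by ID."""
    changes = compare_runs(before, after)
    return sorted(sid for sid, change in changes.items() if change < 0)
```

Checking `regressions` after each config change helps confirm that a fix for one group of samples did not lower scores elsewhere.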