Skip to main content
Surface: GET /agents/runs/{id}, error.type = "JobFailed"

When this error occurs

Each agent-loop iteration submits a model-inference job to the data plane and polls it to completion. This error means that job reached a Failed terminal state rather than completing — the inference itself never produced a usable result. Common causes:
  • Data Plane issues. No rights to invoke AI, for example
  • The model provider rejected the request (throttling, content too long, an unsupported parameter).
  • A transient data-plane / execution-cluster failure that exhausted the job’s own retries.
  • An inference configuration that only fails at execution time.

How to fix

  1. If the failure looks transient (provider throttling, a one-off cluster blip), start a new run on the conversation.
  2. If it reproduces, report the log_id and run id to support — the jobs.<job_id> row holds the detailed failure reason.
  3. Check the run’s effective_config (model, max tokens, data plane); a persistently-failing job often points at a configuration the data plane can’t satisfy.