Run model inference
Create a job to perform an LLM call. Supported on both AWS and Snowflake data planes.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Data plane identifier, must be provided when registering dataset in external dataplane.
anthropic.claude-haiku-4.5, anthropic.claude-sonnet-4.5, anthropic.claude-opus-4.5, openai.gpt-oss-120b, openai.gpt-4.1, openai.o4-mini Ordered conversation turns. The current wire shape carries a content array of
content blocks (text / tool_use / tool_result) so multi-turn flows with tool
interactions can round-trip without flattening to prose.
The legacy {role, text: String} shape is still accepted on the request side for
backwards compatibility — the API auto-canonicalizes legacy messages into a single
text content block. New integrations should use the content-block shape;
responses always emit it.
- Message (content-block shape, recommended)
- Message (legacy `{role, text}` shape, still accepted)
Response
OK
The id of the job that has been enqueued.

