Quick selection guide
| Your Priority | Recommended Model |
|---|---|
| Speed and cost | anthropic.claude-haiku-4.5 or openai.o4-mini |
| Balanced performance | anthropic.claude-sonnet-4.5 |
| Complex reasoning | anthropic.claude-opus-4.5 or openai.gpt-4.1 |
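The guide above can be sketched as a small lookup, which keeps model choice in one place instead of scattered across call sites. The model identifiers are copied from the table; the priority keys and the `recommend_models` function are hypothetical names chosen for illustration.

```python
from __future__ import annotations

# Model identifiers come from the quick selection guide.
# The priority keys below are hypothetical labels, not part of any API.
RECOMMENDED_MODELS = {
    "speed_and_cost": ["anthropic.claude-haiku-4.5", "openai.o4-mini"],
    "balanced": ["anthropic.claude-sonnet-4.5"],
    "complex_reasoning": ["anthropic.claude-opus-4.5", "openai.gpt-4.1"],
}


def recommend_models(priority: str) -> list[str]:
    """Return the recommended model identifiers for a given priority."""
    if priority not in RECOMMENDED_MODELS:
        expected = ", ".join(sorted(RECOMMENDED_MODELS))
        raise ValueError(f"unknown priority {priority!r}; expected one of: {expected}")
    # Return a copy so callers cannot mutate the shared table.
    return list(RECOMMENDED_MODELS[priority])


print(recommend_models("balanced"))
```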
Model comparison
Anthropic Claude models
| Model | Speed | Capability | Best For |
|---|---|---|---|
| Claude Haiku 4.5 | Fastest | Good | Simple classification, extraction |
| Claude Sonnet 4.5 | Fast | Better | Analysis, summarization, most tasks |
| Claude Opus 4.5 | Slower | Best | Complex reasoning, nuanced decisions |
OpenAI models
| Model | Speed | Capability | Best For |
|---|---|---|---|
| o4-mini | Fastest | Good | Quick responses, simple tasks |
| GPT-4.1 | Moderate | Better | Advanced reasoning |
| gpt-oss-120b | Moderate | Good | General-purpose tasks |
When to use smaller models
Use Claude Haiku or o4-mini when:
- Task is straightforward: Binary classification, simple extraction
- High volume: Processing many items where speed matters
- Cost sensitivity: Budget constraints require efficiency
- Latency matters: User-facing features needing fast response
When to use medium models
Use Claude Sonnet 4.5 when:
- Task requires understanding: Content analysis, summarization
- Balanced needs: Good quality without excessive cost
- Most production use cases: Default choice for typical workflows
When to use larger models
Use Claude Opus 4.5 or GPT-4.1 when:
- Complex reasoning required: Multi-step analysis, nuanced judgment
- High stakes: Decisions with significant impact
- Ambiguous inputs: Tasks requiring interpretation
- Quality over speed: Accuracy is paramount
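The criteria in the three lists above can be expressed as a rough decision function. This is only a sketch: the `TaskProfile` fields and `choose_tier` are hypothetical names, and the precedence (larger-model signals win over smaller-model signals, with the medium tier as the default) is an assumption, since the guidance does not say how to resolve conflicting signals.

```python
from dataclasses import dataclass


@dataclass
class TaskProfile:
    # Signals that point toward a larger model
    complex_reasoning: bool = False
    high_stakes: bool = False
    ambiguous_inputs: bool = False
    quality_over_speed: bool = False
    # Signals that point toward a smaller model
    straightforward: bool = False
    high_volume: bool = False
    cost_sensitive: bool = False
    latency_sensitive: bool = False


def choose_tier(profile: TaskProfile) -> str:
    """Return "small", "medium", or "large" for a task profile."""
    # Assumption: any larger-model signal takes precedence.
    if (profile.complex_reasoning or profile.high_stakes
            or profile.ambiguous_inputs or profile.quality_over_speed):
        return "large"
    if (profile.straightforward or profile.high_volume
            or profile.cost_sensitive or profile.latency_sensitive):
        return "small"
    # The medium tier is described as the default for typical workflows.
    return "medium"


print(choose_tier(TaskProfile(high_volume=True, latency_sensitive=True)))
```

A team that values cost over accuracy might reverse the first two checks; the ordering is a policy decision rather than a fixed rule.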
Task-based recommendations
Classification tasks
| Complexity | Recommended Model |
|---|---|
| Binary (yes/no, spam/not spam) | Claude Haiku |
| Multi-class (3-5 categories) | Claude Haiku or Sonnet |
| Complex taxonomy (many categories, nuanced) | Claude Sonnet or Opus |
Extraction tasks
| Complexity | Recommended Model |
|---|---|
| Simple fields (dates, names, numbers) | Claude Haiku |
| Structured entities (addresses, products) | Claude Sonnet |
| Complex relationships (multi-entity) | Claude Opus |
Generation tasks
| Complexity | Recommended Model |
|---|---|
| Short text (taglines, labels) | Claude Haiku or Sonnet |
| Medium content (descriptions, summaries) | Claude Sonnet |
| Long-form (reports, analysis) | Claude Sonnet or Opus |
Transformation tasks
| Complexity | Recommended Model |
|---|---|
| Format conversion (dates, units) | Claude Haiku |
| Register translation (technical to plain language) | Claude Sonnet |
| Complex interpretation (natural language to code) | Claude Opus |
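The four task tables above can be folded into one nested lookup. In this sketch the task and complexity keys are hypothetical labels derived from the table rows, and `candidate_models` is an illustrative helper; where a row lists two models, the smaller one is placed first so that callers can try it before the larger one.

```python
from __future__ import annotations

# Keys are hypothetical labels for the table rows; values list the
# recommended models with the smaller model first.
TASK_RECOMMENDATIONS = {
    "classification": {
        "binary": ["Claude Haiku"],
        "multi_class": ["Claude Haiku", "Claude Sonnet"],
        "complex_taxonomy": ["Claude Sonnet", "Claude Opus"],
    },
    "extraction": {
        "simple_fields": ["Claude Haiku"],
        "structured_entities": ["Claude Sonnet"],
        "complex_relationships": ["Claude Opus"],
    },
    "generation": {
        "short_text": ["Claude Haiku", "Claude Sonnet"],
        "medium_content": ["Claude Sonnet"],
        "long_form": ["Claude Sonnet", "Claude Opus"],
    },
    "transformation": {
        "format_conversion": ["Claude Haiku"],
        "language_translation": ["Claude Sonnet"],
        "complex_interpretation": ["Claude Opus"],
    },
}


def candidate_models(task: str, complexity: str) -> list[str]:
    """Return the recommended models for a task type and complexity level."""
    try:
        return list(TASK_RECOMMENDATIONS[task][complexity])
    except KeyError:
        raise ValueError(f"no recommendation for {task!r} / {complexity!r}") from None


print(candidate_models("classification", "multi_class"))
```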
Testing different models
Try multiple models on representative sample data and compare output quality before committing to one.
Best practices
| Practice | Description |
|---|---|
| Start small | Begin with Haiku; upgrade only if quality is insufficient |
| Test on samples | Compare models on representative data before production |
| Monitor quality | Track output quality metrics over time |
| Balance cost and quality | Don’t use a larger model than a simple task needs |
| Consider latency | User-facing features may need faster models |
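The "start small" and "test on samples" practices can be combined into a simple evaluation loop: score each model on labeled samples, smallest first, and stop at the first one that meets a quality threshold. This is a minimal sketch, not a real client. Each model is represented by a plain Python callable, the two stub functions stand in for actual model calls, and exact-match accuracy is an assumed metric that suits classification but not free-form generation.

```python
from __future__ import annotations

from typing import Callable, Optional, Sequence

# Hypothetical interface: a model is any function from input text to output text.
ModelFn = Callable[[str], str]


def accuracy(model: ModelFn, samples: Sequence[tuple[str, str]]) -> float:
    """Fraction of samples whose output matches the expected label."""
    if not samples:
        raise ValueError("samples must not be empty")
    correct = sum(
        1 for text, expected in samples
        if model(text).strip().lower() == expected.lower()
    )
    return correct / len(samples)


def pick_smallest_sufficient(
    models: Sequence[tuple[str, ModelFn]],
    samples: Sequence[tuple[str, str]],
    threshold: float,
) -> tuple[Optional[str], dict[str, float]]:
    """Score models in the given order and return the first that meets the threshold."""
    scores: dict[str, float] = {}
    for name, model in models:
        scores[name] = accuracy(model, samples)
        if scores[name] >= threshold:
            return name, scores
    return None, scores  # no model was good enough


# Stubs standing in for real model calls (assumption for illustration only).
def stub_small(text: str) -> str:
    return "spam" if "free" in text.lower() else "not spam"


def stub_large(text: str) -> str:
    lowered = text.lower()
    return "spam" if ("free" in lowered or "winner" in lowered) else "not spam"


samples = [
    ("Free gift inside", "spam"),
    ("You are a WINNER", "spam"),
    ("Meeting at 3pm", "not spam"),
    ("Lunch tomorrow?", "not spam"),
]
chosen, scores = pick_smallest_sufficient(
    [("small-stub", stub_small), ("large-stub", stub_large)], samples, threshold=0.9
)
print(chosen, scores)
```

Because the loop stops at the first sufficient model, larger models are only evaluated when the smaller ones fall short, which keeps the comparison itself inexpensive.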

