Text

| Model | Parameters | Use case |
|---|---|---|
| Qwen/Qwen3.5-397B-A17B | 397B (17B active) | Frontier-level structured output. Best accuracy for complex schemas. |
| Qwen/Qwen3-14B-FP8 | 14B | High-volume tasks and standard classification. |
| openai/gpt-oss-20b | 21B (3.6B active) | Lower latency. Good for local or specialized use cases. |

Vision

| Model | Parameters | Use case |
|---|---|---|
| Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 | 235B (22B active) | Advanced multimodal model. Performance similar to GPT-5 Chat. |
| Qwen/Qwen3-VL-30B-A3B-Instruct-FP8 | 30B (3B active) | Mid-size multimodal model. Performance similar to GPT-4.1-mini. |