Qwen 2.5-Max
Qwen2.5-Max is a large-scale Mixture-of-Expert (MoE) model that has been pretrained on over 20 trillion tokens. It demonstrated leading performance in benchmarks such as Arena-Hard, LiveBench, LiveCodeBench, and GPQA-Diamond, compared to models such as DeepSeek V3 and Llama 3.1.
Qwen 2.5-Max endpoints
| Method | Endpoint | Description |
|---|---|---|
| POST |
Chat Completion / |
Creates a model response for the given chat conversation. |
Qwen 2.5-Max pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | $1 / month | — |
|
| PRO | $5 / month | — |
|
| ULTRA | $25 / month | — |
|
| MEGA Recommended | $75 / month | — |
|