Qwerky 72B – Linear-Attention RWKV Model for Fast, Scalable
Qwerky-72B is a high-performance, RWKV-based language model that reimagines Meta’s Qwen 2.5 72B with linear attention to deliver ultra-fast inference speeds and low computational cost — without sacrificing accuracy. Built for scalable AI systems and large-context scenarios, Qwerky retains competitive performance on industry-standard benchmarks like ARC, HellaSwag, Lambada, and MMLU, while…
Qwerky 72B – Linear-Attention RWKV Model for Fast, Scalable endpoints
| Method | Endpoint | Description |
|---|---|---|
| POST |
Chat Completions /Qwerky-25/chat |
add your prompt and interact with model |
Qwerky 72B – Linear-Attention RWKV Model for Fast, Scalable pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | — |
|
| PRO | $5 / month | — |
|
| ULTRA | $15 / month | — |
|
| MEGA | $30 / month | — |
|