AI Gateway
🤖 Production-ready AI inference API with automatic scaling and ultra-low latency. ## Why Choose Episteme Nexus API? ✅ **Blazing Fast** - Average response time < 2 seconds ✅ **Cost-Effective** - Up to 70% cheaper than direct providers ✅ **Auto-Scaling** - Handles any traffic load automatically ✅ **OpenAI Compatible** - Drop-in replacement for OpenAI API ✅ **Multiple Models** - Access Llama…
AI Gateway endpoints
| Method | Endpoint | Description |
|---|---|---|
| GET |
health_health_get /health |
Check API health status |
| GET |
get_stats_stats_get /stats |
Get queue statistics |
| POST |
chat_completions_v1_chat_completions_post /v1/chat/completions |
OpenAI-compatible chat completion endpoint Compatible with OpenAI client libraries and tools. |
| POST |
generate_generate_post /generate |
Submit an AI generation job Returns a job_id that can be used to check status and retrieve results. |
| GET |
get_status_status__job_id__get /status/{job_id} |
Check the status of a generation job |