llama3.1-8b-icq-4bit
Llama-3.1-8B-Instruct 4-bit Quantized Inference Endpoint
3 endpoints
The in-depth APIMemo review for this API hasn't been published yet —
the data below comes straight from the public marketplace listing.
llama3.1-8b-icq-4bit endpoints
| Method | Endpoint | Description |
|---|---|---|
| GET | listModels /v1/models | Returns a list of available models |
| GET | healthCheck /healthz | Check if the API is running and get basic information |
| POST | createCompletion /v1/completions | Creates a completion for the provided prompt. Compatible with OpenAI's completions API. Supports both streaming and non-streaming responses. |
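
The three endpoints above could be called roughly as follows. This is a minimal sketch, not taken from the listing: the base URL, the bearer-token header, and the model id are placeholders (the real model id should come from `/v1/models`), and the request fields and the `data: ...` / `data: [DONE]` stream framing are assumed to follow OpenAI's completions API, which the listing says this endpoint is compatible with.

```python
import json
import urllib.request

# Hypothetical values: the listing gives neither a host nor an auth scheme.
BASE_URL = "https://example.invalid"
API_KEY = "YOUR_API_KEY"


def build_request(method, path, payload=None):
    """Build (but do not send) a request for one of the listed endpoints."""
    headers = {"Accept": "application/json"}
    if API_KEY:
        # Assumed bearer-token auth; adjust to what the marketplace requires.
        headers["Authorization"] = f"Bearer {API_KEY}"
    data = None
    if payload is not None:
        data = json.dumps(payload).encode("utf-8")
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(
        BASE_URL + path, data=data, headers=headers, method=method
    )


def completion_request(prompt, model="llama3.1-8b-icq-4bit",
                       max_tokens=64, stream=False):
    """Request for POST /v1/completions using OpenAI-style fields.

    The default model id is a guess based on the listing name.
    """
    payload = {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "stream": stream,
    }
    return build_request("POST", "/v1/completions", payload)


def parse_stream_line(line):
    """Decode one server-sent-event line from a streaming response.

    Returns the text chunk, or None for non-data lines and the
    final [DONE] marker (assumed OpenAI-style framing).
    """
    line = line.strip()
    if not line.startswith("data:"):
        return None
    body = line[len("data:"):].strip()
    if body == "[DONE]":
        return None
    return json.loads(body)["choices"][0]["text"]


def send(request, timeout=10):
    """Send a prepared request and return the decoded JSON body."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With a real host and key filled in, `send(build_request("GET", "/healthz"))` would check that the API is up, `send(build_request("GET", "/v1/models"))` would list the available models, and `send(completion_request("Hello"))` would request a non-streaming completion. For `stream=True`, the response would be read line by line and each line passed to `parse_stream_line` instead of being decoded as a single JSON body.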