LLM Gateway
Chat Completion Endpoint
OpenAI-compatible chat completion endpoint for LLM interactions.
POST
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
application/json
A list of messages comprising the conversation so far
ID of the model to use
Available options:
hermes-3-llama3.1-8b
, meta-llama/llama-3.3-70b-instruct
, mistralai/mistral-7b-instruct
, mistralai/mixtral-8x22b-instruct
, mistralai/mixtral-8x7b-instruct
, nvidia/llama-3.1-nemotron-70b-instruct
, qwen/qwen-2.5-coder-32b-instruct
The maximum number of tokens to generate
Whether to stream back partial progress
Sampling temperature between 0 and 1
Required range:
0 < x < 1