Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
application/json
Chat completion request parameters
ID of the model to use
Available options:
hermes-3-llama3.1-8b, meta-llama/llama-3.3-70b-instruct, mistralai/mistral-7b-instruct, mistralai/mixtral-8x22b-instruct, mistralai/mixtral-8x7b-instruct, nvidia/llama-3.1-nemotron-70b-instruct, qwen/qwen-2.5-coder-32b-instruct A list of messages comprising the conversation so far
The maximum number of tokens to generate
Example:
1024
Sampling temperature between 0 and 1
Required range:
0 <= x <= 1Whether to stream back partial progress