Error codes, explanations and solutions

Last updated: November 14, 2025

While using the together.ai API you may receive responses with a range of error codes. You can use this article as a reference to assist with troubleshooting the problem before getting in touch with us:

Code

Description

Cause

Solution

400

Invalid request

The request to the API is misconfigured

Ensure your request is a Valid JSON and your API Key is correct. Also ensure you're using the right prompt format - which is different for Mistral and LLaMA models.

401

Authentication error

The API key is missing or invalid

Ensure you are using the correct API Key and supplying it correctly

402

Payment required

The account linked with the API key has reached it's spending limit/run out of credits

Adjust your billing settings or make a payment to resume service.

403

Bad request

The combined total of the Input token count and max_tokens parameter must be less than the context length of the model being queried.

Set max_tokens to a lower number. If querying a chat model, you may set max_tokens to null and let the model decide when to stop generation.

404

Not found

The endpoint URL or model name is invalid

Check your request is being made to the correct endpoint (see the API reference page for details) and that the model being queried is available

429

Rate limit exceeded

Too many requests have been sent in a short period of time

Throttle the rate at which requests are sent to our servers (see our rate limits)

500

Server error

There is an unknown server error

This error is caused by an issue with our servers. Please try again. If the issue persists after 30 minutes, please contact us

503

Engine overloaded

Our servers are seeing a high amount of traffic

Please try again after a brief wait. If the the issue persists after 30 minutes, please contact us