Error codes, explanations and solutions

Last updated: November 14, 2025

When you use the together.ai API, a request may fail with one of the error codes below. Use this table to troubleshoot the problem before getting in touch with us:

Code	Description	Cause	Solution
400	Invalid request	The request to the API is misconfigured	Ensure your request body is valid JSON and your API key is correct. Also ensure you're using the right prompt format, which is different for Mistral and LLaMA models.
401	Authentication error	The API key is missing or invalid	Ensure you are using the correct API key and supplying it correctly.
402	Payment required	The account linked with the API key has reached its spending limit or run out of credits	Adjust your billing settings or make a payment to resume service.
403	Bad request	The combined total of the input token count and the `max_tokens` parameter must be less than the context length of the model being queried.	Set `max_tokens` to a lower number. If querying a chat model, you may set `max_tokens` to `null` and let the model decide when to stop generation.
404	Not found	The endpoint URL or model name is invalid	Check that your request is being made to the correct endpoint (see the API reference page for details) and that the model being queried is available.
429	Rate limit exceeded	Too many requests have been sent in a short period of time	Throttle the rate at which requests are sent to our servers (see our rate limits).
500	Server error	There is an unknown server error	This error is caused by an issue with our servers. Please try again. If the issue persists after 30 minutes, please contact us.
503	Engine overloaded	Our servers are seeing a high amount of traffic	Please try again after a brief wait. If the issue persists after 30 minutes, please contact us.
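
The table suggests two kinds of handling: codes 429, 500 and 503 are worth retrying after a wait, while 400 to 404 need a change to the request or account before another attempt can succeed. The Python sketch below is one possible way to encode that split on the client side; it is not an official client. The names `call_with_retries`, `backoff_delay`, `largest_max_tokens` and the `send` callable are hypothetical, and the delay values are illustrative assumptions rather than documented limits.

```python
import random
import time
from typing import Callable, Tuple

# Codes from the table above where trying again later may succeed.
RETRYABLE = {429, 500, 503}


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Exponential backoff with full jitter; attempt counts from 0.

    base and cap are assumed values, not documented service limits.
    """
    return random.uniform(0, min(cap, base * 2 ** attempt))


def call_with_retries(
    send: Callable[[], Tuple[int, str]],
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Call send() until it returns a non-retryable status or attempts run out.

    send is a hypothetical callable that performs one API request and
    returns (status_code, body). Codes such as 400, 401, 402, 403 and 404
    are returned immediately, because retrying the same request cannot help.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status not in RETRYABLE:
            return status, body
        if attempt < max_attempts - 1:
            sleep(backoff_delay(attempt))
    # Every attempt hit a retryable error; return the last response.
    return status, body


def largest_max_tokens(input_tokens: int, context_length: int) -> int:
    """Largest max_tokens that keeps input_tokens + max_tokens strictly
    below context_length (the condition in the 403 row), or 0 if none fits.
    """
    return max(0, context_length - input_tokens - 1)
```

To use it, wrap a single request in a function that returns the status code and body, then pass that function as `send`. If the last response is still 500 or 503 after all attempts, the advice in the table applies: wait, and contact support if the problem persists after 30 minutes.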