If you want to run inference:
Choose from the available models list (you can also fetch it programmatically; see the model-listing sketch after this list).
For Serverless Endpoints models, you can run inference directly, without starting a virtual machine (VM) first (see the serverless sketch after this list).
If you're trying to run inference on a model you fine-tuned, start its VM instance either:
Directly from the model's page on api.together.ai, or
Using our start and stop instance APIs (see the instance API sketch after this list).
If your desired model isn't listed, feel free to request a model.
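
A minimal sketch of fetching the available models programmatically, using the Together Python SDK (`pip install together`); it assumes a `TOGETHER_API_KEY` environment variable is set:

```python
# Sketch: list the models available for inference with the Together Python SDK.
# Assumes TOGETHER_API_KEY is set in the environment.
from together import Together

client = Together()

# Print each model's identifier; pass one of these IDs when running inference.
for model in client.models.list():
    print(model.id)
```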
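For a Serverless Endpoints model, a request can go straight to the completion endpoint, with no instance to manage. A minimal sketch using the same SDK; the model name is only an example, so swap in any serverless model from the list:

```python
# Sketch: run chat inference against a Serverless Endpoints model.
# No VM needs to be started first for serverless models.
from together import Together

client = Together()  # reads TOGETHER_API_KEY from the environment

response = client.chat.completions.create(
    model="meta-llama/Llama-3.3-70B-Instruct-Turbo",  # example serverless model
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```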
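For fine-tuned models, the start and stop instance APIs bring the VM up before inference and shut it down afterwards. The sketch below assumes `POST /instances/start` and `POST /instances/stop` endpoints on `api.together.xyz` that take a `model` query parameter; treat the exact paths and parameters as assumptions and confirm them against the API reference:

```python
# Hedged sketch: start (or stop) the VM instance behind a fine-tuned model.
# ASSUMPTION: the endpoint paths ("/instances/start", "/instances/stop") and
# the "model" query parameter are inferred from the start/stop instance APIs
# named above; verify them in the API reference before relying on this.
import os
import requests

API_BASE = "https://api.together.xyz"  # assumed base URL
headers = {"Authorization": f"Bearer {os.environ['TOGETHER_API_KEY']}"}

def set_instance_state(model: str, action: str) -> dict:
    """Start or stop the instance serving `model`; action is 'start' or 'stop'."""
    resp = requests.post(
        f"{API_BASE}/instances/{action}",
        params={"model": model},
        headers=headers,
    )
    resp.raise_for_status()
    return resp.json()

# Hypothetical fine-tuned model name, used for illustration only.
set_instance_state("yourname/your-fine-tuned-model", "start")
# ... run inference while the instance is up ...
set_instance_state("yourname/your-fine-tuned-model", "stop")
```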