Back to Registry

Together API Catalog

Comprehensive overview of all together models available through the LLM Kit

Overview

Provider
together
Total Models
38
Last Updated
2025-12-10

DeepSeek R1 Distill Llama 70B

Model ID deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Family
deepseek-r1-distill-llama-70b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$2.0
Output
$2.0

DeepSeek R1 Distill Qwen 14B

Model ID deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Family
deepseek-r1-distill-qwen-14b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.18
Output
$0.18

DeepSeek R1-0528-tput

Model ID deepseek-ai/DeepSeek-R1-0528-tput
Family
deepseek-r1-0528-tput

Specifications

Context Window: 163,839 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.55
Output
$2.19

DeepSeek-R1

Model ID deepseek-ai/DeepSeek-R1
Family
deepseek-r1

Specifications

Context Window: 163,839 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$3.0
Output
$7.0

DeepSeek-V3

Model ID deepseek-ai/DeepSeek-V3
Family
deepseek-v3

Specifications

Context Window: 163,839 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$1.25
Output
$1.25

DeepSeek-V3.1

Model ID deepseek-ai/DeepSeek-V3.1
Family
deepseek-v3.1

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.6
Output
$1.7

GLM-4.5-Air

Model ID zai-org/GLM-4.5-Air-FP8
Family
glm-4.5-air

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.2
Output
$1.1

GPT-OSS 120B

Model ID openai/gpt-oss-120b
Family
gpt-oss-120b

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.15
Output
$0.6

GPT-OSS 20B

Model ID openai/gpt-oss-20b
Family
gpt-oss-20b

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.05
Output
$0.2

Gemma 3N E4B Instruct

Model ID google/gemma-3n-E4B-it
Family
gemma-3n-e4b

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.02
Output
$0.04

Gemma Instruct (2B)

Model ID google/gemma-2b-it
Family
gemma-2b

Specifications

Context Window: 8,192 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.02
Output
$0.04

Kimi K2 0905

Model ID moonshotai/Kimi-K2-Instruct-0905
Family
kimi-k2-0905

Specifications

Context Window: 262,144 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$1.0
Output
$3.0

Kimi K2 Instruct

Model ID moonshotai/Kimi-K2-Instruct
Family
kimi-k2

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$1.0
Output
$3.0

LLaMA-2 (70B)

Model ID meta-llama/Llama-2-70b-hf
Family
llama-2-70b

Specifications

Context Window: 4,096 tokens
Max Output Tokens: 2,048 tokens

Modalities

Input
text
Output
text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.9
Output
$0.9

Llama 3 70B Instruct Reference

Model ID meta-llama/Meta-Llama-3-70b-chat-hf
Family
llama-3-70b-reference

Specifications

Context Window: 8,192 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.88
Output
$0.88

Llama 3 70B Instruct Turbo

Model ID meta-llama/Llama-3-70b-chat-hf-turbo
Family
llama-3-70b-turbo

Specifications

Context Window: 8,192 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.88
Output
$0.88

Llama 3 8B Instruct Lite

Model ID meta-llama/Meta-Llama-3-8B-Instruct-Lite
Family
llama-3-8b-lite

Specifications

Context Window: 8,192 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.1
Output
$0.1

Llama 3.1 405B Instruct Turbo

Model ID meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo
Family
llama-3.1-405b

Specifications

Context Window: 130,815 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$3.5
Output
$3.5

Llama 3.1 70B Instruct Turbo

Model ID meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo
Family
llama-3.1-70b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.88
Output
$0.88

Llama 3.1 8B Instruct Turbo

Model ID meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
Family
llama-3.1-8b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.18
Output
$0.18

Llama 3.2 3B Instruct Turbo

Model ID meta-llama/Llama-3.2-3B-Instruct-Turbo
Family
llama-3.2-3b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.06
Output
$0.06

Llama 3.3 70B Instruct Turbo

Model ID meta-llama/Llama-3.3-70B-Instruct-Turbo
Family
llama-3.3-70b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.88
Output
$0.88

Llama 4 Maverick (17Bx128E)

Model ID meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Family
llama-4-maverick

Specifications

Context Window: 1,048,576 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.27
Output
$0.85

Llama 4 Scout (17Bx16E)

Model ID meta-llama/Llama-4-Scout-17B-16E-Instruct
Family
llama-4-scout

Specifications

Context Window: 1,048,576 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.18
Output
$0.59

Marin 8B Instruct

Model ID marin-community/marin-8b-instruct
Family
marin-8b

Specifications

Context Window: 4,096 tokens
Max Output Tokens: 2,048 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.18
Output
$0.18

Mistral (7B) Instruct v0.2

Model ID mistralai/Mistral-7B-Instruct-v0.2
Family
mistral-7b-v0.2

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.2
Output
$0.2

Mistral Instruct

Model ID mistralai/Mistral-7B-Instruct-v0.1
Family
mistral-7b-v0.1

Specifications

Context Window: 8,192 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.2
Output
$0.2

Mistral Small 3

Model ID mistralai/Mistral-Small-24B-Instruct-2501
Family
mistral-small-3

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.8
Output
$0.8

Mixtral 8x7B Instruct v0.1

Model ID mistralai/Mixtral-8x7B-v0.1
Family
mixtral-8x7b

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.6
Output
$0.6

Qwen 2.5 72B

Model ID Qwen/Qwen2.5-72B-Instruct-Turbo
Family
qwen2.5-72b

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$1.2
Output
$1.2

Qwen QwQ-32B

Model ID Qwen/QwQ-32B
Family
qwq-32b

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$1.2
Output
$1.2

Qwen2.5 7B Instruct Turbo

Model ID Qwen/Qwen2.5-7B-Instruct-Turbo
Family
qwen2.5-7b

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.3
Output
$0.3

Qwen2.5 Coder 32B Instruct

Model ID Qwen/Qwen2.5-Coder-32B-Instruct
Family
qwen2.5-coder-32b

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.8
Output
$0.8

Qwen2.5-VL 72B Instruct

Model ID Qwen/Qwen2.5-VL-72B-Instruct
Family
qwen2.5-vl-72b

Specifications

Context Window: 32,768 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text, image
Output
text

Capabilities

Vision Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$1.95
Output
$8.0

Qwen3 235B A22B FP8 Throughput

Model ID Qwen/Qwen3-235B-A22B-fp8-tput
Family
qwen3-235b-a22b-tput

Specifications

Context Window: 40,960 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.2
Output
$0.6

Qwen3 235B A22B Instruct 2507 FP8

Model ID Qwen/Qwen3-235B-A22B-Instruct-2507-tput
Family
qwen3-235b-a22b-instruct

Specifications

Context Window: 262,144 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.2
Output
$0.6

Qwen3 235B A22B Thinking 2507 FP8

Model ID Qwen/Qwen3-235B-A22B-Thinking-2507
Family
qwen3-235b-a22b-thinking

Specifications

Context Window: 262,144 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.65
Output
$3.0

Qwen3-Coder 480B A35B Instruct

Model ID Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Family
qwen3-coder-480b

Specifications

Context Window: 256,000 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input
text
Output
text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input
$2.0
Output
$2.0