Groq API Catalog

A comprehensive overview of all Groq models available through the LLM Kit.

Overview

Provider: groq
Total Models: 15
Last Updated: 2025-12-10

GPT-OSS 120B

Model ID: openai/gpt-oss-120b
Family: gpt-oss-120b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 65,536 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling, Reasoning

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.15
Output: $0.75
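
The per-million-token rates above can be turned into a per-request cost estimate. The following is a minimal sketch, not part of the LLM Kit: `estimate_cost` and its parameter names are hypothetical, and the two rates are copied by hand from the GPT-OSS 120B entry. `Decimal` is used so that small dollar amounts are not distorted by binary floating point.

```python
from decimal import Decimal

# Rates in USD per million tokens, copied from the GPT-OSS 120B entry.
INPUT_RATE = Decimal("0.15")
OUTPUT_RATE = Decimal("0.75")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    input_cost = INPUT_RATE * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_RATE * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


# Example: 10,000 prompt tokens and 2,000 completion tokens.
print(estimate_cost(10_000, 2_000))
```

The same function applies to any other priced entry in this catalog once the two rate constants are swapped for that model's values.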

GPT-OSS 20B

Model ID: openai/gpt-oss-20b
Family: gpt-oss-20b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 65,536 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling, Reasoning

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.10
Output: $0.50

Groq Compound

Model ID: groq/compound
Family: groq-compound

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input: text
Output: text

Capabilities

Web search, Code execution, Tool use

Groq Compound Mini

Model ID: groq/compound-mini
Family: groq-compound-mini

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input: text
Output: text

Capabilities

Web search, Code execution, Tool use

Kimi K2 0905

Model ID: moonshotai/kimi-k2-instruct-0905
Family: kimi-k2-0905

Specifications

Context Window: 262,144 tokens
Max Output Tokens: 16,384 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input: $1.00
Output: $3.00

Llama 3.1 8B Instant

Model ID: llama-3.1-8b-instant
Family: llama-3.1-8b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 131,072 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.05
Output: $0.08
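
For this entry the maximum output equals the context window, so the usable completion budget depends on how long the prompt is. The sketch below illustrates that arithmetic under one stated assumption: that prompt and completion tokens share the same context window. The catalog itself does not confirm this, and `max_completion_tokens` is a hypothetical helper, not an LLM Kit function.

```python
# Limits copied from the Llama 3.1 8B Instant entry.
CONTEXT_WINDOW = 131_072
MAX_OUTPUT_TOKENS = 131_072


def max_completion_tokens(prompt_tokens: int) -> int:
    """Largest completion budget for a given prompt size (hypothetical helper).

    Assumption: prompt and completion tokens share the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # The budget is capped by both the output limit and the space left.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


# Example: a 100,000-token prompt.
print(max_completion_tokens(100_000))
```

For models whose output limit is far below the context window, such as the 8,192-token entries, the output cap is the binding constraint for most prompt sizes.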

Llama 3.3 70B Versatile

Model ID: llama-3.3-70b-versatile
Family: llama-3.3-70b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 32,768 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.59
Output: $0.79

Llama 4 Maverick (17Bx128E)

Model ID: meta-llama/llama-4-maverick-17b-128e-instruct
Family: llama-4-maverick

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.20
Output: $0.60

Llama 4 Scout (17Bx16E)

Model ID: meta-llama/llama-4-scout-17b-16e-instruct
Family: llama-4-scout

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.11
Output: $0.34

Llama Guard 4 12B

Model ID: meta-llama/llama-guard-4-12b
Family: llama-guard-4-12b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 1,024 tokens

Modalities

Input: text
Output: text

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.20
Output: $0.20

PlayAI TTS

Model ID: playai-tts
Family: playai-tts

Specifications

Context Window: 8,192 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input: text
Output: audio

Capabilities

Text to speech

PlayAI TTS Arabic

Model ID: playai-tts-arabic
Family: playai-tts-arabic

Specifications

Context Window: 8,192 tokens
Max Output Tokens: 8,192 tokens

Modalities

Input: text
Output: audio

Capabilities

Text to speech

Qwen3 32B

Model ID: qwen/qwen3-32b
Family: qwen3-32b

Specifications

Context Window: 131,072 tokens
Max Output Tokens: 40,960 tokens

Modalities

Input: text
Output: text

Capabilities

Function calling

Pricing (per million tokens)

Text Tokens - Standard
Input: $0.29
Output: $0.59

Whisper Large v3

Model ID: whisper-large-v3
Family: whisper-large-v3

Specifications

Context Window: not listed

Modalities

Input: audio
Output: text

Capabilities

Speech to text

Whisper Large v3 Turbo

Model ID: whisper-large-v3-turbo
Family: whisper-large-v3-turbo

Specifications

Context Window: not listed

Modalities

Input: audio
Output: text

Capabilities

Speech to text
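
Because every entry above lists modalities and capabilities in the same shape, the catalog lends itself to simple filtering. The sketch below is illustrative only: `ModelEntry`, `CATALOG`, and `find_models` are hypothetical names rather than LLM Kit APIs, and the data is a hand-copied subset of four entries, not the full list of 15.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One catalog row (hypothetical structure for illustration)."""
    model_id: str
    input_modality: str
    output_modality: str
    capabilities: frozenset


# A hand-copied subset of the entries above, not the full catalog.
CATALOG = [
    ModelEntry("openai/gpt-oss-120b", "text", "text",
               frozenset({"Function calling", "Reasoning"})),
    ModelEntry("groq/compound", "text", "text",
               frozenset({"Web search", "Code execution", "Tool use"})),
    ModelEntry("playai-tts", "text", "audio",
               frozenset({"Text to speech"})),
    ModelEntry("whisper-large-v3", "audio", "text",
               frozenset({"Speech to text"})),
]


def find_models(capability: str) -> list:
    """Return the model IDs in CATALOG that list the given capability."""
    return [m.model_id for m in CATALOG if capability in m.capabilities]


print(find_models("Speech to text"))
```

Matching is by exact capability string, so the labels must be spelled as they appear in the catalog.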