Back to Registry

Fireworks-ai API Catalog

Comprehensive overview of all fireworks-ai models available through the LLM Kit

Overview

Provider
fireworks-ai
Total Models
18
Last Updated
2026-02-07

Deepseek v3.2

Model ID accounts/fireworks/models/deepseek-v3p2
Family
deepseek-v3

Specifications

Context Window: 160,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

FLUX.1 Kontext Max

Model ID accounts/fireworks/models/flux-kontext-max
Family
flux-kontext

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.08
Output
$0.08

FLUX.1 Kontext Pro

Model ID accounts/fireworks/models/flux-kontext-pro
Family
flux-kontext

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.04
Output
$0.04

FLUX.1 [dev] FP8

Model ID accounts/fireworks/models/flux-1-dev-fp8
Family
flux-1

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0005
Output
$0.0005

FLUX.1 [schnell] FP8

Model ID accounts/fireworks/models/flux-1-schnell-fp8
Family
flux-1

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text, image

Capabilities

Image generation

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.00035
Output
$0.00035

GLM-4.6

Model ID accounts/fireworks/models/glm-4p6
Family
glm-4p5v

Specifications

Context Window: 198,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

GLM-4.7

Model ID accounts/fireworks/models/glm-4p7
Family
glm-4p5v

Specifications

Context Window: 198,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Kimi K2 Instruct 0905

Model ID accounts/fireworks/models/kimi-k2-instruct-0905
Family
kimi-k2

Specifications

Context Window: 256,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Capabilities

Fine tuning

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Kimi K2 Thinking

Model ID accounts/fireworks/models/kimi-k2-thinking
Family
kimi-k2

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 20,000 tokens

Modalities

Input
text
Output
text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

MiniMax-M2.1

Model ID accounts/fireworks/models/minimax-m2p1
Family
minimax-m2p1

Specifications

Context Window: 200,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text
Output
text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Qwen3 VL 235B A22B Instruct

Model ID accounts/fireworks/models/qwen3-vl-235b-a22b-instruct
Family
qwen3

Specifications

Context Window: 256,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Fine tuning Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Qwen3 VL 235B A22B Thinking

Model ID accounts/fireworks/models/qwen3-vl-235b-a22b-thinking
Family
qwen3

Specifications

Context Window: 256,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Fine tuning Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Qwen3 VL 30B A3B Instruct

Model ID accounts/fireworks/models/qwen3-vl-30b-a3b-instruct
Family
qwen3

Specifications

Context Window: 256,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Fine tuning Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Qwen3 VL 30B A3B Thinking

Model ID accounts/fireworks/models/qwen3-vl-30b-a3b-thinking
Family
qwen3

Specifications

Context Window: 256,000 tokens
Max Output Tokens: 4,096 tokens

Modalities

Input
text, image
Output
text

Capabilities

Fine tuning Vision

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.5
Output
$1.5

Streaming ASR v1

Model ID accounts/fireworks/models/fireworks-asr-large
Family
fireworks-asr-large

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0032
Output
$0.0032

Streaming ASR v2

Model ID accounts/fireworks/models/fireworks-asr-v2
Family
fireworks-asr-v2

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0035
Output
$0.0035

Whisper V3 Large

Model ID accounts/fireworks/models/whisper-v3
Family
whisper-v3

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0015
Output
$0.0015

Whisper V3 Turbo

Model ID accounts/fireworks/models/whisper-v3-turbo
Family
whisper-v3-turbo

Specifications

Context Window: 128,000 tokens
Max Output Tokens: 16,000 tokens

Modalities

Input
audio
Output
text

Capabilities

Speech to text

Pricing (per million tokens)

Text Tokens - Standard
Input
$0.0009
Output
$0.0009