Coming Soon

Our serverless inference stack is in closed beta. Join the waitlist for early access.

Supported Models

Access the latest open-source models through a single API: text generation, code, multimodal, speech, embeddings, and image generation.

Showing 29 models

Llama 3.3 70B

Meta

Featured

Highly capable open-source model for complex reasoning

text
Parameters

70B

Context

128K

Input

$0.35/1M

Output

$0.40/1M

Mistral Large

Mistral

Featured

Top-tier reasoning and multilingual capabilities

text code
Parameters

123B

Context

128K

Input

$0.50/1M

Output

$0.50/1M

DeepSeek V3

DeepSeek

Featured

Frontier-class model with MoE architecture

text code
Parameters

671B

Context

128K

Input

$0.80/1M

Output

$0.80/1M

Qwen 2.5 72B

Alibaba

Featured

Strong multilingual and coding capabilities

text code
Parameters

72B

Context

128K

Input

$0.40/1M

Output

$0.45/1M

Mixtral 8x22B

Mistral

Featured

Efficient MoE architecture with expert routing

text code
Parameters

141B MoE

Context

64K

Input

$0.50/1M

Output

$0.50/1M

Llama 3.1 405B

Meta

Featured

Largest Llama 3.1 model, with frontier-level performance

text
Parameters

405B

Context

128K

Input

$1.00/1M

Output

$1.00/1M

Llama 3.1 70B

Meta

text
Parameters

70B

Context

128K

Input

$0.35/1M

Output

$0.40/1M

Llama 3.1 8B

Meta

Fast and efficient for simple tasks

text
Parameters

8B

Context

128K

Input

$0.10/1M

Output

$0.10/1M

Qwen 2.5 32B

Alibaba

text code
Parameters

32B

Context

128K

Input

$0.25/1M

Output

$0.30/1M

Qwen 2.5 7B

Alibaba

text
Parameters

7B

Context

128K

Input

$0.08/1M

Output

$0.08/1M

Mistral Nemo

Mistral

text
Parameters

12B

Context

128K

Input

$0.15/1M

Output

$0.15/1M

Gemma 2 27B

Google

text
Parameters

27B

Context

8K

Input

$0.20/1M

Output

$0.25/1M

Gemma 2 9B

Google

text
Parameters

9B

Context

8K

Input

$0.10/1M

Output

$0.10/1M

DeepSeek Coder V2

DeepSeek

Specialized for code generation and understanding

code
Parameters

236B MoE

Context

128K

Input

$0.60/1M

Output

$0.60/1M

Codestral

Mistral

Optimized for code completion and generation

code
Parameters

22B

Context

32K

Input

$0.25/1M

Output

$0.25/1M

Qwen 2.5 Coder 32B

Alibaba

code
Parameters

32B

Context

128K

Input

$0.30/1M

Output

$0.35/1M

Llama 3.2 Vision 90B

Meta

Vision-language model for image understanding

multimodal text
Parameters

90B

Context

128K

Input

$0.55/1M

Output

$0.55/1M

Llama 3.2 Vision 11B

Meta

multimodal text
Parameters

11B

Context

128K

Input

$0.15/1M

Output

$0.15/1M

Qwen2 VL 72B

Alibaba

Advanced vision-language understanding

multimodal
Parameters

72B

Context

32K

Input

$0.45/1M

Output

$0.50/1M

Pixtral Large

Mistral

Multimodal model for vision and text

multimodal
Parameters

124B

Context

128K

Input

$0.55/1M

Output

$0.55/1M

Whisper Large V3

OpenAI

State-of-the-art speech-to-text

speech
Parameters

1.5B

Context

30s audio

Input

$0.006/min

SeamlessM4T V2

Meta

Multilingual speech translation

speech
Parameters

2.3B

Context

30s audio

Input

$0.008/min

BGE Large EN v1.5

BAAI

High-quality English embeddings

embedding
Parameters

335M

Context

512 tokens

Input

$0.02/1M

BGE M3

BAAI

Multilingual embeddings with long context

embedding
Parameters

568M

Context

8K tokens

Input

$0.03/1M

GTE Qwen2 7B

Alibaba

Large embedding model for retrieval

embedding
Parameters

7B

Context

32K tokens

Input

$0.05/1M

FLUX.1 Schnell

Black Forest Labs

Fast high-quality image generation

image
Parameters

12B

Context

-

Input

$0.003/image

FLUX.1 Dev

Black Forest Labs

Development-optimized image generation

image
Parameters

12B

Context

-

Input

$0.025/image

SDXL Turbo

Stability AI

Real-time image generation

image
Parameters

3.5B

Context

-

Input

$0.002/image

Stable Diffusion 3 Medium

Stability AI

Balanced quality and speed

image
Parameters

2B

Context

-

Input

$0.015/image
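
To make the listed prices concrete, here is a rough cost-estimation sketch in Python. It is not part of any official SDK. The function names `token_cost` and `unit_cost` are hypothetical, and the prices are copied from a few of the cards above. It assumes that "/1M" means per 1M tokens, that speech models are billed per minute of audio, and that image models are billed per generated image.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the listing above.
# Only a few models are included to keep the sketch short.
TOKEN_PRICES = {
    "Llama 3.3 70B": (Decimal("0.35"), Decimal("0.40")),
    "Qwen 2.5 72B": (Decimal("0.40"), Decimal("0.45")),
    "Llama 3.1 8B": (Decimal("0.10"), Decimal("0.10")),
}

# USD per billing unit. Assumption: speech models are billed per minute
# of audio and image models per generated image.
UNIT_PRICES = {
    "Whisper Large V3": Decimal("0.006"),  # per minute
    "FLUX.1 Schnell": Decimal("0.003"),    # per image
}

# Assumption: "/1M" in the listing means "per 1,000,000 tokens".
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def token_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a token-billed workload."""
    input_price, output_price = TOKEN_PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


def unit_cost(model: str, units: int) -> Decimal:
    """Estimate the USD cost of a per-minute or per-image workload."""
    return UNIT_PRICES[model] * units


if __name__ == "__main__":
    # 2M input tokens and 500K output tokens on Llama 3.3 70B
    print(token_cost("Llama 3.3 70B", 2_000_000, 500_000))
    # 90 minutes of audio transcribed with Whisper Large V3
    print(unit_cost("Whisper Large V3", 90))
```

Under these assumptions, 2M input tokens plus 500K output tokens on Llama 3.3 70B works out to $0.70 + $0.20 = $0.90, and 90 minutes of Whisper transcription to $0.54. `Decimal` is used instead of floats so that small per-unit prices add up without rounding drift.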

Need a different model?

We're constantly adding new models based on customer demand. Let us know which models you'd like to see, and we'll prioritize adding them to the platform.

View documentation

Ready to get started?

Request access and start using these models in minutes. No credit card required.