Skip to main content
Model ID: llama-3.3-70b
Model card

Model Stats

SPEED
~2100
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
65k tokens
Paid Tiers
128k tokens
MAX OUTPUT
Free Tier
8k tokens
Paid Tiers
65k tokens

Pricing

Input
$0.85 / M tokens
Output
$1.20 / M tokens
Exploration pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.

Rate Limits

TierRequests/minInput Tokens/minDaily Tokens
Free3060k1M
Developer1K1MN/A

Endpoints

Chat Completions
/v1/chat/completions
Completions
/v1/completions

Capabilities

Streaming
Structured Outputs
Tool Calling
Parallel Tool Calling

Need Higher Limits?

Reach out for custom pricing with our Enterprise tier for higher rate limits and dedicated support.