Model ID:
Model cardllama-3.3-70bModel Stats
SPEED
~2100
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
65k tokens
Paid Tiers
128k tokens
MAX OUTPUT
Free Tier
8k tokens
Paid Tiers
65k tokens
Pricing
Input
$0.85 / M tokens
Output
$1.20 / M tokens
Exploration pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.
Rate Limits
| Tier | Requests/min | Input Tokens/min | Daily Tokens |
|---|---|---|---|
| Free | 30 | 60k | 1M |
| Developer | 1K | 1M | N/A |
Endpoints
→
Chat Completions
/v1/chat/completions→
Completions
/v1/completionsCapabilities
✓Streaming
✓Structured Outputs
✓Tool Calling
✓Parallel Tool Calling

