Model ID:
Model cardllama3.1-8bModel Stats
SPEED
~2200
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
8k tokens
Paid Tiers
32k tokens
MAX OUTPUT
Free Tier
8k tokens
Paid Tiers
8k tokens
Pricing
Input
$0.10 / M tokens
Output
$0.10 / M tokens
Exploration pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.
Rate Limits
| Tier | Requests/min | Input Tokens/min | Daily Tokens |
|---|---|---|---|
| Free | 30 | 60k | 1M |
| Developer | 1K | 1M | N/A |
Endpoints
→
Chat Completions
/v1/chat/completions→
Completions
/v1/completionsCapabilities
✓Streaming
✓Structured Outputs
✓Tool Calling
✓Tool Calling w/ Structured Outputs

