
Cerebras
Inference8 modelcontext maks 131KGratis: 1M token/hari
Catatan dari sumber
Free tier, no credit card. Ultra-fast inference (~2,600 tok/s). 1M tokens/day cap. 8K context cap on free tier. llama3.1-8b scheduled for deprecation May 27, 2026.
Cara claim API key gratis
Langkah umum โ detail pastinya ikutin halaman resmi Cerebras.
- 1.Buka halaman API key Cerebras โ
- 2.Daftar akun baru, atau login kalau udah punya.
- 3.Generate API key di dashboard / settings.
- 4.Pakai API key + Base URL
https://api.cerebras.ai/v1di SDK atau HTTP client.
Model tersedia (8)
Llama 3.1 70B
llama-3-1-70b
- Modality
- text
- Context
- 131K
- Output
- โ
qwen-3-235b-a22b-instruct-2507
qwen-3-235b-a22b-instruct-2507
- Modality
- Text
- Context
- 131K (8K on free)
- Output
- 8K
- Rate limit
- 30 RPM, 14,400 RPD, 1M TPD
qwen-3-32b
qwen-3-32b
- Modality
- Text
- Context
- 131K (8K on free)
- Output
- 8K
- Rate limit
- 30 RPM, 14,400 RPD, 1M TPD
gpt-oss-120b
gpt-oss-120b
- Modality
- Text
- Context
- 128K (8K on free)
- Output
- 8K
- Rate limit
- 30 RPM, 14,400 RPD, 1M TPD
llama-3.3-70b
llama-3.3-70b
- Modality
- Text
- Context
- 128K (8K on free)
- Output
- 8K
- Rate limit
- 30 RPM, 14,400 RPD, 1M TPD
llama-4-scout-17b-16e-instruct
llama-4-scout-17b-16e-instruct
- Modality
- Text + Vision
- Context
- 128K (8K on free)
- Output
- 8K
- Rate limit
- 30 RPM, 14,400 RPD, 1M TPD
zai-glm-4.7
zai-glm-4.7
- Modality
- Text
- Context
- 128K (8K on free)
- Output
- 8K
- Rate limit
- 10 RPM, 100 RPD, 1M TPD
Llama 3.1 8B
llama-3-1-8b
- Modality
- โ
- Context
- โ
- Output
- โ
- Rate limit
- 30 requests/minute, 60,000 tokens/minute, 900 requests/hour, 1,000,000 tokens/hour, 14,400 requests/day, 1,000,000 tokens/day
| Model | Modality | Context | Output | Rate limit |
|---|---|---|---|---|
Llama 3.1 70B llama-3-1-70b | text | 131K | โ | โ |
qwen-3-235b-a22b-instruct-2507 qwen-3-235b-a22b-instruct-2507 | Text | 131K (8K on free) | 8K | 30 RPM, 14,400 RPD, 1M TPD |
qwen-3-32b qwen-3-32b | Text | 131K (8K on free) | 8K | 30 RPM, 14,400 RPD, 1M TPD |
gpt-oss-120b gpt-oss-120b | Text | 128K (8K on free) | 8K | 30 RPM, 14,400 RPD, 1M TPD |
llama-3.3-70b llama-3.3-70b | Text | 128K (8K on free) | 8K | 30 RPM, 14,400 RPD, 1M TPD |
llama-4-scout-17b-16e-instruct llama-4-scout-17b-16e-instruct | Text + Vision | 128K (8K on free) | 8K | 30 RPM, 14,400 RPD, 1M TPD |
zai-glm-4.7 zai-glm-4.7 | Text | 128K (8K on free) | 8K | 10 RPM, 100 RPD, 1M TPD |
Llama 3.1 8B llama-3-1-8b | โ | โ | โ | 30 requests/minute, 60,000 tokens/minute, 900 requests/hour, 1,000,000 tokens/hour, 14,400 requests/day, 1,000,000 tokens/day |