Lewati ke konten
Kembali ke direktori
Logo Cerebras

Cerebras

Inference8 modelcontext maks 131KGratis: 1M token/hari

Catatan dari sumber

Free tier, no credit card. Ultra-fast inference (~2,600 tok/s). 1M tokens/day cap. 8K context cap on free tier. llama3.1-8b scheduled for deprecation May 27, 2026.

Cara claim API key gratis

Langkah umum โ€” detail pastinya ikutin halaman resmi Cerebras.

  1. 1.Buka halaman API key Cerebras โ†—
  2. 2.Daftar akun baru, atau login kalau udah punya.
  3. 3.Generate API key di dashboard / settings.
  4. 4.Pakai API key + Base URL https://api.cerebras.ai/v1 di SDK atau HTTP client.

Model tersedia (8)

Llama 3.1 70B
llama-3-1-70b
Modality
text
Context
131K
Output
โ€”
qwen-3-235b-a22b-instruct-2507
qwen-3-235b-a22b-instruct-2507
Modality
Text
Context
131K (8K on free)
Output
8K
Rate limit
30 RPM, 14,400 RPD, 1M TPD
qwen-3-32b
qwen-3-32b
Modality
Text
Context
131K (8K on free)
Output
8K
Rate limit
30 RPM, 14,400 RPD, 1M TPD
gpt-oss-120b
gpt-oss-120b
Modality
Text
Context
128K (8K on free)
Output
8K
Rate limit
30 RPM, 14,400 RPD, 1M TPD
llama-3.3-70b
llama-3.3-70b
Modality
Text
Context
128K (8K on free)
Output
8K
Rate limit
30 RPM, 14,400 RPD, 1M TPD
llama-4-scout-17b-16e-instruct
llama-4-scout-17b-16e-instruct
Modality
Text + Vision
Context
128K (8K on free)
Output
8K
Rate limit
30 RPM, 14,400 RPD, 1M TPD
zai-glm-4.7
zai-glm-4.7
Modality
Text
Context
128K (8K on free)
Output
8K
Rate limit
10 RPM, 100 RPD, 1M TPD
Llama 3.1 8B
llama-3-1-8b
Modality
โ€”
Context
โ€”
Output
โ€”
Rate limit
30 requests/minute, 60,000 tokens/minute, 900 requests/hour, 1,000,000 tokens/hour, 14,400 requests/day, 1,000,000 tokens/day