Start for free
Begin with $25 in free credits to explore our models via the Playground.
Simple API call
Switch to inference by changing a single line of code. Start saving in 5 minutes.
Pay as you go
Only pay for what you use. Set limits and monitor usage via our dashboards.
TEXT TO TEXT
Prices shown are per 1 million tokens
Model | Quantization | Input | Output |
---|---|---|---|
DeepSeek R1 | FP8 | $0.75 | $3.00 |
DeepSeek R1 Distill Llama 70B | FP8 | $0.40 | $0.40 |
DeepSeek V3 | FP8 | $0.40 | $1.20 |
DeepSeek V3 0324 | FP8 | $0.75 | $1.50 |
Google Gemma 3 | BF16 | $0.30 | $0.40 |
Llama 3.1 70B Instruct | FP16 | $0.30 | $0.40 |
Llama 3.1 8B Instruct | FP16 | $0.03 | $0.03 |
FP8 | $0.025 | $0.025 | |
Llama 3.2 11B Vision Instruct | FP16 | $0.055 | $0.055 |
Llama 3.2 1B Instruct | FP16 | $0.01 | $0.01 |
Llama 3.2 3B Instruct | FP16 | $0.02 | $0.02 |
Llama 3.3 70B Instruct | FP16 | $0.30 | $0.40 |
Mistral Nemo 12B Instruct | FP8 | $0.038 | $0.10 |
Qwen 2.5 7B Vision Instruct | BF16 | $0.20 | $0.20 |