Model Category | Cost per 1M tokens |
---|---|
8B and smaller | $0.48 |
14B models | $1.50 |
32B models | $1.90 |
70B+ models | $2.90 |
Model | Input (per 1M tokens) | Output (per 1M tokens) |
---|---|---|
Llama 3.1 8B Instruct | $0.30 | $0.45 |
Qwen 2.5 14B Instruct | $1.00 | $1.50 |
Llama 3.1 70B Instruct | $1.80 | $2.00 |
Model | Rate per CU Hour |
---|---|
Llama 3.1 8B | $1.50 |
Mistral Nemo 12B | $1.50 |
Qwen 2.5 32B Coder | $6.00 |
Qwen 2.5 72B | $12.00 |
Llama 3.1 70B | $12.00 |