Access our models directly through our API and pay for what you use, or deploy on Amazon SageMaker.
Get free, rate-limited usage for learning and prototyping. Usage is free until you go into production
Power your business with hosted language models.
Features
Default Model
Command
Input
$1.00
/1M tokens
Output
$2.00
/1M tokens
Command Light
Input
$0.30
/1M tokens
Output
$0.60
/1M tokens
Fine-tuned Model
Command Light
Training
$1.00
/1M tokens
Input
$0.30
/1M tokens
Output
$0.60
/1M tokens
We charge differently for input and output tokens. You are charged based on the sum of tokens processed. Only Command Light is available for fine-tuning at this moment. Fine-tuning inference costs will be the same as our base models.
If you require dedicated model instances, dedicated support channels, or custom deployment options, get in touch with our sales team
"Cohere excels at delivering high quality, low-latency language AI models and really supporting them. Having Cohere's team as an extension of ours lets us go 10x faster."
Language models understand “tokens” rather than characters or bytes. The number of tokens per word depends on the complexity of the text. Simple text may approach 1 token per word on average, while complex texts may use less common words that require 3-4 tokens per word on average. For more details on tokens, refer to this page.
Want to try it first?
Create an account and build with Cohere