Embedding Cost Calculator
Estimate embedding API costs for OpenAI, Cohere, Google, Voyage, Jina, Mistral, and Anthropic. Toggle batch discounts and include RAG query costs for a complete picture.
Last updated: 2026-06-13
Best balance of cost and performance for most RAG and search use cases.
Estimated embedding cost
for 665,000 tokens
Calculation
Model comparison
| Model | Provider | Dimensions | Context | Effective price | Cost |
|---|---|---|---|---|---|
| text-embedding-3-small selected | OpenAI | 1,536 | 8,191 | $0.02/1M | $0.01 |
| jina-embeddings-v3 | Jina AI | 1,024 | 8,192 | $0.02/1M | $0.01 |
| voyage-3-lite | Voyage AI | 512 | 32,000 | $0.03/1M | $0.02 |
| text-embedding-ada-002 | OpenAI | 1,536 | 8,191 | $0.10/1M | $0.07 |
| embed-english-v3 | Cohere | 1,024 | 512 | $0.10/1M | $0.07 |
| embed-multilingual-v3 | Cohere | 1,024 | 512 | $0.10/1M | $0.07 |
| text-embedding-004 | 768 | 2,048 | $0.10/1M | $0.07 | |
| text-multilingual-embedding-002 | 768 | 2,048 | $0.10/1M | $0.07 | |
| voyage-3 | Voyage AI | 1,024 | 32,000 | $0.10/1M | $0.07 |
| mistral-embed | Mistral | 1,024 | 8,092 | $0.10/1M | $0.07 |
| Titan Embeddings V2 | Anthropic (AWS Bedrock) | 1,024 | 8,192 | $0.10/1M | $0.07 |
| text-embedding-3-large | OpenAI | 3,072 | 8,191 | $0.13/1M | $0.09 |
How embedding pricing works
Pay per token, not per document
Embedding APIs charge by the number of tokens you send, not the number of documents. A 500-word document is roughly 665 tokens. If you embed 10,000 such documents, you send about 6.65 million tokens to the API.
Batch API discounts
OpenAI and several other providers offer a batch API that processes jobs asynchronously for a significant discount — often 50%. Use this calculator to see how much you can save if your embedding workload is not real-time.
RAG adds query costs
Retrieval-augmented generation has two cost components: the one-time embedding of your knowledge base, and the recurring cost of embedding each user query plus the LLM generation call. This calculator lets you estimate both.
Compare before you commit
Prices and context windows vary. A cheaper model may truncate long documents, while a more expensive model may deliver better retrieval accuracy. Use the comparison table to balance cost, context length, and output dimensions for your use case.