Pricing
Language models
Multiple models, each with different capabilities and price points. You can think of tokens as pieces of words, where 1,000 tokens are about 750 words. This paragraph is 40 tokens.
Use this tool to manually calculate your input tokens.
GPT-4 Turbo
With 128k context, fresher knowledge and the broadest set of capabilities, GPT-4 Turbo is more powerful than GPT-4 and offered at a lower price.
Model | Input | Output |
---|---|---|
gpt-4-0125-preview | BT 0.11 / 1K tokens | BT 0.33 / 1K tokens |
Vision pricing calculator
Name | Value |
---|---|
Price per 1K tokens (fixed) | BT 0.11 |
512 x 512 tiles | 1 x 1 |
Total tiles | 1 |
Base tokens | 85 |
Tile tokens | 170 x 1 = 170 |
Total tokens | 255 |
Total price | BT 0.02805 |
GPT-3.5 Turbo
GPT-3.5 Turbo models are capable and cost-effective.
gpt-3.5-turbo-0125
is the flagship model of this family, supports a 16K context
window and is optimized for dialog. This is the model currenty
being used in our system.
Model | Input | Output |
---|---|---|
gpt-3.5-turbo-0125 | BT 0.0055 / 1K tokens | BT 0.0165 / 1K tokens |
Other models
Image models
Build DALL·E directly into your apps to generate and edit novel images and art. DALL·E 3 is the highest quality model and DALL·E 2 is optimized for lower cost.
Model | Quality | Resolution | Price |
---|---|---|---|
DALL·E 3 | Standard | 1024x1024 | BT 0.44 / image |
Standard | 1024x1792, 1792x1024 | BT 0.88 / image | |
DALL·E 3 | HD | 1024x1024 | BT 0.88 / image |
HD | 1024x1792, 1792x1024 | BT 1.32 / image | |
DALL·E 2 | 1024x1024 | BT 0.23 / image | |
512x512 | BT 0.198 / image | ||
DALL·E 2 | 256x256 | BT 0.176 / image |
Audio models
Whisper can transcribe speech into text and translate many languages into English.
Text-to-speech (TTS) can convert text into spoken audio.
Model | Usage |
---|---|
Whisper | BT 0.066 / minute (rounded to the nearest second) |
TTS | BT 0.165 / 1K characters |
TTS HD | BT 0.33 / 1K characters |
Please be informed that the voice you are currently hearing is generated by artificial intelligence technology for Text-to-Speech synthesis. This voice is not that of a human, but rather a computer-generated voice. We want to ensure transparency and clarity in our communication, and we appreciate your understanding. If you have any questions or concerns, please feel free to reach out to us