Pricing


Language models

Multiple models are available, each with different capabilities and price points. You can think of tokens as pieces of words, where 1,000 tokens are about 750 words. This paragraph is about 40 tokens.

Use this tool to manually calculate your input tokens.
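When no tokenizer is at hand, the rule of thumb above (1,000 tokens are about 750 words) can be turned into a rough estimate. The sketch below is only an approximation based on whitespace-separated words; the function name estimate_tokens is a hypothetical helper, not part of any API, and real token counts depend on the model's tokenizer.

```python
import math

# Rule of thumb from the text: 1,000 tokens are about 750 words.
WORDS_PER_1K_TOKENS = 750


def estimate_tokens(text: str) -> int:
    """Rough token estimate from a whitespace-separated word count.

    This is an approximation only; a real tokenizer may give a
    noticeably different number, especially for non-English text.
    """
    words = len(text.split())
    return math.ceil(words * 1000 / WORDS_PER_1K_TOKENS)


if __name__ == "__main__":
    sample = "You can think of tokens as pieces of words"
    print(estimate_tokens(sample))
```

For anything billing-related, the tool mentioned above remains the more reliable way to count input tokens.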


GPT-4 Turbo

With 128k context, fresher knowledge and the broadest set of capabilities, GPT-4 Turbo is more powerful than GPT-4 and offered at a lower price.

Learn about GPT-4 Turbo


Model                 Input                  Output
gpt-4-0125-preview    BT 0.11 / 1K tokens    BT 0.33 / 1K tokens

Vision pricing calculator


Name                           Value
Price per 1K tokens (fixed)    BT 0.11
512 x 512 tiles                1 x 1
Total tiles                    1
Base tokens                    85
Tile tokens                    170 x 1 = 170
Total tokens                   255
Total price                    BT 0.02805
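The calculator's arithmetic can be reproduced with a short sketch. It uses only the figures shown in the table (85 base tokens, 170 tokens per 512 x 512 tile, BT 0.11 per 1K tokens) and takes the tile grid as an input; how an image is resized into tiles is not described here, so that step is left out. The function name vision_cost is hypothetical.

```python
from decimal import Decimal

# Figures taken from the calculator table above.
PRICE_PER_1K_TOKENS = Decimal("0.11")  # BT, gpt-4-0125-preview input price
BASE_TOKENS = 85
TOKENS_PER_TILE = 170


def vision_cost(tiles_wide: int, tiles_high: int) -> tuple[int, Decimal]:
    """Return (total tokens, price in BT) for a grid of 512 x 512 tiles."""
    total_tiles = tiles_wide * tiles_high
    total_tokens = BASE_TOKENS + TOKENS_PER_TILE * total_tiles
    price = total_tokens * PRICE_PER_1K_TOKENS / 1000
    return total_tokens, price


if __name__ == "__main__":
    tokens, price = vision_cost(1, 1)
    print(tokens, price)  # the 1 x 1 case from the table
```

For the 1 x 1 grid in the table this gives 85 + 170 = 255 tokens and 255 / 1000 x 0.11 = BT 0.02805, matching the calculator.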

GPT-3.5 Turbo

GPT-3.5 Turbo models are capable and cost-effective.

gpt-3.5-turbo-0125 is the flagship model of this family. It supports a 16K context window and is optimized for dialog. This is the model currently used in our system.

Learn about GPT-3.5 Turbo


Model                 Input                    Output
gpt-3.5-turbo-0125    BT 0.0055 / 1K tokens    BT 0.0165 / 1K tokens
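Because input and output tokens are priced separately, the cost of a single request is the sum of two terms. The following is a minimal sketch using the per-1K prices listed in this document; the PRICES table and the chat_cost function are illustrative names, and the token counts would come from your own usage data.

```python
from decimal import Decimal

# (input price, output price) in BT per 1K tokens, copied from the tables here.
PRICES = {
    "gpt-4-0125-preview": (Decimal("0.11"), Decimal("0.33")),
    "gpt-3.5-turbo-0125": (Decimal("0.0055"), Decimal("0.0165")),
}


def chat_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the price in BT for one request to the given model."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1000


if __name__ == "__main__":
    # Example: 1,000 input tokens and 500 output tokens on each model.
    for name in PRICES:
        print(name, chat_cost(name, 1000, 500))
```

Decimal is used instead of float so that small per-token prices do not pick up binary rounding noise when summed.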

Other models


Image models

Build DALL·E directly into your apps to generate and edit novel images and art. DALL·E 3 is the highest quality model and DALL·E 2 is optimized for lower cost.

Learn about image generation


Model       Quality     Resolution              Price
DALL·E 3    Standard    1024x1024               BT 0.44 / image
DALL·E 3    Standard    1024x1792, 1792x1024    BT 0.88 / image
DALL·E 3    HD          1024x1024               BT 0.88 / image
DALL·E 3    HD          1024x1792, 1792x1024    BT 1.32 / image
DALL·E 2    -           1024x1024               BT 0.23 / image
DALL·E 2    -           512x512                 BT 0.198 / image
DALL·E 2    -           256x256                 BT 0.176 / image

Audio models

Whisper can transcribe speech into text and translate many languages into English.

Text-to-speech (TTS) can convert text into spoken audio.


Model      Usage
Whisper    BT 0.066 / minute (rounded to the nearest second)
TTS        BT 0.165 / 1K characters
TTS HD     BT 0.33 / 1K characters
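The audio prices can be applied in the same way. The sketch below assumes that Whisper billing rounds the duration to the nearest whole second and then charges one sixtieth of the per-minute price per second, and that TTS is billed on the raw character count; both function names are hypothetical.

```python
from decimal import Decimal

# Prices in BT, copied from the table above.
WHISPER_PER_MINUTE = Decimal("0.066")
TTS_PER_1K_CHARS = {
    "TTS": Decimal("0.165"),
    "TTS HD": Decimal("0.33"),
}


def whisper_cost(duration_seconds: float) -> Decimal:
    """Price in BT for transcribing audio of the given length."""
    # Round to the nearest second, as stated in the table.
    # Note: Python's round() sends exact .5 values to the even neighbor.
    seconds = round(duration_seconds)
    return seconds * WHISPER_PER_MINUTE / 60


def tts_cost(text: str, model: str = "TTS") -> Decimal:
    """Price in BT for synthesizing the given text."""
    return len(text) * TTS_PER_1K_CHARS[model] / 1000


if __name__ == "__main__":
    print(whisper_cost(90.4))       # 90 seconds of audio
    print(tts_cost("Hello there"))  # 11 characters at the standard rate
```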

Please note that the voice you are currently hearing is generated by artificial intelligence technology for text-to-speech synthesis. It is a computer-generated voice, not that of a human. We want to ensure transparency and clarity in our communication, and we appreciate your understanding. If you have any questions or concerns, please feel free to reach out to us.