AI & LLM tools
Utilities for anyone building on top of hosted language-model APIs — counting tokens, estimating spend, and comparing costs across major vendors. Pricing is refreshed periodically against each provider’s public list price.
Token counts are heuristic, not exact. Each major LLM family uses its own tokenizer — OpenAI’s tiktoken, Anthropic’s internal BPE variant, Google’s SentencePiece — and they disagree on the same input by 5–15% depending on language and formatting. The counter here uses a tiktoken-compatible cl100k_baseencoding, which is correct for the GPT-4 / GPT-3.5 family and a close approximation for the rest. Treat the cost estimate as a budgeting floor: add a 10–20% margin for system prompts, retries, and the conversation turns you don’t see until you ship. Output tokens are billed at a higher rate than input tokens for most providers, so a chat workload with a 1:3 input-to-output ratio costs more per turn than a summarization workload with a 10:1 ratio at the same total token volume — model and shape of the call matter more than raw length.
Available tools
Token counter & API cost
Heuristic token count plus per-call cost for GPT, Claude, Gemini, and Llama models.
LLM cost calculator
Per-call and monthly cost across 15 models — OpenAI, Anthropic, Google, Meta — with workload presets and a comparison table.
Context window visualizer
Paste a prompt and see how it fills each LLM's context window. 16 models from GPT-4o (128K) to Gemini 1.5 Pro (2M).