Spaces:

mukunda1729
/

token-counter

Sleeping

Initial: token counter across model families

66448b4 verified 24 days ago

1.06 kB

	---
	title: Token Counter
	emoji: 🔢
	colorFrom: yellow
	colorTo: gray
	sdk: gradio
	sdk_version: "5.49.1"
	python_version: "3.12"
	app_file: app.py
	pinned: false
	license: mit
	short_description: "Count tokens across Claude, GPT, Llama tokenizers."
	tags:
	- tokenization
	- llm
	- context-window
	- agentfit
	---

	# Token Counter

	Paste any text and see how it tokenizes across Claude, GPT, and other model families. Powered by [`agentfit`](https://pypi.org/project/agentfit-py/).

	## Why?

	- Different tokenizers split the same string very differently — Claude often uses ~half the tokens GPT does for the same Chinese / emoji input.
	- Useful when budgeting prompts and deciding which model to use for non-English content.
	- Sanity-check your own token counter against a reference.

	## Related

	- [`agentfit` on PyPI](https://pypi.org/project/agentfit-py/)
	- [The Agent Reliability Stack](https://mukundakatta.github.io/agent-stack/)
	- Companion dataset: [`token-counting-edge-cases`](https://huggingface.co/datasets/mukunda1729/token-counting-edge-cases)