Tokens vs Words — How Many Tokens per Word
Word count feels intuitive, but models bill tokens. This shows tokens per word on a real article so you can convert between the two with eyes open.
Overview
English runs roughly three tokens for every four words, but "roughly" carries a lot of weight: punctuation, rare words, and formatting all push the ratio around. This loads an article-length text and reports the tokens-per-word figure directly, so a word count you already have becomes a token estimate you can trust within a range. Words are a writer's unit; tokens are the model's — this is the bridge between them.
Workflow
-
Count a real text
Words and tokens reported together on an article.
-
Read tokens per word
English sits near 4 words to 3 tokens — with drift.
-
Convert your own counts
Apply the ratio to a word count you already have.
Why This Works
- A direct tokens-per-word figure turns word counts into token estimates
- Real text shows the ratio drifting with punctuation and rare words
- The range respects that the conversion is approximate, not exact
Best for
- Converting a known word count to tokens
- Estimating from a word-count limit
- Understanding the words-to-tokens ratio
Not for
- Counting words for an essay or post limit — use the Character Counter
- Context-window fit decisions — use the Context Window Estimator
Use cases
- Converting a known word count to tokens
- Estimating from a word-count limit
- Understanding the words-to-tokens ratio