Tokenizer index
Byte-pair encoding vocabularies with token counts, special tokens, and per-token lookup.
OpenAI
Mistral
Alibaba
DeepSeek