Hi everyone!
Does anyone know which tokenizer (and library) we should use to count tokens exactly the way the Perplexity models do?
I’m integrating Perplexity’s Llama 3.1 models (small/large/huge online and small/large chat) into our product. We need to count input tokens exactly for user quota management, but I couldn’t find any documentation on which tokenizer these models use. OpenAI models, for example, use tiktoken.
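For context, here’s roughly what we do for OpenAI today and what we’re hoping to do for the Perplexity models. This is only a sketch, assuming (unconfirmed) that the Perplexity models tokenize like the stock Llama 3.1 tokenizer on Hugging Face; the `meta-llama/Llama-3.1-8B-Instruct` repo name is just a placeholder (and gated, so it needs access approval):

```python
import tiktoken
from transformers import AutoTokenizer

def count_openai_tokens(text: str, model: str = "gpt-4o") -> int:
    # tiktoken maps OpenAI model names to their encodings
    enc = tiktoken.encoding_for_model(model)
    return len(enc.encode(text))

def count_llama31_tokens(text: str) -> int:
    # Assumption: the Perplexity Llama 3.1 models use the standard
    # Llama 3.1 tokenizer -- this is exactly what we'd like confirmed.
    tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.1-8B-Instruct")
    return len(tokenizer.encode(text, add_special_tokens=False))

print(count_openai_tokens("Hello, world!"))
print(count_llama31_tokens("Hello, world!"))
```

If the online models add system prompts or special tokens server-side, our counts would still drift from what’s billed, so an official answer (or a usage field in the API response we should rely on instead) would be great.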
Thanks!