shikhar-srivastava 's Collections

Tokenizer Study (LLaMA 130M)

Correlating tokenizer properties on pre-trained LLMs with their downstream performance.