Hilberg Scaling in Neural Language Models (ICSDS 2025)
Collection
A suite of three GPT autoregressive language models trained on the July 20, 2025 English Wikipedia dump for experiments on entropy scaling. • 4 items • Updated
The present is a 5M-parameter GPT autoregressive language model trained on the July 20, 2025 English Wikipedia dump for experiments on entropy scaling and Hilberg conjecture. For more information on this, you can check here. Dataset available here.
This model is part of the following suite: