Bochkov commited on
Commit
903ca8e
·
verified ·
1 Parent(s): f680f51

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -1
README.md CHANGED
@@ -90,4 +90,22 @@ This model is a research checkpoint. It is not intended for deployment. It may p
90
 
91
  ## Training data
92
 
93
- The model was trained on the same FineWeb-Edu + Cosmopedia mixture used for the matched comparisons in the paper. Dataset terms and licenses are those of the original datasets.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
90
 
91
  ## Training data
92
 
93
+ The model was trained on the same FineWeb-Edu + Cosmopedia mixture used for the matched comparisons in the paper. Dataset terms and licenses are those of the original datasets.
94
+
95
+ ---
96
+
97
+ ## 🧑‍🔬 Citation & Concept
98
+
99
+ If you use this model or the underlying concepts in your research, please cite our work:
100
+
101
+ ```
102
+ @misc{bochkov2026languagemodelstrainableinput,
103
+ title={Language Models Without a Trainable Input Embedding Table: Learning from Fixed Minimal Binary Token Codes},
104
+ author={A. Bochkov},
105
+ year={2026},
106
+ eprint={2605.09751},
107
+ archivePrefix={arXiv},
108
+ primaryClass={cs.CL},
109
+ url={https://arxiv.org/abs/2605.09751},
110
+ }
111
+ ```