open-sci-ref 0.02 (Collection)
Updated open-sci-ref baselines. Re-training without dropout. Re-training on DCLM, FineWeb-Edu, Nemotron, HPLT-2, and Pile. Further reference datasets included.
3 items • Updated Feb 2