view article Article Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs +3 4 days ago • 19
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published 16 days ago • 347
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 11 days ago • 822
MolmoWeb-Data Collection This is the collection of all datasets in MolmoWebMix. • 6 items • Updated 19 days ago • 24
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 6 items • Updated 2 days ago • 24
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation Paper • 2103.06874 • Published Mar 11, 2021 • 3
The MultiBERTs: BERT Reproductions for Robustness Analysis Paper • 2106.16163 • Published Jun 30, 2021 • 1
Well-Read Students Learn Better: On the Importance of Pre-training Compact Models Paper • 1908.08962 • Published Aug 23, 2019 • 1
view article Article Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets 22 days ago • 17