Running 4 Distilling 100B+ Models 40x Faster with TRL π 4 Read and download a research article on model distillation
arcee-ai/Trinity-Large-Thinking Text Generation β’ 399B β’ Updated 3 days ago β’ 14.3k β’ β’ 149
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation β’ 124B β’ Updated 2 days ago β’ 463k β’ 326
Running on CPU Upgrade 219 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens π 219 Explore synthetic data experiments on a virtual bookshelf