nubbury
updated
StarCoder 2 and The Stack v2: The Next Generation
Paper
• 2402.19173
• Published • 156
Griffin: Mixing Gated Linear Recurrences with Local Attention for
Efficient Language Models
Paper
• 2402.19427
• Published • 57
Simple linear attention language models balance the recall-throughput
tradeoff
Paper
• 2402.18668
• Published • 20
Priority Sampling of Large Language Models for Compilers
Paper
• 2402.18734
• Published • 19
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
• 2402.17764
• Published • 628
When Scaling Meets LLM Finetuning: The Effect of Data, Model and
Finetuning Method
Paper
• 2402.17193
• Published • 26
Towards Optimal Learning of Language Models
Paper
• 2402.17759
• Published • 18
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in
Text-to-Image Generation
Paper
• 2402.17245
• Published • 11
Disentangled 3D Scene Generation with Layout Learning
Paper
• 2402.16936
• Published • 11
VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction
Paper
• 2402.17427
• Published • 10