-
Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published • 94 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper • 2404.10667 • Published • 24 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper • 2402.12847 • Published • 26 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper • 2402.09353 • Published • 32
Collections
Discover the best community collections!
Collections including paper arxiv:2404.05829
-
SambaLingo: Teaching Large Language Models New Languages
Paper • 2404.05829 • Published • 13 -
sambanovasystems/SambaLingo-Arabic-Chat
Text Generation • 7B • Updated • 35 • 64 -
sambanovasystems/SambaLingo-Arabic-Base
Text Generation • 7B • Updated • 29 • 37 -
sambanovasystems/SambaLingo-Arabic-Base-70B
Text Generation • 69B • Updated • 17 • 1
-
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 87 -
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages
Paper • 2401.05811 • Published • 9 -
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis
Paper • 2409.20059 • Published • 16 -
Are Character-level Translations Worth the Wait? Comparing Character- and Subword-level Models for Machine Translation
Paper • 2302.14220 • Published
-
Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published • 94 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper • 2404.10667 • Published • 24 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper • 2402.12847 • Published • 26 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper • 2402.09353 • Published • 32
-
SambaLingo: Teaching Large Language Models New Languages
Paper • 2404.05829 • Published • 13 -
sambanovasystems/SambaLingo-Arabic-Chat
Text Generation • 7B • Updated • 35 • 64 -
sambanovasystems/SambaLingo-Arabic-Base
Text Generation • 7B • Updated • 29 • 37 -
sambanovasystems/SambaLingo-Arabic-Base-70B
Text Generation • 69B • Updated • 17 • 1
-
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 87 -
Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages
Paper • 2401.05811 • Published • 9 -
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis
Paper • 2409.20059 • Published • 16 -
Are Character-level Translations Worth the Wait? Comparing Character- and Subword-level Models for Machine Translation
Paper • 2302.14220 • Published