- NousResearch/Llama-3.2-1B
  Text Generation • 1B • Updated • 18.8k • 19
- meta-llama/Llama-3.2-1B
  Text Generation • 1B • Updated • 1.3M • 2.37k
- xai-org/grok-1
  Text Generation • Updated • 182 • 2.41k
- unsloth/Meta-Llama-3.1-8B-bnb-4bit
  Text Generation • 8B • Updated • 59.8k • 110

Collections
Collections including paper arxiv:2204.05149
- Vision Arena (Testing VLMs side-by-side)
  562 • Explore AI-powered visual tasks in Vision Arena

- The Carbon Footprint of Machine Learning Training Will Plateau, Then Shrink
  Paper • 2204.05149 • Published • 12
- Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
  Paper • 2409.17146 • Published • 121

- Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
  Paper • 2501.18585 • Published • 61
- RWKV-7 "Goose" with Expressive Dynamic State Evolution
  Paper • 2503.14456 • Published • 154
- DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
  Paper • 2503.15265 • Published • 46
- Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning
  Paper • 2503.15558 • Published • 50