-
Typhoon T1: An Open Thai Reasoning Model
Paper • 2502.09042 • Published • 16 -
typhoon-ai/llama3.2-typhoon2-3b-instruct
Text Generation • 3B • Updated • 715 • 9 -
typhoon-ai/typhoon-t1-3b-sci-fm-iclr-2025-exp-dataset
Viewer • Updated • 167k • 7 -
typhoon-ai/llama3.2-typhoon2-t1-3b-research-preview
Text Generation • 3B • Updated • 36 • 6
Collections
Discover the best community collections!
Collections including paper arxiv:2502.09042
-
typhoon-ai/llama3.2-typhoon2-t1-3b-research-preview
Text Generation • 3B • Updated • 36 • 6 -
typhoon-ai/llama3.1-typhoon2-deepseek-r1-70b-preview
Text Generation • 71B • Updated • 14 • 13 -
typhoon-ai/llama3.2-typhoon2-t1-3b-research-preview-mlx-4bit
Text Generation • 0.5B • Updated • 17 • 1 -
Typhoon T1: An Open Thai Reasoning Model
Paper • 2502.09042 • Published • 16
-
Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published • 94 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper • 2404.10667 • Published • 24 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper • 2402.12847 • Published • 26 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper • 2402.09353 • Published • 32
-
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Paper • 2412.18319 • Published • 39 -
Token-Budget-Aware LLM Reasoning
Paper • 2412.18547 • Published • 46 -
Efficiently Serving LLM Reasoning Programs with Certaindex
Paper • 2412.20993 • Published • 36 -
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Paper • 2412.17256 • Published • 47
-
Typhoon T1: An Open Thai Reasoning Model
Paper • 2502.09042 • Published • 16 -
typhoon-ai/llama3.2-typhoon2-3b-instruct
Text Generation • 3B • Updated • 715 • 9 -
typhoon-ai/typhoon-t1-3b-sci-fm-iclr-2025-exp-dataset
Viewer • Updated • 167k • 7 -
typhoon-ai/llama3.2-typhoon2-t1-3b-research-preview
Text Generation • 3B • Updated • 36 • 6
-
typhoon-ai/llama3.2-typhoon2-t1-3b-research-preview
Text Generation • 3B • Updated • 36 • 6 -
typhoon-ai/llama3.1-typhoon2-deepseek-r1-70b-preview
Text Generation • 71B • Updated • 14 • 13 -
typhoon-ai/llama3.2-typhoon2-t1-3b-research-preview-mlx-4bit
Text Generation • 0.5B • Updated • 17 • 1 -
Typhoon T1: An Open Thai Reasoning Model
Paper • 2502.09042 • Published • 16
-
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Paper • 2412.18319 • Published • 39 -
Token-Budget-Aware LLM Reasoning
Paper • 2412.18547 • Published • 46 -
Efficiently Serving LLM Reasoning Programs with Certaindex
Paper • 2412.20993 • Published • 36 -
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Paper • 2412.17256 • Published • 47
-
Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published • 94 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper • 2404.10667 • Published • 24 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper • 2402.12847 • Published • 26 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper • 2402.09353 • Published • 32