Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2507.20534

Papers reimplemented

List of research papers, architectures, and techniques reimplemented in LLM-quest or Hugging Face's TRL. Missing: Qwen3.5, Qwen3-Next, GPT-2

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220
Reinforced Attention Learning

Paper • 2602.04884 • Published Feb 4 • 29
Learning to Reason in 13 Parameters

Paper • 2602.04118 • Published Feb 4 • 6
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters

Paper • 2405.17604 • Published May 27, 2024 • 3

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14
litert-community/Gemma3-1B-IT

Text Generation • Updated Jan 9 • 18.8k • 572

google/gemma-3n-E2B-it-litert-lm

Text Generation • Updated Dec 8, 2025 • 5.98k • 403
Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 89k • • 1.7k
moonshotai/Kimi-K2-Instruct-0905

Text Generation • 1T • Updated Jan 30 • 389k • • 698
moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated Jan 30 • 257k • • 2.35k
moonshotai/Kimi-K2-Base

Text Generation • Updated Jan 30 • 8.75k • 298

Papers reimplemented

List of research papers, architectures, and techniques reimplemented in LLM-quest or Hugging Face's TRL. Missing: Qwen3.5, Qwen3-Next, GPT-2

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220
Reinforced Attention Learning

Paper • 2602.04884 • Published Feb 4 • 29
Learning to Reason in 13 Parameters

Paper • 2602.04118 • Published Feb 4 • 6
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters

Paper • 2405.17604 • Published May 27, 2024 • 3

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14
litert-community/Gemma3-1B-IT

Text Generation • Updated Jan 9 • 18.8k • 572

Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14

google/gemma-3n-E2B-it-litert-lm

Text Generation • Updated Dec 8, 2025 • 5.98k • 403
Kimi K2: Open Agentic Intelligence

Paper • 2507.20534 • Published Jul 28, 2025 • 14

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence

moonshotai/Kimi-K2-Thinking

Text Generation • 1.1T • Updated Jan 30 • 89k • • 1.7k
moonshotai/Kimi-K2-Instruct-0905

Text Generation • 1T • Updated Jan 30 • 389k • • 698
moonshotai/Kimi-K2-Instruct

Text Generation • 1T • Updated Jan 30 • 257k • • 2.35k
moonshotai/Kimi-K2-Base

Text Generation • Updated Jan 30 • 8.75k • 298

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs