Models (11)
Active filters: offline-rl

jakegrigsby/metamon

Reinforcement Learning • Updated Mar 4 • 4

chenyuwang1/test

Updated Nov 16, 2025

golfoscar/qwen2.5-0.5b-math-oreo

Text Generation • 0.5B • Updated Dec 14, 2025 • 2

debaterhub/debate-grpo-iter2-groupA

Updated Jan 22 • 2

code3939/DecisionTransformer-Unity-Sim

Reinforcement Learning • Updated Jan 29

Ason-jay/fetch-lift-td3-bc

Robotics • Updated Feb 12 • 1

Ason-jay/fetch-lift-iql-tau07

Robotics • Updated Feb 12

Ason-jay/fetch-lift-iql-tau09

Robotics • Updated Feb 12 • 2

jangwon-kim-cocel/Bayesian-Policy-Distillation

Reinforcement Learning • Updated Feb 15 • 1

Camais03/camie-crafter

Reinforcement Learning • Updated 18 days ago • 221 • 4

BAIBHAV1234/Sepsis-OpenEnv

Updated 10 days ago