Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Draw Things
DiffusionBee
JoyFusion
vLLM
Ollama
MLX LM
Docker Model Runner
Lemonade
SGLang
Unsloth
Pi
Inference Providers
Select all
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
Reset Misc
kv-cache
Inference Endpoints
text-generation-inference
Eval Results (legacy)
text-embeddings-inference
4-bit precision
Merge
custom_code
8-bit precision
Mixture of Experts
Carbon Emissions
Eval Results
Apply filters
Models
59
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
kv-cache
Clear all
nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
Updated
Nov 25, 2025
ddddamn/IronCell-Mark-1
Updated
Feb 13
anthonym21/Mistral-7B-v0.3-CoDA-GQA-L
Text Generation
•
7B
•
Updated
Feb 24
•
3.84k
fromthesky/PLDR-LLM-v51-SOC-110M-1
Text Generation
•
0.1B
•
Updated
24 days ago
•
3.06k
fromthesky/PLDR-LLM-v51-SOC-110M-2
Text Generation
•
0.1B
•
Updated
24 days ago
•
3.09k
fromthesky/PLDR-LLM-v51-SOC-110M-3
Text Generation
•
0.1B
•
Updated
24 days ago
•
3.06k
fromthesky/PLDR-LLM-v51-SOC-110M-4
Text Generation
•
0.1B
•
Updated
24 days ago
•
3.08k
fromthesky/PLDR-LLM-v51-SOC-110M-5
Text Generation
•
0.1B
•
Updated
24 days ago
•
3.14k
ElvianElvy/MESA-Qwen2.5-0.5B-KL-Regularized
Text Generation
•
Updated
Mar 12
chinedudave06/musicgen-small-onnx
Text-to-Audio
•
Updated
Mar 18
•
79
chinedudave06/musicgen-small-stereo-onnx
Text-to-Audio
•
Updated
Mar 18
•
73
chinedudave06/musicgen-medium-onnx
Text-to-Audio
•
Updated
Mar 18
•
74
chinedudave06/musicgen-medium-stereo-onnx
Text-to-Audio
•
Updated
Mar 18
•
64
Croc-Prog-HF/LoreWeaver-2-LoRA_PREVIEW
Text Generation
•
Updated
28 days ago
NOT-OMEGA/NanoMind
Text Generation
•
Updated
16 days ago
bmeyer2025/tiny-gpt-shakespeare
Text Generation
•
Updated
18 days ago
•
2.82k
eigengram/eigengram-protocol
Updated
17 days ago
adobeXd/nexus
Text Generation
•
Updated
15 days ago
•
1
Engram-protocol/engram
Feature Extraction
•
Updated
16 days ago
Funkylazer/dm-qwen3.5-27b-b1p-knowledgecorridor
Text Generation
•
27B
•
Updated
12 days ago
•
368
atomicmilkshake/llama-cpp-turboquant-binaries
Updated
10 days ago
majentik/Qwen3.5-27B-TurboQuant-2bit
Text Generation
•
Updated
6 days ago
majentik/Qwen3.5-27B-TurboQuant-MLX-2bit
Text Generation
•
27B
•
Updated
6 days ago
•
532
majentik/Qwen3.5-27B-RotorQuant-2bit
Text Generation
•
Updated
6 days ago
majentik/Qwen3.5-27B-RotorQuant-MLX-2bit
Text Generation
•
27B
•
Updated
6 days ago
•
342
satya007/gemmacut-spectral
Text Generation
•
Updated
6 days ago
zzh618/DASH-KV-Qwen2-7B-Instruct
Updated
about 2 hours ago
zzh618/DASH-KV-Qwen2.5-14B-Instruct
Updated
about 2 hours ago
zzh618/DASH-KV-Llama-3.1-8B-Instruct
Updated
about 2 hours ago
Previous
1
2
Next