Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Draw Things
DiffusionBee
JoyFusion
vLLM
Ollama
MLX LM
Docker Model Runner
Lemonade
SGLang
Unsloth
Pi
Inference Providers
Select all
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
Reset Misc
NeelNanda/pile-10k
Inference Endpoints
text-generation-inference
Eval Results (legacy)
text-embeddings-inference
4-bit precision
Merge
custom_code
8-bit precision
Mixture of Experts
Carbon Emissions
Eval Results
Apply filters
Models
110
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
NeelNanda/pile-10k
Clear all
INC4AI/phi-2-int4-inc
Text Generation
•
0.6B
•
Updated
Oct 22, 2024
•
22
•
4
INC4AI/gemma-2b-int4-inc
Text Generation
•
3B
•
Updated
Aug 26, 2024
•
18
•
1
INC4AI/falcon-7b-sq-int8-inc
Text Generation
•
Updated
Apr 17, 2024
•
16
Intel/Phi-3-mini-4k-instruct-int4-inc
Updated
Jul 4, 2024
•
5
Intel/Baichuan2-13B-Chat-int4-inc
Updated
Jul 4, 2024
•
1
Intel/SOLAR-10.7B-Instruct-v1.0-int4-inc
Updated
Jul 4, 2024
•
1
Intel/opt-1.3b-int4-inc-recipe
Updated
Nov 6, 2024
•
1
Intel/Phi-3-mini-128k-instruct-int4-inc-recipe
Updated
Nov 8, 2024
•
1
INC4AI/Mistral-7B-v0.1-int4-inc-lmhead
Text Generation
•
1B
•
Updated
May 29, 2024
•
9
•
1
Fizzarolli/phi3-4x4b-v1
Text Generation
•
11B
•
Updated
Jun 4, 2024
•
8
•
1
bartowski/phi3-4x4b-v1-GGUF
Text Generation
•
11B
•
Updated
Jun 3, 2024
•
123
INC4AI/Qwen2-0.5B-Instuct-int4-inc
Text Generation
•
0.6B
•
Updated
Jun 6, 2024
•
8
•
1
INC4AI/Qwen2-1.5B-Instuct-int4-inc
Text Generation
•
2B
•
Updated
Jun 6, 2024
•
9
•
3
INC4AI/Qwen2-7B-int4-inc
Text Generation
•
2B
•
Updated
Oct 24, 2024
•
7
•
6
Intel/Qwen2.5-0.5B-Instruct-int4-inc
Updated
Oct 10, 2024
•
2
Intel/Qwen2.5-1.5B-Instruct-int4-inc
Updated
Oct 10, 2024
•
2
mradermacher/phi3-4x4b-v1-GGUF
11B
•
Updated
Nov 15, 2024
•
42
mradermacher/phi3-4x4b-v1-i1-GGUF
11B
•
Updated
Nov 15, 2024
•
1.36k
OPEA/Meta-Llama-3.1-70B-Instruct-int4-asym-inc
11B
•
Updated
Apr 30, 2025
•
3
•
1
OPEA/Qwen2.5-32B-Instruct-int4-sym-mixed-inc
6B
•
Updated
Apr 30, 2025
•
12
•
1
OPEA/Qwen2.5-14B-Instruct-int4-sym-inc
3B
•
Updated
Apr 30, 2025
•
2
OPEA/Meta-Llama-3.1-8B-Instruct-int4-sym-inc
2B
•
Updated
Jun 5, 2025
•
69
OPEA/Qwen2-VL-7B-Instruct-int4-sym-inc
3B
•
Updated
Jun 5, 2025
•
9
•
1
OPEA/Phi-3.5-vision-instruct-int4-sym-inc
Updated
Apr 30, 2025
•
9
OPEA/Qwen2.5-7B-Instruct-int4-sym-inc
2B
•
Updated
Apr 30, 2025
•
19
•
1
OPEA/Llama-3.2-11B-Vision-Instruct-int4-sym-inc
3B
•
Updated
Jun 5, 2025
•
33
•
2
OPEA/llava-v1.5-7b-int4-sym-inc
1B
•
Updated
Jul 18, 2025
•
17
•
1
OPEA/cogvlm2-llama3-chat-19B-int4-sym-inc
20B
•
Updated
Jul 18, 2025
•
3
OPEA/Qwen2.5-72B-Instruct-int4-sym-inc
12B
•
Updated
Apr 30, 2025
•
8
•
1
OPEA/Qwen2-7B-int4-sym-inc
8B
•
Updated
Apr 30, 2025
•
6
Previous
1
2
3
4
Next