Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
254.6
TFLOPS
21
GadflyII
GadflyII
Follow
webxos's profile picture
aliez-ren's profile picture
therealhd's profile picture
27 followers
·
1 following
AI & ML interests
None yet
Recent Activity
new
activity
3 days ago
GadflyII/GLM-4.6V-NVFP4:
Well done nvfp4 quant
new
activity
3 days ago
GadflyII/Qwen3-Coder-Next-NVFP4:
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
new
activity
about 1 month ago
GadflyII/GLM-4.7-Flash-MTP-NVFP4:
SGLang and MTP
View all activity
Organizations
GadflyII
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
GadflyII/GLM-4.6V-NVFP4
3 days ago
Well done nvfp4 quant
3
#1 opened 3 months ago by
josephbreda
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
3 days ago
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
🤯
👍
4
5
#5 opened about 2 months ago by
scottgl
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
about 1 month ago
SGLang and MTP
1
#2 opened about 1 month ago by
Michalea
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
about 2 months ago
Model requests?
12
#4 opened 2 months ago by
pathosethoslogos
New activity in
GadflyII/GLM-4.6V-NVFP4
about 2 months ago
Fails on a single DGX spark with errors below
1
#2 opened about 2 months ago by
Adrian1234
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
about 2 months ago
Update MXFP4 format to compressed-tensors
1
#3 opened about 2 months ago by
mgoin
New activity in
lukealonso/MiniMax-M2.5-NVFP4
about 2 months ago
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍
3
17
#1 opened 2 months ago by
zenmagnets
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
2 months ago
MMLU PRO Benchmark
3
#3 opened 2 months ago by
sevapru
vLLM 0.16?
1
#2 opened 2 months ago by
MMaxHugg
Memory
1
#1 opened 2 months ago by
struxx
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
confused response
7
#8 opened 2 months ago by
jiangyizhi
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
3 months ago
MTP quality, 47 layer
3
#7 opened 3 months ago by
Michalea
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
3 months ago
Upload folder using huggingface_hub
#1 opened 3 months ago by
GadflyII
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
3 months ago
Can't deploy by vllm 0.14.1 + transformers
8
#6 opened 3 months ago by
Butterfly-314
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
3 months ago
can not run
4
#1 opened 3 months ago by
aliez-ren
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
3 months ago
please create mlx version of this
3
#4 opened 3 months ago by
Narutoouz
Wasn't able to recreate MMLU-Pro benchmarks
5
#5 opened 3 months ago by
zenmagnets
New activity in
GadflyII/MiniMax-M2.1-NVFP4
3 months ago
Request for GLM 4.6V
3
#1 opened 3 months ago by
SFPLM
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
3 months ago
GadflyII/GLM-4.7-Flash-NVFP4
15
#3 opened 3 months ago by
Yu21342
Really appreciate that you ran performance comparison tests with BF16!
3
#2 opened 3 months ago by
zenmagnets
Load more