Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
11
35
26
Mengzhao Chen
ChenMnZ
Follow
sosoai's profile picture
21world's profile picture
AdinaY's profile picture
24 followers
·
5 following
https://chenmnz.github.io/
ChenMnZ
AI & ML interests
model compression
Recent Activity
upvoted
a
paper
about 13 hours ago
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression
upvoted
a
paper
about 13 hours ago
FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling
upvoted
a
paper
about 13 hours ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
View all activity
Organizations
None yet
ChenMnZ
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
11 months ago
mlfoundations/scaling
Updated
Mar 15, 2024
•
4
liked
a model
about 1 year ago
nvidia/DeepSeek-R1-NVFP4
Text Generation
•
397B
•
Updated
Jun 6, 2025
•
5.35k
•
274
liked
a dataset
over 1 year ago
mengfn/MATH-APS
Preview
•
Updated
Oct 9, 2024
•
23
•
9
liked
a Space
over 1 year ago
Running
on
Zero
1.04k
CogVideoX-5B
🎥
1.04k
Text-to-Video
liked
2 models
over 1 year ago
ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ
Updated
Aug 6, 2024
•
7
•
25
mistralai/Mistral-Large-Instruct-2407
Updated
Jul 28, 2025
•
6.57k
•
859
liked
a Space
over 1 year ago
Build error
Featured
137
Diffree
🖼
137
liked
2 models
over 1 year ago
OpenGVLab/InternVL2-Llama3-76B
Image-Text-to-Text
•
Updated
Mar 25, 2025
•
217
•
212
deepseek-ai/DeepSeek-V2-Chat-0628
Text Generation
•
236B
•
Updated
Jul 18, 2024
•
2.97k
•
177
liked
3 datasets
almost 2 years ago
Kaining/MMT-Bench
Viewer
•
Updated
Jun 21, 2024
•
30k
•
52
•
10
cais/mmlu
Viewer
•
Updated
Mar 8, 2024
•
231k
•
417k
•
710
togethercomputer/RedPajama-Data-V2
Updated
Nov 21, 2024
•
4.23k
•
400
liked
3 models
almost 2 years ago
catid/cat-llama-3-8b-instruct-aqlm
Text Generation
•
3B
•
Updated
Apr 21, 2024
•
12
•
6
mobiuslabsgmbh/Llama-2-7b-chat-hf_1bitgs8_hqq
Text Generation
•
Updated
Feb 5, 2025
•
19
•
74
1bitLLM/bitnet_b1_58-3B
Text Generation
•
3B
•
Updated
Mar 29, 2024
•
1.17k
•
262
liked
a dataset
almost 2 years ago
allenai/dolma
Updated
Apr 17, 2024
•
2.94k
•
1.01k
liked
a model
almost 2 years ago
NousResearch/OLMo-Bitnet-1B
Text Generation
•
Updated
Apr 11, 2024
•
585
•
120
liked
a dataset
about 2 years ago
ILSVRC/imagenet-1k
Viewer
•
Updated
Sep 17, 2025
•
1.43M
•
117k
•
768
liked
a model
about 2 years ago
hanguo/lq-lora
Updated
Aug 1, 2024
•
4
liked
a Space
over 2 years ago
Running
4.84k
Arena Leaderboard
🏆
4.84k
View the LMArena language model leaderboard
Load more