Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
2
3
luchangli03
luchangli03
Follow
0 followers
·
4 following
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
lightseekorg/kimi-k2.5-eagle3
new
activity
about 1 month ago
AQ-MedAI/Kimi-K25-eagle3:
Can you reduce the kv head num of this model? "num_key_value_heads": 64, which requies a lots of kv cache
liked
a model
about 1 month ago
AQ-MedAI/Kimi-K25-eagle3
View all activity
Organizations
luchangli03
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
AQ-MedAI/Kimi-K25-eagle3
about 1 month ago
Can you reduce the kv head num of this model? "num_key_value_heads": 64, which requies a lots of kv cache
2
#1 opened about 1 month ago by
luchangli03
New activity in
jerryzh168/Kimi-K2-Thinking-FP8
2 months ago
Can you provide the code that convert the int4 weight to fp8? thanks
#2 opened 2 months ago by
luchangli03
New activity in
chutesai/DeepSeek-V3.1-Terminus-NextN
5 months ago
what's the difference between this nextn and self contained MTP model in DeepSeek-V3.1-Terminus?
#1 opened 5 months ago by
luchangli03