Tom
MarjorTom
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
new
activity
12 days ago
deepseek-ai/DeepSeek-V4-Pro:
Question about the activation function suggestion in "Observations and Proposals": why does removing the gate projection relax the EP bandwidth requirement?
liked
a model
12 days ago
deepseek-ai/DeepSeek-V4-Pro
Organizations
None yet
MarjorTom's activity
New activity in
deepseek-ai/DeepSeek-V4-Pro
12 days ago
Question about the activation function suggestion in "Observations and Proposals": why does removing the gate projection relax the EP bandwidth requirement?
2
#126 opened 12 days ago by
MarjorTom