Jianping
depasser
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 hours ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
upvoted a paper 2 months ago
Scaling Embeddings Outperforms Scaling Experts in Language Models
liked a model over 1 year ago
bartowski/MiniCPM-V-2_6-GGUF
Organizations
None yet