Ngai Wong
samnwong
AI & ML interests
compact modeling
Recent Activity
upvoted a paper about 12 hours ago
OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond upvoted a paper about 1 month ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and MitigationOrganizations
None yet