YuyangXie
YuyangXie
ยท
AI & ML interests
Edge LLM, quantization, Speculative decoding, inference
Recent Activity
upvoted a paper 8 days ago
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention upvoted a paper about 2 months ago
DFlash: Block Diffusion for Flash Speculative Decoding upvoted an article 3 months ago
The Optimal Architecture for Small Language ModelsOrganizations
None yet