Mattias Dürrmeier
mattduerrmeier
AI & ML interests
LLM Inference, faster and more efficient kernels, local inference
Recent Activity
updated a collection about 8 hours ago
systems upvoted a paper about 8 hours ago
FlashDecoding++: Faster Large Language Model Inference on GPUs updated a collection about 8 hours ago
systemsOrganizations
None yet