Mattias Dürrmeier

mattduerrmeier

AI & ML interests

LLM Inference, faster and more efficient kernels, local inference

Recent Activity

updated a collection about 4 hours ago
systems
upvoted a paper about 4 hours ago
FlashDecoding++: Faster Large Language Model Inference on GPUs
updated a collection about 4 hours ago
systems

Organizations

None yet

Collections 1

systems
  • CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

    Paper • 2602.24286 • Published Feb 27 • 98
  • FlashFormer: Whole-Model Kernels for Efficient Low-Batch Inference

    Paper • 2505.22758 • Published May 28, 2025 • 1
  • Liger Kernel: Efficient Triton Kernels for LLM Training

    Paper • 2410.10989 • Published Oct 14, 2024 • 3
  • FlashDecoding++: Faster Large Language Model Inference on GPUs

    Paper • 2311.01282 • Published Nov 2, 2023 • 38

models 0

None public yet

datasets 0

None public yet