---
library_name: kernels
license: mit
---

This is the repository card of kernels-community/flash-mla that has been pushed on the Hub. It was built to be used with the [`kernels` library](https://github.com/huggingface/kernels). This card was automatically generated.

## How to use

```python
# make sure `kernels` is installed: `pip install -U kernels`
from kernels import get_kernel

kernel_module = get_kernel("kernels-community/flash-mla")
__version__ = kernel_module.__version__

__version__(...)
```

## Available functions
- `__version__`
- `FlashMLASchedMeta`
- `get_mla_metadata`
- `flash_mla_with_kvcache`
- `flash_attn_varlen_func`
- `flash_attn_varlen_qkvpacked_func`
- `flash_attn_varlen_kvpacked_func`
- `flash_mla_sparse_fwd`

## Benchmarks

Benchmarking script is available for this kernel. Run `kernels benchmark kernels-community/flash-mla`.