fix: align RotaryEmbedding with Qwen2Moe pattern for transformers compat
#4 opened 28 days ago
by
kashif
Runnable via dInfer?
👀 1
#3 opened about 2 months ago
by
Muzel
Could you provide the official NVFP4 version? Dear friend.
#2 opened about 2 months ago
by
win10
Support for mlx lm and llama.cpp
➕ 3
#1 opened 2 months ago
by
Narutoouz