fix: align RotaryEmbedding with Qwen2Moe pattern for transformers compat
#4 opened 3 days ago by kashif
Runnable via dInfer?
👀 1
#3 opened 28 days ago by Muzel
Could you provide an official NVFP4 version?
#2 opened about 1 month ago by win10
Support for mlx-lm and llama.cpp
➕ 2
#1 opened about 1 month ago by Narutoouz