Qwopus3.5-v3 Collection 🌟Qwopus3.5-v3 is the latest model in the Claude series. • 12 items • Updated 4 days ago • 73
JANG Quantized - GGUF for MLX Collection MLX models at full speed, GGUF quality. MiniMax M2.7: 88-95.5% MMLU. Requires MLX Studio. @dealignai • 26 items • Updated 1 day ago • 6
High Quality Uncensored - GGUF on MLX Collection These are the empirically proven highest quality uncensored models on MLX. • 19 items • Updated 10 days ago • 14
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 264