view article Article Reverse Engineering a $500M Mystery: From HashHop to Memory-Augmented Language Models Jan 23 ⢠10
view article Article Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement Dec 3, 2025 ⢠14
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 ⢠177
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. ⢠37 items ⢠Updated 1 day ago ⢠267
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper ⢠2505.04588 ⢠Published May 7, 2025 ⢠65
ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset Paper ⢠2405.10004 ⢠Published May 16, 2024 ⢠1
Quantifying the Carbon Emissions of Machine Learning Paper ⢠1910.09700 ⢠Published Oct 21, 2019 ⢠41
MMMModal -- Multi-Images Multi-Audio Multi-turn Multi-Modal Paper ⢠2402.11297 ⢠Published Feb 17, 2024 ⢠2