Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 134
GPT-OSS Science (4.2B to 20B) Collection Specialized GPT-OSS models optimized for scientific reasoning tasks in physics, chemistry, biology, and STEM from GPQA and MMLU science domains. • 29 items • Updated Aug 13, 2025 • 2
GPT-OSS Law (4.2B to 20B) Collection Legal domain GPT-OSS models excelling at legal reasoning, jurisprudence, and understanding legal frameworks from MMLU legal subjects. • 29 items • Updated Aug 13, 2025 • 3
GPT-OSS Health / Medicine (4.2B to 20B) Collection Medical domain GPT-OSS models specializing in clinical knowledge, anatomy, medical procedures, & health-related reasoning from MMLU medical subjects. • 29 items • Updated Aug 13, 2025 • 3
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 13 days ago • 841
Falcon-H1-Tiny Collection A series of extremely small, yet powerful language models redefining capabilities at small scale • 19 items • Updated Mar 2 • 37
Distilled LLms Collection This is a collection of various distilled LLMs by Cannae ai! • 9 items • Updated 10 days ago • 1
Context Cascade Compression: Exploring the Upper Limits of Text Compression Paper • 2511.15244 • Published Nov 19, 2025 • 3
GraphMind Collection More in https://arxiv.org/pdf/2507.17168, Graph Reasoning Model series • 4 items • Updated Aug 22, 2025 • 1
GPT Reddit Comment Detection Collection Collection of datasets and models used for detecting LLM bots on reddit. • 6 items • Updated Mar 3 • 1
Heretic - Abliterated, Uncensored, Unrestricted POWER. Collection Models that have been abliterated using the HERETIC method. Done properly, this removes almost all censorship with no damage to the model. • 122 items • Updated 4 days ago • 64
Clara Medical Collection NVIDIA Clara Open Models for medical imaging AI: segment, generate, and reason across CT, MRI, and X-ray. Built on MONAI by NVIDIA. • 6 items • Updated 8 days ago • 16
Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 61 items • Updated 8 days ago • 138
Article Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement Dec 3, 2025 • 14
Qwen 3 / 2.5 Reasoning/Thinking REG + MOEs. Collection Qwen 3 / 2.5 Reasoning/Thinking models in both regular and MOE configurations, built by me. Source code links are also below. • 69 items • Updated 22 days ago • 12
BioNeMo - Optimize Collection NVIDIA BioNeMo Models for Optimization • 4 items • Updated 8 days ago • 8