john smith

k249

AI & ML interests

None yet

Recent Activity

liked a model about 21 hours ago

McGill-NLP/A3-Qwen3.5-2B

liked a model about 21 hours ago

phxember/Uni-MuMER-Qwen3.5-2B

liked a model about 21 hours ago

schuttdev/hipfire-qwen3.5-2b

Organizations

None yet

upvoted a paper 2 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 134

upvoted 3 collections 9 days ago

GPT-OSS Science (4.2B to 20B)

Specialized GPT-OSS models optimized for scientific reasoning tasks in physics, chemistry, biology, and STEM from GPQA and MMLU science domains. • 29 items • Updated Aug 13, 2025 • 2

GPT-OSS Law (4.2B to 20B)

Legal domain GPT-OSS models excelling at legal reasoning, jurisprudence, and understanding legal frameworks from MMLU legal subjects. • 29 items • Updated Aug 13, 2025 • 3

GPT-OSS Health / Medicine (4.2B to 20B)

Medical domain GPT-OSS models specializing in clinical knowledge, anatomy, medical procedures, & health-related reasoning from MMLU medical subjects. • 29 items • Updated Aug 13, 2025 • 3

upvoted an article 11 days ago

Welcome Gemma 4: Frontier multimodal intelligence on device

Article • 13 days ago • 841

upvoted 2 collections 24 days ago

Falcon-H1-Tiny

A series of extremely small, yet powerful language models redefining capabilities at small scale • 19 items • Updated Mar 2 • 37

Distilled LLms

This is a collection of various distilled llms by Cannae ai! • 9 items • Updated 10 days ago • 1

upvoted a paper 24 days ago

Context Cascade Compression: Exploring the Upper Limits of Text Compression

Paper • 2511.15244 • Published Nov 19, 2025 • 3

upvoted a collection 24 days ago

med

57 items • Updated 24 days ago • 1

upvoted a collection 27 days ago

GraphMind

More in https://arxiv.org/pdf/2507.17168, Graph Reasoning Model series • 4 items • Updated Aug 22, 2025 • 1

upvoted a collection 29 days ago

Olmo Hybrid

6 items • Updated Mar 5 • 24

upvoted 2 collections 2 months ago

GPT Reddit Comment Detection

Collection of datasets and models used for detecting LLM bots on reddit. • 6 items • Updated Mar 3 • 1

Medical Datasets

15 items • Updated about 10 hours ago • 48

upvoted 3 collections 4 months ago

Heretic - Abliterated, Uncensored, Unrestricted POWER.

Models that have been abliterated using the HERETIC method. Done properly, this removes almost all censorship with no damage to the model. • 122 items • Updated 4 days ago • 64

Clara Medical

NVIDIA Clara Open Models for medical imaging AI: segment, generate, and reason across CT, MRI, and X-ray. Built on MONAI by NVIDIA. • 6 items • Updated 8 days ago • 16

Inference Optimized Checkpoints (with Model Optimizer)

A collection of generative models quantized and optimized for inference with Model Optimizer. • 61 items • Updated 8 days ago • 138

upvoted an article 4 months ago

Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement

Article • Dec 3, 2025 • 14

upvoted 3 collections 5 months ago

Qwen 3 / 2.5 Reasoning/Thinking REG + MOEs.

Qwen 3 / 2.5 Reasoning/Thinking models in both regular and MOE configurations, built by me. Source code links are also below. • 69 items • Updated 22 days ago • 12

BioNeMo - Optimize

NVIDIA BioNeMo Models for Optimization • 4 items • Updated 8 days ago • 8

DistilQwen

All DistilQwen models and datasets • 22 items • Updated Oct 9, 2025 • 4