Christoph Holthaus

choltha

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago

Jiunsong/supergemma4-26b-uncensored-gguf-v2

liked a Space 3 days ago

opendatalab/MinerU

liked a model 5 days ago

LuffyTheFox/Qwen3.5-35B-A3B-Uncensored-FernflowerAI-GGUF

Organizations

upvoted a paper 11 days ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 135

upvoted a collection 12 days ago

Gemma 4

Gemma 4 is Google's new model family, including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated 2 days ago • 127

upvoted a collection 13 days ago

Bonsai

1-bit Bonsai models • 6 items • Updated about 10 hours ago • 170

upvoted a collection about 1 month ago

Olmo Hybrid

6 items • Updated Mar 5 • 24

upvoted 3 collections about 2 months ago

Qwen3.5

Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 10 days ago • 144

ColBERT-Zero 🐶

First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 7 days ago • 20

Qwen3.5

21 items • Updated Mar 9 • 1.5k

upvoted 2 collections 2 months ago

RynnBrain

11 items • Updated 1 day ago • 24

LLaDA2.1

3 items • Updated 20 days ago • 24

upvoted 4 collections 3 months ago

Open Coding Agents

13 items • Updated Mar 5 • 52

FLUX.2

Our second generation of FLUX • 21 items • Updated 8 days ago • 187

Tiny-A2D

Small diffusion language models adapted from AR models • 4 items • Updated Dec 6, 2025 • 18

DFlash

Block Diffusion for Flash Speculative Decoding • 13 items • Updated 9 days ago • 55

upvoted an article 4 months ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30, 2025 • 43

upvoted 4 collections 4 months ago

T5Gemma 2

3 items • Updated Mar 12 • 73

LLaDA 2.0

7 items • Updated 20 days ago • 41

GLM-4.6V

3 items • Updated Dec 8, 2025 • 49

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 164

upvoted a collection 5 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 168

upvoted a paper 5 months ago

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19