TIPSv2 Collection: TIPSv2 foundational vision-language models. Webpage: https://gdm-tipsv2.github.io/ • 9 items • Updated 3 days ago
MobileLLM-R1 Collection: MobileLLM-R1, a series of sub-billion-parameter reasoning models • 10 items • Updated Nov 21, 2025
MobileLLM Collection: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024). https://arxiv.org/abs/2402.14905 • 49 items • Updated Mar 2
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? • Paper • arXiv:2502.11895 • Published Feb 17, 2025
An Extra RMSNorm is All You Need for Fine Tuning to 1.58 Bits • Paper • arXiv:2505.08823 • Published May 12, 2025
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • Paper • arXiv:2402.17764 • Published Feb 27, 2024
Byte Latent Transformer: Patches Scale Better Than Tokens • Paper • arXiv:2412.09871 • Published Dec 13, 2024
llama.vim Collection: Recommended models for the llama.vim and llama.vscode plugins • 10 items • Updated Dec 16, 2025
story writing favourites Collection: Models I personally liked for generating stories in the past. Not a recommendation; most of these are outdated. • 17 items • Updated Mar 2
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models • Paper • arXiv:2407.12327 • Published Jul 17, 2024