409 638

gn00029914

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago

mlx-community/gemma-4-31b-it-mxfp8

liked a model about 4 hours ago

mlx-community/gemma-4-31b-it-8bit

liked a model about 8 hours ago

majentik/gemma-4-E4B-turboquant

View all activity

Organizations

upvoted a paper 8 days ago

HellaSwag: Can a Machine Really Finish Your Sentence?

Paper • 1905.07830 • Published May 19, 2019 • 7

upvoted a collection 9 days ago

Gemma 4

Collection

6 items • Updated 3 days ago • 2

upvoted 3 papers 9 days ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 161

RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search

Paper • 2405.12497 • Published May 21, 2024 • 1

Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search

Paper • 2409.09913 • Published Sep 16, 2024 • 1

upvoted a collection 10 days ago

Gemma 4

Collection

8 items • Updated 10 days ago • 573

upvoted an article 10 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

11 days ago

•

822

upvoted 4 papers 10 days ago

A decoder-only foundation model for time-series forecasting

Paper • 2310.10688 • Published Oct 14, 2023 • 24

GLU Variants Improve Transformer

Paper • 2002.05202 • Published Feb 12, 2020 • 5

APTx: better activation function than MISH, SWISH, and ReLU's variants used in deep learning

Paper • 2209.06119 • Published Sep 10, 2022 • 2

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 120

upvoted a paper 11 days ago

Meta-Harness: End-to-End Optimization of Model Harnesses

Paper • 2603.28052 • Published 14 days ago • 16

upvoted 2 articles 13 days ago

Article

Liberate your OpenClaw

17 days ago

•

Article

Build a Domain-Specific Embedding Model in Under a Day

23 days ago

•

upvoted 4 papers 14 days ago

RULER: What's the Real Context Size of Your Long-Context Language Models?

Paper • 2404.06654 • Published Apr 9, 2024 • 40

MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens

Paper • 2603.23516 • Published Mar 6 • 47

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 31

It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

Paper • 2504.13173 • Published Apr 17, 2025 • 20

upvoted 2 papers 15 days ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published 20 days ago • 134

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 134

gn00029914

AI & ML interests

Recent Activity

Organizations

gn00029914's activity

Welcome Gemma 4: Frontier multimodal intelligence on device

Liberate your OpenClaw

Build a Domain-Specific Embedding Model in Under a Day