1 17 15

Zeb K

baobaoh

zebwithb

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

samuelcardillo/Carnice-MoE-35B-A3B

liked a model 5 days ago

DJLougen/hermes-qwen3.5-35b-a3b-GGUF

liked a model 6 days ago

kai-os/Carnice-27b

View all activity

Organizations

liked a model 3 days ago

samuelcardillo/Carnice-MoE-35B-A3B

36B • Updated 5 days ago • 5

liked a model 5 days ago

DJLougen/hermes-qwen3.5-35b-a3b-GGUF

Text Generation • 35B • Updated 11 days ago • 19.4k • 4

liked a model 6 days ago

kai-os/Carnice-27b

Text Generation • 27B • Updated 6 days ago • 1.62k • 30

upvoted a paper 10 months ago

Continuous Thought Machines

Paper • 2505.05522 • Published May 8, 2025 • 13

liked a model about 1 year ago

google/gemma-3-1b-it

Text Generation • 1.0B • Updated Apr 4, 2025 • 805k • 917

upvoted a paper about 1 year ago

Agentic Knowledgeable Self-awareness

Paper • 2504.03553 • Published Apr 4, 2025 • 27

upvoted a collection about 1 year ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 561

upvoted 5 papers about 1 year ago

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26, 2025 • 47

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published Feb 26, 2025 • 38

upvoted an article about 1 year ago

Article

The Large Language Model Course

Jan 16, 2025

•

226

upvoted 7 papers about 1 year ago

CritiQ: Mining Data Quality Criteria from Human Preferences

Paper • 2502.19279 • Published Feb 26, 2025 • 11

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 154

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13, 2025 • 192

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13, 2025 • 150

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14, 2025 • 127

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20, 2025 • 63

Zeb K

AI & ML interests

Recent Activity

Organizations

baobaoh's activity

The Large Language Model Course