1 17 196

lack

Hosseinlack123

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

linoyts/open-image-generation

upvoted a paper about 2 months ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

upvoted an article 2 months ago

Building an AI-powered search engine from scratch

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published Feb 13 • 35

upvoted 3 articles 2 months ago

Article

Building an AI-powered search engine from scratch

Dec 12, 2024

•

Article

Search the Web with AI

Jan 10, 2025

•

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

845

upvoted a paper 2 months ago

Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

Paper • 2601.20088 • Published Jan 27 • 4

upvoted an article 3 months ago

Article

AutoThink: Adaptive Reasoning for Large Language Models

May 27, 2025

•

upvoted a collection 3 months ago

Dataset Mix for Pre-Training SLMs

Collection

11 items • Updated Mar 25, 2025 • 2

upvoted 3 articles 4 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025

•

Article

The Optimal Architecture for Small Language Models

Dec 26, 2025

•

120

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8, 2025

•

769

upvoted a collection 4 months ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated Mar 2 • 150

upvoted a collection 5 months ago

Ministral 3

Collection

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 165

upvoted an article 5 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

•

450

upvoted a paper 5 months ago

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published Feb 19, 2025 • 30

upvoted an article 6 months ago

Article

Releasing Common Corpus: the largest public domain dataset for training LLMs

Mar 20, 2024

•

upvoted a paper 6 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17, 2025 • 46

upvoted an article 7 months ago

Article

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

Aug 31, 2025

•

lack

AI & ML interests

Recent Activity

Organizations

Hosseinlack123's activity

Building an AI-powered search engine from scratch

Search the Web with AI

Uncensor any LLM with abliteration

AutoThink: Adaptive Reasoning for Large Language Models

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

The Optimal Architecture for Small Language Models

SmolLM3: smol, multilingual, long-context reasoner

SmolLM - blazingly fast and remarkably powerful

Releasing Common Corpus: the largest public domain dataset for training LLMs

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders