Small-reasoning-models HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.07M • 930 Qwen/Qwen3-4B-Base Text Generation • 4B • Updated Jul 26, 2025 • 1.01M • 84
Fine-tuning Text-to-LoRA: Instant Transformer Adaption Paper • 2506.06105 • Published Jun 6, 2025 • 4 Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published Jun 19, 2025 • 133 ChatDOC/OCRFlux-3B Image-Text-to-Text • Updated Jul 9, 2025 • 2.81k • 367
AI Labs - FT Datasets Datasets that we want to use for experimenting with fine-tuning Salesforce/ReasoningJudgeBench Viewer • Updated Jun 7, 2025 • 1.48k • 41 • 5 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 395k • 325 bobox/OpenbookQA-4ST Viewer • Updated Jul 9, 2024 • 9.28k • 25 • 3 Salesforce/wikitext Viewer • Updated Jan 4, 2024 • 3.71M • 1.15M • 665
Small-reasoning-models HuggingFaceTB/SmolLM3-3B Text Generation • 3B • Updated Sep 10, 2025 • 1.07M • 930 Qwen/Qwen3-4B-Base Text Generation • 4B • Updated Jul 26, 2025 • 1.01M • 84
AI Labs - FT Datasets Datasets that we want to use for experimenting with fine-tuning Salesforce/ReasoningJudgeBench Viewer • Updated Jun 7, 2025 • 1.48k • 41 • 5 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 395k • 325 bobox/OpenbookQA-4ST Viewer • Updated Jul 9, 2024 • 9.28k • 25 • 3 Salesforce/wikitext Viewer • Updated Jan 4, 2024 • 3.71M • 1.15M • 665
Fine-tuning Text-to-LoRA: Instant Transformer Adaption Paper • 2506.06105 • Published Jun 6, 2025 • 4 Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published Jun 19, 2025 • 133 ChatDOC/OCRFlux-3B Image-Text-to-Text • Updated Jul 9, 2025 • 2.81k • 367