When Models Manipulate Manifolds: The Geometry of a Counting Task Paper • 2601.04480 • Published Jan 8 • 4
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 Text Generation • 18B • Updated 30 days ago • 612k • 137
view article Article Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models Jan 6 • 28
meituan-longcat/LongCat-Flash-Thinking-2601 Text Generation • 562B • Updated Jan 23 • 3.64k • 108
view article Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers +5 Sep 11, 2025 • 186
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 124
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face Dec 5, 2025 • 43