Running on CPU Upgrade Featured 3.1k The Smol Training Playbook 📚 3.1k The secrets to building world-class LLMs
Intra-Layer Recurrence in Transformers for Language Modeling Paper • 2505.01855 • Published May 3, 2025
pritamdeka/BioBERT-mnli-snli-scinli-scitail-mednli-stsb Sentence Similarity • Updated Sep 6, 2024 • 80.2k • • 58