view article Article Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs 12 days ago • 6
view article Article A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons Feb 4, 2025 • 34
prathameshkalamkar/gemma-2b-sql-finetuned-dist-4gpu Text Generation • 3B • Updated May 9, 2025 • 1
prathameshkalamkar/gemma-2b-sql-finetuned-dist-4gpu Text Generation • 3B • Updated May 9, 2025 • 1
SemEval 2023 Task 6: LegalEval - Understanding Legal Texts Paper • 2304.09548 • Published Apr 19, 2023
Named Entity Recognition in Indian court judgments Paper • 2211.03442 • Published Nov 7, 2022 • 1
Running 3.78k The Ultra-Scale Playbook 🌌 3.78k The ultimate guide to training LLM on large GPU Clusters
Aalap: AI Assistant for Legal & Paralegal Functions in India Paper • 2402.01758 • Published Jan 30, 2024 • 2
opennyaiorg/Aalap-Mistral-7B-v0.1-bf16 Text Generation • 7B • Updated Jun 11, 2025 • 1.2k • 9
opennyaiorg/Aalap-Mistral-7B-v0.1-bf16 Text Generation • 7B • Updated Jun 11, 2025 • 1.2k • 9