GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents Paper โข 2603.24329 โข Published 23 days ago โข 28
Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents Paper โข 2510.23691 โข Published Oct 27, 2025 โข 56
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers โข 29 items โข Updated 3 days ago โข 143
microsoft/Phi-3-mini-4k-instruct-gguf Text Generation โข 4B โข Updated Dec 10, 2025 โข 64.6k โข 576
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models Paper โข 2602.07026 โข Published Feb 2 โข 140
view post Post 3209 releasing: smol vision ๐ผ A repository with notebooks on shrinking, optimizing, speeding-up, customizing large vision models! https://github.com/merveenoyan/smol-vision 1 reply ยท ๐ฅ 18 18 โค๏ธ 4 4 ๐ 3 3 ๐ค 2 2 ๐ 1 1 ๐ค 1 1 ๐ง 1 1 ๐คฏ 1 1 โ 1 1 ๐ 1 1 ๐ 1 1 + Reply
TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis Paper โข 2307.15042 โข Published Jul 27, 2023 โข 7 โข 1
view article Article You could have designed state of the art positional encoding Nov 25, 2024 โข 468
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 โข 65
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques ๐ ๐ Aug 26, 2024 โข 88
Running on CPU Upgrade Featured 3.11k The Smol Training Playbook ๐ 3.11k The secrets to building world-class LLMs