FlashHead Collection Efficient Drop-In Replacement for the Classification Head in Language Model Inference. https://github.com/embedl/flash-head • 30 items • Updated 2 days ago • 2
LFM2 2.6B Mr. Tic Tac Toe ❌ ⭕ Collection Dataset and models for transforming LFM2 2.6B into a Tic Tac Toe master using RL Environments. Free course: https://t.ly/4jIFq • 8 items • Updated 4 days ago • 2
QSBench | Synthetic Quantum Circuits for ML Benchmarking Collection Free synthetic quantum datasets. Includes QASM, adjacency matrices, gate statistics, entanglement metrics. • 7 items • Updated 7 days ago • 1
Agent-STAR Collection Resources for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe" • 9 items • Updated 20 days ago • 2
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. • 28 items • Updated about 12 hours ago • 123
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 11 days ago • 822
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 13 days ago • 47
🍺 The Bartenders 🍺 Collection This is a collection of models that I've trained on data collected through conversations with frontier models GPT, Claude, Perplexity and myself. • 7 items • Updated 12 days ago • 2
view article Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation 20 days ago • 16
view article Article ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional Activation Editing 19 days ago • 5
Flash-Cascade Collection Using Multimodal Large Language Models (MLLMs) for false alarm reduction in image-based fire detection.Doi:https://doi.org/10.21203/rs.3.rs-8847038/v1 • 10 items • Updated 28 days ago • 2