view article Article Multimodal Embedding & Reranker Models with Sentence Transformers 4 days ago • 36
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 5 days ago • 40
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 11 days ago • 822
FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol Paper • 2603.24943 • Published 18 days ago • 12
gliner2 family Collection GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. This base model provid • 4 items • Updated Feb 10 • 42
view article Article Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI 26 days ago • 62
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published Dec 23, 2025 • 42
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas Paper • 2603.16448 • Published 26 days ago • 58
Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned Paper • 2603.05344 • Published Mar 5 • 7
Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought Paper • 2510.24941 • Published Oct 28, 2025 • 4
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 8 days ago • 140