IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse Paper • 2603.12201 • Published Mar 12 • 53
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory Paper • 2603.03269 • Published Mar 3 • 63
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs Paper • 2603.05890 • Published Mar 6 • 93
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published Mar 8 • 86
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published Feb 24 • 102
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 156
A decoder-only foundation model for time-series forecasting Paper • 2310.10688 • Published Oct 14, 2023 • 26
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published Feb 15 • 53
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-Code-Reasoning-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 170 • 2
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-HealthCare-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 32
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-HealthCare-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 32
rizkysulaeman/Qwen3-VL-8B-Vision-GRPO-HealthCare-Q8_0-GGUF Image-Text-to-Text • 8B • Updated Feb 17 • 15 • 1
rizkysulaeman/Qwen3-VL-8B-Vision-GRPO-HealthCare-Q8_0-GGUF Image-Text-to-Text • 8B • Updated Feb 17 • 15 • 1
rizkysulaeman/Gemma3N-4B-Conv-MM-Img-Audio-Text-Code-Reasoning-Q8_0-GGUF Any-to-Any • 7B • Updated Feb 19 • 170 • 2