ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Paper • 2511.10645 • Published Nov 13, 2025 • 8 • 4
ParoQuant Collection Pairwise Rotation Quantization for Efficient Reasoning LLM Inference • 18 items • Updated 12 days ago • 18
SparseLoRA Collection Accelerating LLM Fine-Tuning with Contextual Sparsity • 4 items • Updated Mar 11 • 2
ArcMemo: Abstract Reasoning Composition with Lifelong LLM Memory Paper • 2509.04439 • Published Sep 4, 2025 • 1
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems Paper • 2510.12872 • Published Oct 14, 2025 • 4
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17, 2025 • 92