Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio Paper • 2603.25926 • Published 21 days ago • 8
high-quality Chinese training datasets Collection A suite of high-quality Chinese datasets used for pretraining, fine-tuning, or preference alignment, along with the models trained on them. • 13 items • Updated May 22, 2025 • 24
Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence Paper • 2503.20533 • Published Mar 26, 2025 • 12