PAS: Data-Efficient Plug-and-Play Prompt Augmentation System Paper • 2407.06027 • Published Jul 8, 2024
KeyVideoLLM: Towards Large-scale Video Keyframe Selection Paper • 2407.03104 • Published Jul 3, 2024
Synth-Empathy: Towards High-Quality Synthetic Empathy Data Paper • 2407.21669 • Published Jul 31, 2024
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs Paper • 2408.01122 • Published Aug 2, 2024
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark Paper • 2408.07543 • Published Aug 14, 2024
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models Paper • 2407.20756 • Published Jul 30, 2024
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search Paper • 2409.17972 • Published Sep 26, 2024
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction Paper • 2410.21169 • Published Oct 28, 2024
MC-LLaVA: Multi-Concept Personalized Vision-Language Model Paper • 2411.11706 • Published Nov 18, 2024
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning Paper • 2410.12952 • Published Oct 16, 2024
MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification Paper • 2502.13383 • Published Feb 19, 2025
Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning Paper • 2505.13261 • Published May 19, 2025
MathClean: A Benchmark for Synthetic Mathematical Data Cleaning Paper • 2502.19058 • Published Feb 26, 2025
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions Paper • 2506.07527 • Published Jun 9, 2025
LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts Paper • 2505.13928 • Published May 20, 2025
QAEncoder: Towards Aligned Representation Learning in Question Answering System Paper • 2409.20434 • Published Sep 30, 2024
Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models Paper • 2506.12776 • Published Jun 15, 2025