Open-COT-Data

university

https://github.com/Open-DataFlow/Open-COT-Data

Recent Activity

lhpku20010120 authored a paper 10 days ago

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System

lhpku20010120 authored a paper 10 days ago

KeyVideoLLM: Towards Large-scale Video Keyframe Selection

lhpku20010120 authored a paper 10 days ago

Synth-Empathy: Towards High-Quality Synthetic Empathy Data

lhpku20010120 authored 20 papers 10 days ago

MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark

Paper • 2408.07543 • Published Aug 14, 2024

SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models

Paper • 2407.20756 • Published Jul 30, 2024 • 1

BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search

Paper • 2409.17972 • Published Sep 26, 2024

Baichuan Alignment Technical Report

Paper • 2410.14940 • Published Oct 19, 2024 • 51

Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

Paper • 2410.21169 • Published Oct 28, 2024 • 30

MC-LLaVA: Multi-Concept Personalized Vision-Language Model

Paper • 2411.11706 • Published Nov 18, 2024 • 1

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26, 2025 • 60

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning

Paper • 2410.12952 • Published Oct 16, 2024

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification

Paper • 2502.13383 • Published Feb 19, 2025

Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning

Paper • 2505.13261 • Published May 19, 2025

Let's Verify Math Questions Step by Step

Paper • 2505.13903 • Published May 20, 2025 • 2

MathClean: A Benchmark for Synthetic Mathematical Data Cleaning

Paper • 2502.19058 • Published Feb 26, 2025

Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions

Paper • 2506.07527 • Published Jun 9, 2025 • 3

LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts

Paper • 2505.13928 • Published May 20, 2025 • 2

QAEncoder: Towards Aligned Representation Learning in Question Answering System

Paper • 2409.20434 • Published Sep 30, 2024 • 1

Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models

Paper • 2506.12776 • Published Jun 15, 2025 • 2

Team members 2
