ParaVT

community

AI & ML interests

None defined yet.

Recent Activity

yzzyu authored a paper about 21 hours ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

kcz358 authored a paper about 21 hours ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

mwxely updated a Space 1 day ago

View all activity

authored a paper about 21 hours ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published 8 days ago • 30

authored a paper about 21 hours ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published 8 days ago • 30

updated a Space 1 day ago

ParaVT

Parallel Video Tool Calling with Multi-Agent RL

published a Space 1 day ago

ParaVT

Parallel Video Tool Calling with Multi-Agent RL

submitted a paper to Daily Papers 1 day ago

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published 8 days ago • 30

authored 2 papers 5 days ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 186

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Paper • 2605.20342 • Published 8 days ago • 30

updated a dataset 6 days ago

ParaVT/ParaVT-Source

Updated 6 days ago • 815 • 2

updated a model 6 days ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated 6 days ago • 97 • 3

updated a dataset 6 days ago

ParaVT/ParaVT-Parquet

Viewer • Updated 6 days ago • 101k • 185 • 3

published 2 datasets 9 days ago

ParaVT/ParaVT-Source

Updated 6 days ago • 815 • 2

ParaVT/ParaVT-Parquet

Viewer • Updated 6 days ago • 101k • 185 • 3

published a model 9 days ago

ParaVT/ParaVT-8B

Video-Text-to-Text • 9B • Updated 6 days ago • 97 • 3

authored a paper 14 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 26 days ago • 48

authored a paper 15 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published 16 days ago • 29

submitted a paper to Daily Papers 15 days ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published 16 days ago • 29

authored a paper 20 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 26 days ago • 48

authored a paper 20 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 26 days ago • 48

submitted a paper to Daily Papers 21 days ago

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published 26 days ago • 48

authored a paper 23 days ago

MultiHaystack: Benchmarking Multimodal Retrieval and Reasoning over 40K Images, Videos, and Documents

Paper • 2603.05697 • Published Mar 5