Dongfu Jiang

DongfuJiang

https://jdf-prog.github.io/

AI & ML interests

Large Language Models, Modality Reasoning, and their evaluation

Recent Activity

upvoted a paper 2 days ago

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published 4 days ago • 27

upvoted a paper 4 days ago

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published 10 days ago • 91

upvoted a paper 7 days ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 9 days ago • 255

upvoted a paper 9 days ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published 12 days ago • 35

upvoted a paper 14 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published 19 days ago • 30

upvoted a paper 24 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published about 1 month ago • 94

upvoted a paper 28 days ago

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 29 days ago • 66

upvoted a paper about 1 month ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 180

upvoted a paper about 2 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

upvoted 3 papers 5 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 74

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 44

upvoted 2 articles 6 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

Article • Oct 30, 2025 • 80

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Article • Oct 30, 2025 • 43

upvoted 5 papers 6 months ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published Oct 24, 2025 • 22

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 108

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12, 2025 • 28

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 277

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 81

upvoted a paper 7 months ago

VideoScore2: Think before You Score in Generative Video Evaluation

Paper • 2509.22799 • Published Sep 26, 2025 • 26