Unknown's picture

Open to Collab

Unknown PRO

tuandunghcmut

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

liked a model 2 days ago

AIDC-AI/Marco-Mini-Instruct

View all activity

Organizations

upvoted a collection 24 days ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 3 days ago • 48

upvoted a paper 24 days ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 119

upvoted 2 collections 28 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 3 days ago • 268

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 28 items • Updated 3 days ago • 123

upvoted a paper about 1 month ago

MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks

Paper • 2601.14652 • Published Jan 21 • 4

upvoted a collection about 2 months ago

Layout Generation Dataset

2 items • Updated Jul 27, 2024 • 1

upvoted 2 papers about 2 months ago

ToolComp: A Multi-Tool Reasoning & Process Supervision Benchmark

Paper • 2501.01290 • Published Jan 2, 2025 • 1

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 247

upvoted a collection about 2 months ago

Nemotron-Terminal

We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 3 days ago • 34

upvoted 3 papers about 2 months ago

Endless Terminals: Scaling RL Environments for Terminal Agents

Paper • 2601.16443 • Published Jan 23 • 18

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 208

Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers

Paper • 2602.18292 • Published Feb 20 • 13

upvoted an article 2 months ago

Article

SmolLM-Smashed: Tiny Giants, Optimized for Speed

Jan 13

•

15

upvoted a paper 2 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 179

upvoted an article 2 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

71

upvoted a collection 2 months ago

Enterprise Agents and Benchmarks

Enterprise agent ecosystem featuring AssetOpsBench (industrial) and ITBench (SRE, FinOps, CISO), CUGA to accelerate AI Automation • 16 items • Updated 9 days ago • 15

upvoted 2 papers 3 months ago

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published Jan 20 • 57

Deep Research: A Systematic Survey

Paper • 2512.02038 • Published Nov 24, 2025 • 73

upvoted an article 3 months ago

Article

Design Patterns for Building Agentic Workflows

Jul 14, 2025

•

9

upvoted a paper 3 months ago

DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published Jul 15, 2025 • 35