2 36 9

nanatata

AI & ML interests

None yet

Recent Activity

upvoted a paper about 10 hours ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

upvoted a paper 3 months ago

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

upvoted a paper 3 months ago

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

View all activity

Organizations

None yet

upvoted a paper about 10 hours ago

OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models

Paper • 2604.10866 • Published 4 days ago • 45

upvoted 4 papers 3 months ago

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Paper • 2601.10156 • Published Jan 15 • 26

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

Paper • 2601.06860 • Published Jan 11 • 16

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Paper • 2601.05808 • Published Jan 9 • 37

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 113

upvoted 2 papers 4 months ago

Thinking with Programming Vision: Towards a Unified View for Thinking with Images

Paper • 2512.03746 • Published Dec 3, 2025 • 17

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 161

liked a dataset 5 months ago

We-Math/VTBench

Viewer • Updated Nov 26, 2025 • 500 • 24 • 7

upvoted 4 papers 5 months ago

MedSAM3: Delving into Segment Anything with Medical Concepts

Paper • 2511.19046 • Published Nov 24, 2025 • 55

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 96

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 46

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published Nov 6, 2025 • 98

liked 2 datasets 5 months ago

We-Math/V-Perception-40K

Viewer • Updated Nov 7, 2025 • 36.7k • 18 • 7

We-Math/V-Interaction-400K

Viewer • Updated Nov 7, 2025 • 253k • 469 • 14

liked a model 5 months ago

We-Math/V-Thinker

8B • Updated Nov 6, 2025 • 66 • 9

liked a dataset 5 months ago

dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17, 2025 • 1.07k • 61 • 6

upvoted 3 papers 5 months ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29, 2025 • 66

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 80

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 72

upvoted a paper 6 months ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 103

nanatata

AI & ML interests

Recent Activity

Organizations

nanatata's activity