2 20 2

Tianqing Fang

tqfang229

https://tqfang.github.io/

AI & ML interests

LLM, Agent

Recent Activity

upvoted a paper 13 days ago

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

upvoted a paper 25 days ago

Free(): Learning to Forget in Malloc-Only Reasoning Models

upvoted a collection about 1 month ago

Penguin-VL

View all activity

Organizations

None yet

upvoted a paper 13 days ago

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Paper • 2603.28407 • Published 16 days ago • 68

upvoted a paper 25 days ago

Free(): Learning to Forget in Malloc-Only Reasoning Models

Paper • 2602.08030 • Published Feb 8 • 6

upvoted a collection about 1 month ago

Penguin-VL

Collection

7 items • Updated 2 days ago • 13

upvoted a paper about 1 month ago

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 118

upvoted a collection 2 months ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 10 items • Updated 6 days ago • 107

upvoted a paper 3 months ago

Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning

Paper • 2601.19280 • Published Jan 27 • 9

authored 4 papers 3 months ago

WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality

Paper • 2510.18560 • Published Oct 21, 2025 • 1

InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing

Paper • 2505.22156 • Published May 28, 2025

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

upvoted a paper 3 months ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

submitted a paper to Daily Papers 3 months ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

upvoted 2 papers 4 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 126

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55

upvoted 4 papers 6 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 119

HSCodeComp: A Realistic and Expert-level Benchmark for Deep Search Agents in Hierarchical Rule Application

Paper • 2510.19631 • Published Oct 22, 2025 • 28

DeepWideSearch: Benchmarking Depth and Width in Agentic Information Seeking

Paper • 2510.20168 • Published Oct 23, 2025 • 28

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Paper • 2510.14438 • Published Oct 16, 2025 • 14

authored 2 papers 6 months ago

Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Paper • 2510.14438 • Published Oct 16, 2025 • 14

Benchmarking Commonsense Knowledge Base Population with an Effective Evaluation Dataset

Paper • 2109.07679 • Published Sep 16, 2021

Tianqing Fang

AI & ML interests

Recent Activity

Organizations

tqfang229's activity