赵嘉豪

jacobrobinson

AI & ML interests

Alignment-focused model research. Building practical tools.

Recent Activity

upvoted a paper about 9 hours ago

From Runnable to Shippable: Multi-Agent Test-Driven Development for Generating Full-Stack Web Applications from Requirements

upvoted a paper about 13 hours ago

Unsupervised Process Reward Models

liked a model 1 day ago

tencent/Hy-MT2-30B-A3B

View all activity

Organizations

None yet

upvoted a paper about 9 hours ago

From Runnable to Shippable: Multi-Agent Test-Driven Development for Generating Full-Stack Web Applications from Requirements

Paper • 2605.17242 • Published 7 days ago • 12

upvoted a paper about 13 hours ago

Unsupervised Process Reward Models

Paper • 2605.10158 • Published 13 days ago • 23

liked a model 1 day ago

tencent/Hy-MT2-30B-A3B

Translation • 30B • Updated 1 day ago • 970 • 283

upvoted a paper 1 day ago

From Reasoning Chains to Verifiable Subproblems: Curriculum Reinforcement Learning Enables Credit Assignment for LLM Reasoning

Paper • 2605.22074 • Published 3 days ago • 2

liked a model 2 days ago

iamseungpil/metacot-h200-arm3-stable-gfn-primary-0520

Updated about 15 hours ago • 2

upvoted a paper 4 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 11 days ago • 262

liked a dataset 5 days ago

allenai/dolma

Updated Apr 17, 2024 • 4.77k • 1.03k

liked a model 9 days ago

Pradheep1647/kibitzer-sft-elo-1079-step-004000

Updated 9 days ago • 1

liked a dataset 16 days ago

investigation-worlds/investigation-worlds

Viewer • Updated 12 days ago • 100 • 524 • 1

upvoted a paper 16 days ago

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation

Paper • 2604.28196 • Published 24 days ago • 71

liked a dataset 29 days ago

world-igr-plum/regions

Updated Jun 17, 2025 • 373k • 14

upvoted 2 papers about 1 month ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 240

Crowded in B-Space: Calibrating Shared Directions for LoRA Merging

Paper • 2604.16826 • Published Apr 18 • 18

liked a model about 1 month ago

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 858 • 906

upvoted a paper about 1 month ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 246

liked a model about 1 month ago

Macooo/qwen3-1.7b-lima-pass5-lora

Updated Apr 13 • 1

liked a dataset about 1 month ago

fka/prompts.chat

Viewer • Updated about 12 hours ago • 1.83k • 47k • 9.7k

upvoted 3 papers about 1 month ago

An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU

Paper • 2603.16428 • Published Mar 17 • 51

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published Apr 6 • 41

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

赵 嘉豪

AI & ML interests

Recent Activity

Organizations

jacobrobinson's activity

赵嘉豪