19 14

Xc1hpxn23

xc1hpxn23

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment

upvoted a paper 2 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

liked a model 3 days ago

bozuuu/my-cool-model

View all activity

Organizations

None yet

upvoted a paper about 7 hours ago

Conditional Equivalence of DPO and RLHF: Implicit Assumption, Failure Modes, and Provable Alignment

Paper • 2605.20834 • Published 1 day ago • 3

upvoted a paper 2 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 9 days ago • 259

liked a model 3 days ago

bozuuu/my-cool-model

Updated 3 days ago • 1

liked a dataset 7 days ago

pluslab/PLUS_Lab_GPUs_Data

Preview • Updated 1 minute ago • 18.5k • 7

liked a dataset 10 days ago

Wayl/20260511_103841_sfp_03_train

Viewer • Updated 10 days ago • 21.3k • 176 • 1

liked a dataset 15 days ago

wisent-ai/activations

Updated about 18 hours ago • 65.3k • 6

liked a model 20 days ago

gradients-io-tournaments/tournament-tourn_f4f456bc6d050b8b_20260430-04b98654-a18a-49c0-b291-2c623c1cfbc1-5C7vE26G

2B • Updated 20 days ago • 33 • 1

liked a dataset 27 days ago

P2SAMAPA/p2-etf-informer-results

Updated about 11 hours ago • 1.19k • 1

upvoted a paper about 1 month ago

MedGemma 1.5 Technical Report

Paper • 2604.05081 • Published Apr 6 • 14

liked a dataset about 1 month ago

lmsys/lmsys-chat-1m

Viewer • Updated Jul 27, 2024 • 1M • 8.4k • 890

upvoted 4 papers about 1 month ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

liked a dataset about 1 month ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 38.4k • 1.74k

upvoted a paper about 1 month ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 176

liked 2 models about 2 months ago

OrchardPair/DeepSeek-R1-Distill-Qwen-14B-4bit

2B • Updated Apr 4 • 477 • 1

neptunes5thmoon/equivariance

Updated 19 days ago • 3

upvoted 2 papers about 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models

Paper • 2603.25750 • Published Mar 20 • 36

Xc1hpxn23

AI & ML interests

Recent Activity

Organizations

xc1hpxn23's activity