Zhao Yian's picture

Zhao Yian

zhaoyian01

·

Zhao-Yian

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Enhancing Spatial Understanding in Image Generation via Reward Modeling

upvoted a paper 2 months ago

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

upvoted a paper 3 months ago

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published Feb 27 • 59

upvoted a paper 2 months ago

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Paper • 2602.02493 • Published Feb 2 • 46

upvoted a paper 3 months ago

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 52

upvoted a paper 5 months ago

DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

Paper • 2511.19365 • Published Nov 24, 2025 • 66

upvoted 2 papers 6 months ago

RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models

Paper • 2510.25257 • Published Oct 29, 2025 • 6

iSegMan: Interactive Segment-and-Manipulate 3D Gaussians

Paper • 2505.11934 • Published May 17, 2025 • 1

upvoted a collection 6 months ago

RT-DETRs

RT-DETR family • 5 items • Updated Oct 31, 2025 • 1

upvoted a paper 6 months ago

DETRs Beat YOLOs on Real-time Object Detection

Paper • 2304.08069 • Published Apr 17, 2023 • 16

upvoted a paper about 1 year ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22, 2025 • 91

upvoted a paper over 1 year ago

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

Paper • 2410.12787 • Published Oct 16, 2024 • 30