2 8 5

zkjiang

justin-zk

https://jiangzhengkai.github.io/

AI & ML interests

Generative AI

Recent Activity

upvoted a paper 9 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

liked a model 7 months ago

tencent/HunyuanImage-3.0

liked a model 7 months ago

tencent/HunyuanImage-2.1

View all activity

Organizations

None yet

upvoted a paper 9 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 12 days ago • 233

liked 2 models 7 months ago

tencent/HunyuanImage-3.0

Text-to-Image • Updated Jan 28 • 18.7k • • 663

tencent/HunyuanImage-2.1

Text-to-Image • Updated Oct 14, 2025 • 209 • • 380

upvoted 2 papers 11 months ago

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs

Paper • 2506.01674 • Published Jun 2, 2025 • 28

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3, 2025 • 58

upvoted an article 11 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

119

upvoted a paper over 1 year ago

EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation

Paper • 2501.01895 • Published Jan 3, 2025 • 55

liked a Space over 1 year ago

RAG Demo

👀

Generate detailed images from prompts and layouts

authored a paper over 1 year ago

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Paper • 2411.06558 • Published Nov 10, 2024 • 36

upvoted a paper over 1 year ago

Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement

Paper • 2411.06558 • Published Nov 10, 2024 • 36

liked 2 Spaces over 1 year ago

Kolors Virtual Try-On

👕

10k

Generate a virtual try‑on image of a person wearing a garment

Personalize SAM

📉

updated a Space over 1 year ago

Personalize SAM

📉

authored 7 papers over 1 year ago

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Paper • 2305.11176 • Published May 18, 2023

You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction

Paper • 2205.14871 • Published May 30, 2022

Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers

Paper • 2405.05945 • Published May 9, 2024 • 4

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Paper • 2403.11289 • Published Mar 17, 2024

zkjiang

AI & ML interests

Recent Activity

Organizations

justin-zk's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

RAG Demo

Kolors Virtual Try-On

Personalize SAM

Personalize SAM