2 9 4

Kaixiong Gong

kxgong

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Gen-Searcher: Reinforcing Agentic Search for Image Generation

upvoted a paper 22 days ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

upvoted a paper 23 days ago

Manifold-Aware Exploration for Reinforcement Learning in Video Generation

View all activity

Organizations

upvoted a paper 16 days ago

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published 17 days ago • 57

upvoted a paper 22 days ago

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Paper • 2603.23500 • Published 23 days ago • 35

upvoted a paper 23 days ago

Manifold-Aware Exploration for Reinforcement Learning in Video Generation

Paper • 2603.21872 • Published 24 days ago • 33

authored a paper 3 months ago

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Paper • 2601.17124 • Published Jan 23 • 33

upvoted a paper 3 months ago

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Paper • 2601.17124 • Published Jan 23 • 33

upvoted 3 papers 4 months ago

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Paper • 2511.23002 • Published Nov 28, 2025 • 26

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 265

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published Dec 2, 2025 • 65

liked a model 7 months ago

tencent/HunyuanImage-2.1

Text-to-Image • Updated Oct 14, 2025 • 229 • • 380

liked a model 10 months ago

tencent/Hunyuan3D-2.1

Image-to-3D • Updated Oct 17, 2025 • 38.3k • 892

authored a paper about 1 year ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27, 2025 • 79

upvoted a paper about 1 year ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27, 2025 • 79

authored a paper over 1 year ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 25

liked a model almost 2 years ago

fal/AuraSR

Updated Jul 15, 2024 • 249 • 307

upvoted a paper almost 2 years ago

Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

Paper • 2406.11817 • Published Jun 17, 2024 • 13

New activity in mistralai/Mixtral-8x7B-v0.1 about 2 years ago

Out of memory issue.

#34 opened about 2 years ago by

kxgong

authored a paper about 2 years ago

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Paper • 2401.14405 • Published Jan 25, 2024 • 13

authored 2 papers over 2 years ago

Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

Paper • 2312.04963 • Published Dec 7, 2023 • 17

OneLLM: One Framework to Align All Modalities with Language

Paper • 2312.03700 • Published Dec 6, 2023 • 24

updated a model over 2 years ago

kxgong/Meta-Transformer

Updated Jul 27, 2023 • 4

Kaixiong Gong

AI & ML interests

Recent Activity

Organizations

kxgong's activity

Out of memory issue.