new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Nov 20

Submitted by

taesiri

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

·
25 authors

Submitted by

taesiri

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

·
11 authors

Submitted by

ambud26

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

3

Submitted by

taesiri

VisPlay: Self-Evolving Vision-Language Models from Images

UIUC-CS

University of Illinois at Urbana-Champaign

Submitted by

hangyulmd

Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset

kaist-ai

Submitted by

Jevin754

ARC-Chapter: Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

TencentARC

ARC Lab, Tencent PCG

Submitted by

taesiri

MHR: Momentum Human Rig

·
41 authors

Submitted by

Hao-Zhe

Mixture of States: Routing Token-Level Dynamics for Multimodal Generation

Submitted by

nielsr

RoMa v2: Harder Better Faster Denser Feature Matching

·
10 authors

Submitted by

doraemonILoveYou

FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI

·
9 authors

Submitted by

dorienh

Aligning Generative Music AI with Human Preferences: Methods and Challenges

·
2 authors

2

Submitted by

spc819

Medal S: Spatio-Textual Prompt Model for Medical Segmentation

·
6 authors