community-science-team

community

AI & ML interests

None defined yet.

Recent Activity

nielsr submitted a paper 3 days ago

Geometric Context Transformer for Streaming 3D Reconstruction

nielsr submitted a paper 10 days ago

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

akhaliq submitted a paper 16 days ago

MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

View all activity

submitted a paper to Daily Papers 3 days ago

Geometric Context Transformer for Streaming 3D Reconstruction

Paper • 2604.14141 • Published 4 days ago • 4

submitted a paper to Daily Papers 10 days ago

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Paper • 2604.04913 • Published 13 days ago • 10

submitted a paper to Daily Papers 16 days ago

MultiGen: Level-Design for Editable Multiplayer Worlds in Diffusion Game Engines

Paper • 2603.06679 • Published 20 days ago • 5

submitted a paper to Daily Papers 16 days ago

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Paper • 2603.28130 • Published 20 days ago • 11

submitted a paper to Daily Papers 22 days ago

AVO: Agentic Variation Operators for Autonomous Evolutionary Search

Paper • 2603.24517 • Published 25 days ago • 10

submitted a paper to Daily Papers 27 days ago

Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders

Paper • 2603.19209 • Published about 1 month ago • 5

submitted a paper to Daily Papers about 1 month ago

V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 30

submitted a paper to Daily Papers about 1 month ago

V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising

Paper • 2603.16792 • Published Mar 17 • 3

submitted a paper to Daily Papers about 1 month ago

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 21

submitted a paper to Daily Papers about 1 month ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published Mar 13 • 43

authored a paper about 1 month ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

submitted 2 papers to Daily Papers about 2 months ago

VidEoMT: Your ViT is Secretly Also a Video Segmentation Model

Paper • 2602.17807 • Published Feb 19 • 7

Causal-JEPA: Learning World Models through Object-Level Latent Interventions

Paper • 2602.11389 • Published Feb 11 • 8

submitted a paper to Daily Papers 2 months ago

SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization

Paper • 2602.04811 • Published Feb 4 • 2

submitted 2 papers to Daily Papers 3 months ago

Visual Personalization Turing Test

Paper • 2601.22680 • Published Jan 30 • 2

Causal World Modeling for Robot Control

Paper • 2601.21998 • Published Jan 29 • 31

submitted a paper to Daily Papers 3 months ago

UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders

Paper • 2601.17950 • Published Jan 25 • 4

submitted 3 papers to Daily Papers 3 months ago

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Paper • 2601.14253 • Published Jan 20 • 10

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Paper • 2601.09499 • Published Jan 14 • 11

UM-Text: A Unified Multimodal Model for Image Understanding

Paper • 2601.08321 • Published Jan 13 • 12