cagatay odabasi

cagatayodabasi

cagbal

AI & ML interests

None yet

Recent Activity

liked a dataset 2 months ago

nvidia/PhysicalAI-Robotics-Manipulation-Objects-Kitchen-MJCF

upvoted a collection about 1 month ago

MolmoPoint

MolmoPoint models • 3 items • Updated Mar 18 • 11

upvoted a paper about 2 months ago

VLANeXt: Recipes for Building Strong VLA Models

Paper • 2602.18532 • Published Feb 20 • 52

upvoted a collection 5 months ago

VST

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities. • 6 items • Updated Feb 1 • 6

upvoted a collection 8 months ago

Cosmos-Predict2

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos-predict25 • 10 items • Updated 4 days ago • 36

upvoted a paper about 1 year ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7, 2025 • 82

upvoted a collection about 1 year ago

Physical AI

Collection of open, commercial-grade datasets for physical AI developers • 29 items • Updated 4 days ago • 143

upvoted a paper about 1 year ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 125

upvoted a collection over 1 year ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated Mar 2 • 89

upvoted a paper over 1 year ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 129

upvoted a collection over 1 year ago

Theia

Distilling Diverse Vision Foundation Models for Robot Learning • 6 items • Updated Sep 30, 2024 • 9

upvoted an article over 1 year ago

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

Article • Jul 10, 2024 • 93

upvoted 2 papers over 1 year ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22, 2024 • 94

3D-VLA: A 3D Vision-Language-Action Generative World Model

Paper • 2403.09631 • Published Mar 14, 2024 • 12

upvoted a collection over 1 year ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 4 days ago • 64

upvoted 6 papers over 1 year ago

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13, 2024 • 32

DC3DO: Diffusion Classifier for 3D Objects

Paper • 2408.06693 • Published Aug 13, 2024 • 11

Imagen 3

Paper • 2408.07009 • Published Aug 13, 2024 • 62

Task-oriented Sequential Grounding in 3D Scenes

Paper • 2408.04034 • Published Aug 7, 2024 • 8

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7, 2024 • 31

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 28