Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling Paper • 2111.14819 • Published Nov 29, 2021
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 11 days ago • 182
Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training Paper • 2603.12255 • Published Mar 12 • 91
BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence Paper • 2411.14869 • Published Nov 22, 2024 • 1
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation Paper • 2303.06285 • Published Mar 11, 2023
Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation Paper • 2312.10113 • Published Dec 15, 2023
Paint Transformer: Feed Forward Neural Painting with Stroke Prediction Paper • 2108.03798 • Published Aug 9, 2021
Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence Paper • 2203.00911 • Published Mar 2, 2022
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding Paper • 2412.13193 • Published Dec 17, 2024 • 1