Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Manu Gaur's picture
1 5

Manu Gaur

manu-gaur
dark-pen's profile picture
·
https://manugaurdl.github.io/
  • gaur_manu
  • manugaurdl

AI & ML interests

I am interested in self-supervised learning, multimodal models, generative modelling and reinforcement learning

Recent Activity

authored a paper 10 days ago
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
authored a paper 10 days ago
Steerable Visual Representations
upvoted a paper 14 days ago
Steerable Visual Representations
View all activity

Organizations

None yet

upvoted a paper 14 days ago

Steerable Visual Representations

Paper • 2604.02327 • Published 16 days ago • 53
upvoted a paper about 1 month ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published Mar 3 • 103
upvoted a paper 3 months ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published Jan 22 • 55
upvoted a paper 6 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 170
upvoted a paper about 1 year ago

No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning

Paper • 2409.03025 • Published Sep 4, 2024 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs