Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Miguel Alonso Jr's picture
1 7 6

Miguel Alonso Jr

miguelalonsojr
muhammadzeeshan007's profile picture 21world's profile picture nilq's profile picture
·
  • miguelalonsojr
  • miguelalonsojr
  • miguelalonsojr.bsky.social

AI & ML interests

ML, RL, Robotics

Organizations

Unity Technologies's profile picture Social Post Explorers's profile picture Hugging Face Discord Community's profile picture

upvoted an article about 1 year ago
view article
Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

  • +2
Feb 4, 2025
•
192
upvoted 3 papers about 2 years ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 69

Nash Learning from Human Feedback

Paper • 2312.00886 • Published Dec 1, 2023 • 18

Aligning Large Multimodal Models with Factually Augmented RLHF

Paper • 2309.14525 • Published Sep 25, 2023 • 32
upvoted a paper over 2 years ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 64
upvoted 2 collections over 2 years ago

Zephyr 7B

Collection
Models, datasets, and demos associated with Zephyr 7B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 8 items • Updated Mar 2 • 152

Papers about model merging

Collection
referenced in the mergekit repo: https://github.com/cg123/mergekit • 4 items • Updated 25 days ago • 15
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs