Improve model card: Add pipeline tag, library_name, and paper link
#1
by nielsr HF Staff - opened
This PR enhances the model card for the Llama-3.2-3B-Instruct model, specifically the "Entropy Minimization" version, which is part of the "Co-rewarding" project.
Changes include:
- Adding
pipeline_tag: text-generationto improve discoverability on the Hugging Face Hub. - Specifying
library_name: transformersto enable the automated "How to use" widget for seamless interaction with the model. - Incorporating a direct link to the paper: Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models, providing immediate context for the model.
These updates aim to provide more comprehensive information and improve the user experience for this model.
resistz changed pull request status to merged