Improve model card: Add metadata, paper link, abstract, and citation
#1
by nielsr HF Staff - opened
This PR significantly enhances the model card for Co-rewarding-I: Qwen3-4B-Base trained on OpenRS by:
- Adding
pipeline_tag: text-generationfor better discoverability and to indicate its primary use case. - Adding
library_name: transformersto enable the automated "How to use" widget, as evidenced byconfig.json. - Adding
license: apache-2.0based on common practices and majority consensus from other reviewers. - Adding a prominent link to the associated Hugging Face paper: Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models.
- Including the full abstract of the paper to provide comprehensive context about the model.
- Adding the BibTeX citation for proper academic attribution.
These changes will make the model more informative, discoverable, and user-friendly on the Hugging Face Hub.
resistz changed pull request status to merged