Improve model card: Add metadata, paper link, abstract, and citation

#1
by nielsr HF Staff - opened

This PR significantly enhances the model card for Co-rewarding-I: Qwen3-4B-Base trained on OpenRS by:

  • Adding pipeline_tag: text-generation for better discoverability and to indicate its primary use case.
  • Adding library_name: transformers to enable the automated "How to use" widget, as evidenced by config.json.
  • Adding license: apache-2.0 based on common practices and majority consensus from other reviewers.
  • Adding a prominent link to the associated Hugging Face paper: Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models.
  • Including the full abstract of the paper to provide comprehensive context about the model.
  • Adding the BibTeX citation for proper academic attribution.

These changes will make the model more informative, discoverable, and user-friendly on the Hugging Face Hub.

resistz changed pull request status to merged

Sign up or log in to comment