Improve model card: Add metadata, paper link, abstract, and citation

by nielsr HF Staff - opened Oct 11, 2025

←

This PR significantly enhances the model card for Co-rewarding-I: Qwen3-4B-Base trained on OpenRS by:

Adding pipeline_tag: text-generation for better discoverability and to indicate its primary use case.
Adding library_name: transformers to enable the automated "How to use" widget, as evidenced by config.json.
Adding license: apache-2.0 based on common practices and majority consensus from other reviewers.
Adding a prominent link to the associated Hugging Face paper: Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models.
Including the full abstract of the paper to provide comprehensive context about the model.
Adding the BibTeX citation for proper academic attribution.

These changes will make the model more informative, discoverable, and user-friendly on the Hugging Face Hub.

resistz changed pull request status to merged Oct 11, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment