Improve model card: Add tags, paper link, abstract summary, and citation
#1
by nielsr HF Staff - opened
This PR significantly enhances the model card for GT-GRPO: Qwen3-8B-Base trained on DAPO-14k by:
- Adding
library_name: transformersto the metadata, enabling the automated "How to use" widget, as evidenced byconfig.jsonandtokenizer_config.json. - Adding
pipeline_tag: text-generationfor better discoverability, as the model is a Causal LM for reasoning tasks. - Incorporating a direct link to the research paper: Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models.
- Including a concise summary of the paper's abstract to provide essential context about the Co-rewarding framework.
- Updating the main title for clarity and adding a dedicated section for the GitHub repository.
- Adding the BibTeX citation provided in the project's GitHub README.
These changes improve the model's discoverability and provide users with comprehensive information for better understanding and usage.
Geraldxm changed pull request status to merged