Add model card for PRISM (Qwen2-VL-PRISM-SFT)
#1, opened by nielsr (HF Staff)
This PR adds a comprehensive model card for the Qwen2-VL-PRISM-SFT model, which is a core component of the PRISM framework for robust Vision-Language Model (VLM) alignment.
Key additions and improvements include:
- Metadata: Added `license: mit`, `library_name: transformers`, and `pipeline_tag: image-text-to-text`. These tags enhance discoverability on the Hugging Face Hub and enable the "Use in Transformers" widget, providing automated code snippets for users.
- Paper Link: Linked the model to its associated paper, PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality.
- GitHub Repository: Included a direct link to the official GitHub repository: https://github.com/SaFoLab-WISC/PRISM.
- Abstract: Incorporated the paper's abstract to provide a concise overview of the model's purpose and methodology.
- Model Details: Added information about the training datasets and model weights mentioned in the GitHub README.
- Citation: Included the BibTeX entry for easy referencing of the paper.
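
For reference, Hub metadata of this kind lives in the YAML front matter at the top of the model card's README.md. A minimal sketch, assuming only the three fields named in the Metadata item of this PR (the merged card may well contain additional fields not shown here):

```yaml
---
# Fields taken from the PR description; any other keys in the real card are not shown
license: mit
library_name: transformers
pipeline_tag: image-text-to-text
---
```

The Hub reads this block to populate the license badge, the library integration, and the task filter for the model page.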
Together, these updates make the model's documentation more informative and accessible for the community.
andyc03 changed pull request status to merged