Add model card for PRISM (Qwen2-VL-PRISM-SFT)

by nielsr HF Staff - opened Aug 31, 2025

base: refs/heads/main ← from: refs/pr/1

Files changed: +49 -0

nielsr

Aug 31, 2025

This PR adds a comprehensive model card for the Qwen2-VL-PRISM-SFT model, which is a core component of the PRISM framework for robust Vision-Language Model (VLM) alignment.

Key additions and improvements include:

- Metadata: Added `license: mit`, `library_name: transformers`, and `pipeline_tag: image-text-to-text`. These tags enhance discoverability on the Hugging Face Hub and enable the "Use in Transformers" widget, providing automated code snippets for users.
- Paper Link: Linked the model to its associated paper, PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality.
- GitHub Repository: Included a direct link to the official GitHub repository: https://github.com/SaFoLab-WISC/PRISM.
- Abstract: Incorporated the paper's abstract to provide a concise overview of the model's purpose and methodology.
- Model Details: Added information about the training datasets and model weights mentioned in the GitHub README.
- Citation: Included the BibTeX entry for easy referencing of the paper.
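
For readers unfamiliar with Hub model cards: the metadata fields listed above live in a YAML front-matter block at the top of the repository's `README.md`. The following is a minimal sketch of how those three fields would render, not the exact file contents from this PR. The helper name `render_front_matter` is hypothetical, and the real card may carry additional fields or a different ordering.

```python
# Metadata fields named in the PR description. Any other fields the real
# model card may contain are not shown here.
CARD_METADATA = {
    "license": "mit",
    "library_name": "transformers",
    "pipeline_tag": "image-text-to-text",
}


def render_front_matter(metadata: dict[str, str]) -> str:
    """Render flat key/value metadata as a YAML front-matter block.

    Hypothetical helper for illustration only; it handles simple string
    values and does no YAML escaping.
    """
    lines = ["---"]
    lines += [f"{key}: {value}" for key, value in metadata.items()]
    lines.append("---")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Print the block as it would appear at the top of README.md.
    print(render_front_matter(CARD_METADATA), end="")
```

The Hub reads this block to populate the model page's tags; the Markdown body of the card (abstract, model details, citation) follows the closing `---`.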

Together, these updates give the model a declared license, usage metadata, and direct links to the paper and code, making it more informative and accessible for the community.

Add model card for PRISM (Qwen2-VL-PRISM-SFT) d9413d4f

andyc03 changed pull request status to merged Sep 1, 2025
