nvidia/llama-nemotron-rerank-vl-1b-v2 · Use default attention implementation with option to override

Use default attention implementation with option to override

#2

by nvidia-oliver-holworthy - opened Jan 8

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

nvidia-oliver-holworthy

NVIDIA org Jan 8

No description provided.

Jan 22

Added code for enabling sdpa and eager here: https://huggingface.co/nvidia/llama-nemotron-rerank-vl-1b-v2/discussions/3

nvidia-oliver-holworthy

NVIDIA org Feb 23

Replaced by #4

nvidia-oliver-holworthy changed pull request status to closed Feb 23

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment