nielsr HF Staff commited on
Commit
015edc9
·
verified ·
1 Parent(s): 6502f9a

Add initial model card for MVP: Multi-view-guided Passage Reranking

Browse files

This PR adds an initial model card for the MVP model, based on the paper "Multi-view-guided Passage Reranking with Large Language Models".

It includes:
- A link to the paper: [Multi-view-guided Passage Reranking with Large Language Models](https://huggingface.co/papers/2509.07485).
- The `license` (Apache 2.0).
- The `library_name` (transformers), which enables the automated "how to use" widget on the Hub page due to the model's compatibility with the library.
- The `pipeline_tag` (text-ranking), making the model discoverable at https://huggingface.co/models?pipeline_tag=text-ranking.
- The `base_model` (`google/t5-3b`) for better context.
- An overview of the model from the paper abstract and GitHub README.
- Setup and usage instructions directly from the GitHub repository.
- Links to available model checkpoints and datasets.
- A BibTeX citation.

Please review and merge this PR if everything looks good.

Files changed (1) hide show
  1. README.md +73 -0
README.md ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ pipeline_tag: text-ranking
4
+ library_name: transformers
5
+ base_model: google/t5-3b
6
+ ---
7
+
8
+ # MVP: Multi-view-guided Passage Reranking with Large Language Models
9
+
10
+ This repository contains the official implementation of the paper: [**Multi-view-guided Passage Reranking with Large Language Models**](https://huggingface.co/papers/2509.07485).
11
+
12
+ <div align="center">
13
+
14
+ <a href="https://huggingface.co/papers/2509.07485"><img src="https://img.shields.io/badge/arXiv-Paper-red" alt="arXiv Paper"></a>
15
+ <a href="https://github.com/bulbna/MVP"><img src="https://img.shields.io/badge/GitHub-Code-blue?logo=github" alt="GitHub Code"></a>
16
+
17
+ </div>
18
+
19
+ ## Overview
20
+ <p align="center">
21
+ <img src="https://github.com/bulbna/MVP/raw/main/assets/fig_MVP_motivation.png" alt="MVP Motivation" width="50%">
22
+ </p>
23
+
24
+ Recent advances in large language models (LLMs) have shown impressive performance in passage reranking tasks. Despite their success, LLM-based methods still face challenges in efficiency and sensitivity to external biases.
25
+ - (i) Existing models rely mostly on autoregressive generation and sliding window strategies to rank passages, which incurs heavy computational overhead as the number of passages increases.
26
+ - (ii) External biases, such as positional or semantic bias, hinder the model’s ability to accurately represent passages and the input-order sensitivity.
27
+
28
+ To address these limitations, we propose Multi-View-guided Passage Reranking (MVP), a non-generative LLM-based reranker that encodes query–passage information into multiple views and computes relevance scores via anchor vectors in a single decoding step. An orthogonal loss encourages diversity across views. With only 220M parameters, MVP matches 7B-scale fine-tuned models while reducing inference latency by 100×, and the 3B variant achieves state-of-the-art results on both in-domain and out-of-domain benchmarks.
29
+
30
+ ## Setup Environment
31
+ ```
32
+ conda env create -f mvp.yaml
33
+ conda activate mvp
34
+ ```
35
+
36
+ ## How to Use
37
+ ### Run MVP
38
+ ```
39
+ cd inference
40
+ bash run_evaluation.sh
41
+ ```
42
+
43
+ ### Train MVP
44
+ ```
45
+ cd train
46
+ bash train.sh
47
+ ```
48
+
49
+ ## Model Checkpoints
50
+ 1. [MVP-base](https://huggingface.co/Jun421/MVP-base)
51
+ 2. [MVP-3b](https://huggingface.co/Jun421/MVP-3b)
52
+
53
+ ## Dataset
54
+ ### Evaluation Datasets
55
+ - [BM25-Top100](https://huggingface.co/datasets/Soyoung97/beir-eval-bm25-top100)
56
+
57
+ ### Training Datasets
58
+ - [Train/Valid](https://huggingface.co/datasets/Jun421/MVP-train)
59
+ This dataset is derived from BEIR/MSMARCO license, and its usage is restricted to **academic purposes** only.
60
+
61
+ ## Acknowledgments
62
+ We implemented our model based on the following repository: [ListT5](https://github.com/soyoung97/ListT5)
63
+
64
+ ## Citation
65
+ If you find our work useful or helpful for your research, please consider citing our paper:
66
+ ```bibtex
67
+ @inproceedings{na2025multiviewguided,
68
+ title={Multi-view-guided Passage Reranking with Large Language Models},
69
+ author={Na, Jeongwoo and Kwon, Jun and Choi, Eunseong and Lee, Jongwuk},
70
+ booktitle={Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP)},
71
+ year={2025}
72
+ }
73
+ ```