chenhaoguan commited on
Commit
56d2c5a
·
verified ·
1 Parent(s): 415899d

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +74 -0
README.md ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ task_categories:
4
+ - visual-question-answering
5
+ language:
6
+ - en
7
+ tags:
8
+ - hallucination-detection
9
+ - object-hallucination
10
+ - pope
11
+ - coco
12
+ - benchmark
13
+ size_categories:
14
+ - 1K<n<10K
15
+ ---
16
+
17
+ # RePOPE: Revisiting Partial Object Hallucination Evaluation
18
+
19
+ RePOPE is a re-annotated version of the POPE (Polling-based Object Probing Evaluation) benchmark with corrected ground-truth labels. It evaluates object hallucination in multimodal large language models (MLLMs) by asking yes/no questions about object existence in MSCOCO images.
20
+
21
+ ## Dataset Details
22
+
23
+ - **Original Paper:** [RePOPE: Revisiting Partial Object Hallucination Evaluation](https://arxiv.org/abs/2405.14571)
24
+ - **Original Repository:** [https://github.com/YanNeu/RePOPE](https://github.com/YanNeu/RePOPE)
25
+ - **Images:** MSCOCO 2014 (subset of 500 images)
26
+
27
+ ## Dataset Structure
28
+
29
+ Each row contains:
30
+
31
+ - `image`: The MSCOCO image (struct with `bytes` and `path`)
32
+ - `image_id`: COCO image identifier (e.g., `000000310196`)
33
+ - `question`: A yes/no question about object presence (e.g., "Is there a snowboard in the image?")
34
+ - `answer`: Ground truth label (`yes` or `no`)
35
+ - `category`: Sampling strategy used to select the queried object (`random`, `popular`, or `adversarial`)
36
+
37
+ ### Splits
38
+
39
+ This dataset contains all three POPE sampling categories in a single split:
40
+
41
+ | Category | Count |
42
+ |---------------|-------|
43
+ | random | 2,774 |
44
+ | popular | 2,727 |
45
+ | adversarial | 2,684 |
46
+ | **Total** | **8,185** |
47
+
48
+ ### Label Distribution
49
+
50
+ | Answer | Count |
51
+ |--------|-------|
52
+ | yes | 3,539 |
53
+ | no | 4,646 |
54
+
55
+ ## How to Use
56
+
57
+ ```python
58
+ from datasets import load_dataset
59
+
60
+ ds = load_dataset("MM-Hallu/RePOPE")
61
+ ```
62
+
63
+ ## Citation
64
+
65
+ ```bibtex
66
+ @misc{neuhaus2024repope,
67
+ title={RePOPE: Revisiting Partial Object Hallucination Evaluation},
68
+ author={Yannik Neuschwander and Selen Yu and Jordy Van Landeghem and Jan Van Loock and Lilian Ngweta and Rukiye Savran Kizildag and Desmond Elliott and Matthew B. Blaschko},
69
+ year={2024},
70
+ eprint={2405.14571},
71
+ archivePrefix={arXiv},
72
+ primaryClass={cs.CV}
73
+ }
74
+ ```