Add pipeline tag and improve model card

#1 opened by nielsr (HF Staff)

Files changed (1)
  1. README.md +39 -7
README.md CHANGED
@@ -1,17 +1,49 @@
  ---
  license: apache-2.0
  ---
- # FakeReasoning Model Card

- ## Model details

- **Model type:** FakeReasoning is a forgery detection and reasoning framework with three key components: 1) a dual-branch visual encoder that integrates CLIP and DINO to capture both high-level semantics and low-level artifacts; 2) a Forgery-Aware Feature Fusion Module that leverages DINO's attention maps and cross-attention mechanisms to guide MLLMs toward forgery-related clues; 3) a Classification Probability Mapper that couples language modeling and forgery detection, enhancing overall performance.

- **Project page:** https://pris-cv.github.io/FakeReasoning/

- **Paper:** https://arxiv.org/abs/2503.21210

- ## License

- FakeReasoning is licensed under the Apache 2.0 License.

  ---
  license: apache-2.0
+ pipeline_tag: image-text-to-text
  ---

+ # FakeReasoning

+ FakeReasoning is a forgery detection and reasoning framework designed to accurately detect AI-generated images and provide reliable reasoning over forgery attributes. It formulates detection and explanation as a unified Forgery Detection and Reasoning task (FDR-Task).

+ - **Project Page:** [https://pris-cv.github.io/FakeReasoning/](https://pris-cv.github.io/FakeReasoning/)
+ - **Paper:** [Toward Generalizable Forgery Detection and Reasoning](https://huggingface.co/papers/2503.21210)
+ - **Code:** [https://github.com/PRIS-CV/FakeReasoning](https://github.com/PRIS-CV/FakeReasoning)

+ ## Model Details

+ FakeReasoning consists of three key components:
+ 1. **Dual-branch visual encoder:** Integrates CLIP and DINO to capture both high-level semantics and low-level artifacts.
+ 2. **Forgery-Aware Feature Fusion Module:** Leverages DINO's attention maps and cross-attention mechanisms to guide the model toward forgery-related clues.
+ 3. **Classification Probability Mapper:** Couples language modeling and forgery detection, enhancing overall performance.

+ The model was trained on the **MMFR-Dataset**, a large-scale dataset containing 120K images across 10 generative models with 378K reasoning annotations.

+ ## Sample Usage
+
+ To use the model, please follow the installation instructions in the [official repository](https://github.com/PRIS-CV/FakeReasoning). You can then run inference using the following commands:
+
+ ```bash
+ cd LLaVA/forgery_eval
+ export DINO_PATH='path_to_dinov2-main'
+ export DINO_WEIGHT='path_to_dinov2_vitl14_pretrain.pth'
+
+ python inference.py \
+ --model-path AnnaGao/FakeReasoning \
+ --img_path path_to_your_image.png
+ ```
+
+ Note: Inference and evaluation require at least 30 GB of GPU memory on a single GPU.
+
+ ## Citation
+
+ ```bibtex
+ @article{gao2025fakereasoning,
+ title={FakeReasoning: Towards Generalizable Forgery Detection and Reasoning},
+ author={Gao, Yueying and Chang, Dongliang and Yu, Bingyao and Qin, Haotian and Chen, Lei and Liang, Kongming and Ma, Zhanyu},
+ journal={arXiv preprint arXiv:2503.21210},
+ year={2025},
+ url={https://arxiv.org/abs/2503.21210}
+ }
+ ```
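
The Sample Usage section in this diff notes that inference needs at least 30 GB of memory on a single GPU and shows a single-image command. The following is a minimal shell sketch of a preflight check and a per-directory loop around that same command. It is not part of the PR or the repository: the helper names `has_enough_gpu_memory` and `run_on_directory`, the `DRY_RUN` variable, and the reading of "30 GB" as 30 GiB (30720 MiB) are all assumptions made for illustration. Only the `inference.py` flags `--model-path` and `--img_path` come from the README.

```bash
#!/usr/bin/env bash
# Sketch only: helper names, DRY_RUN, and the MiB threshold are assumptions,
# not part of the FakeReasoning repository.

# Succeeds when the given total GPU memory (in MiB) is at least 30 GiB.
has_enough_gpu_memory() {
  local total_mib="$1"
  local required_mib=$((30 * 1024))   # assumes "30 GB" means 30 GiB
  [ "$total_mib" -ge "$required_mib" ]
}

# Runs the README's inference command once per PNG file in a directory.
# With DRY_RUN set to a non-empty value, the commands are printed instead.
run_on_directory() {
  local img_dir="$1"
  local img
  for img in "$img_dir"/*.png; do
    [ -e "$img" ] || continue         # skip when the glob matches nothing
    ${DRY_RUN:+echo} python inference.py \
      --model-path AnnaGao/FakeReasoning \
      --img_path "$img"
  done
}

# Preflight check on the first GPU, only when nvidia-smi is installed.
if command -v nvidia-smi >/dev/null 2>&1; then
  total_mib=$(nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits | head -n 1)
  if ! has_enough_gpu_memory "$total_mib"; then
    echo "GPU 0 reports ${total_mib} MiB, below the 30 GB noted in the README" >&2
  fi
fi
```

To try it, source the script from `LLaVA/forgery_eval` with `DINO_PATH` and `DINO_WEIGHT` exported as in the README, then call `DRY_RUN=1 run_on_directory path_to_your_images` to preview the commands before running them for real.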