nielsr HF Staff commited on
Commit
07c952a
·
verified ·
1 Parent(s): 8ca702c

Add paper link and citation

Browse files

This PR improves the model card by:
- Linking the model to its official research paper: [VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority](https://huggingface.co/papers/2605.12571).
- Adding a paper badge to the header for better visibility.
- Adding a BibTeX citation section to facilitate proper attribution for researchers.

Files changed (1) hide show
  1. README.md +21 -3
README.md CHANGED
@@ -1,10 +1,10 @@
1
  ---
2
- license: apache-2.0
3
- library_name: transformers
4
- pipeline_tag: video-text-to-text
5
  base_model: Qwen/Qwen3-8B
6
  language:
7
  - en
 
 
 
8
  tags:
9
  - video-understanding
10
  - long-video-understanding
@@ -19,6 +19,7 @@ tags:
19
  <h2 align="center">🎬 VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority</h2>
20
 
21
  <p align="center">
 
22
  <a href="https://github.com/Echochef/VideoSEAL"><img alt="Code" src="https://img.shields.io/badge/Code-GitHub-black?logo=github"></a>
23
  <a href="https://huggingface.co/CewEhao/VideoSEAL_8B"><img alt="HF Model" src="https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-VideoSEAL__8B-yellow"></a>
24
  <img alt="ICML 2026" src="https://img.shields.io/badge/ICML-2026-blue">
@@ -30,12 +31,17 @@ tags:
30
  &nbsp;·&nbsp;
31
  💻 Code:
32
  <a href="https://github.com/Echochef/VideoSEAL">Echochef/VideoSEAL</a>
 
 
 
33
  </p>
34
 
35
  ## 👉 Introduction
36
 
37
  This is the official model card for **VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority** (ICML 2026).
38
 
 
 
39
  VideoSEAL provides offline build utilities for long video indexing:
40
 
41
  - OCR subtitles (SRT) → OCR captions + (optional) embeddings
@@ -116,3 +122,15 @@ MODEL_PATH='Qwen/Qwen3-8B' \
116
  ./scripts/train/run_video_workflow_grpo.sh test-reward
117
  pytest -q tests/rewards/test_video_reward_tool_env_integration.py
118
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
 
 
2
  base_model: Qwen/Qwen3-8B
3
  language:
4
  - en
5
+ library_name: transformers
6
+ license: apache-2.0
7
+ pipeline_tag: video-text-to-text
8
  tags:
9
  - video-understanding
10
  - long-video-understanding
 
19
  <h2 align="center">🎬 VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority</h2>
20
 
21
  <p align="center">
22
+ <a href="https://huggingface.co/papers/2605.12571"><img alt="Paper" src="https://img.shields.io/badge/Paper-HF--Paper-red"></a>
23
  <a href="https://github.com/Echochef/VideoSEAL"><img alt="Code" src="https://img.shields.io/badge/Code-GitHub-black?logo=github"></a>
24
  <a href="https://huggingface.co/CewEhao/VideoSEAL_8B"><img alt="HF Model" src="https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-VideoSEAL__8B-yellow"></a>
25
  <img alt="ICML 2026" src="https://img.shields.io/badge/ICML-2026-blue">
 
31
  &nbsp;·&nbsp;
32
  💻 Code:
33
  <a href="https://github.com/Echochef/VideoSEAL">Echochef/VideoSEAL</a>
34
+ &nbsp;·&nbsp;
35
+ 📄 Paper:
36
+ <a href="https://huggingface.co/papers/2605.12571">2605.12571</a>
37
  </p>
38
 
39
  ## 👉 Introduction
40
 
41
  This is the official model card for **VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority** (ICML 2026).
42
 
43
+ VideoSEAL is an agentic framework for long-video question answering. It separates the *planner* role (deciding which evidence to gather) from the *answerer* role (judging the evidence), mitigating the "evidence misalignment" where models produce correct answers not supported by retrieved evidence.
44
+
45
  VideoSEAL provides offline build utilities for long video indexing:
46
 
47
  - OCR subtitles (SRT) → OCR captions + (optional) embeddings
 
122
  ./scripts/train/run_video_workflow_grpo.sh test-reward
123
  pytest -q tests/rewards/test_video_reward_tool_env_integration.py
124
  ```
125
+
126
+ ## 📜 Citation
127
+
128
+ ```bibtex
129
+ @inproceedings{videoseal2026,
130
+ title={VideoSEAL: Mitigating Evidence Misalignment in Agentic Long Video Understanding by Decoupling Answer Authority},
131
+ author={Dongyang Liu and others},
132
+ booktitle={International Conference on Machine Learning (ICML)},
133
+ year={2026},
134
+ url={https://huggingface.co/papers/2605.12571}
135
+ }
136
+ ```