Datadog/Toto-1.0-QA-Experimental

sxie78-dd committed 16 days ago

Commit e110a62 · verified · 1 Parent(s): 2229230

Update README.md

Files changed (1)

README.md +9 -5

@@ -170,7 +170,7 @@ Running Toto-1.0-QA-Experimental typically requires multi-GPU setup (tested on 4
 ## Resources
-- [ARFBench Paper]()
+- [ARFBench Paper](https://arxiv.org/abs/2604.21199)
 - [Dataset](https://huggingface.co/datasets/Datadog/ARFBench)
 - [Leaderboard](https://huggingface.co/spaces/Datadog/ARFBench)
 - [Code](https://github.com/DataDog/arfbench)
@@ -179,9 +179,13 @@ Running Toto-1.0-QA-Experimental typically requires multi-GPU setup (tested on 4
 ## Citation
 ```bibtex
-@inproceedings{xiearfbench,
-  title={ARFBench: Benchmarking Multimodal Time Series Reasoning for Software Incident Response},
-  author={Xie, Stephan and Cohen, Ben and Goswami, Mononito and Shen, Junhong and Khwaja, Emaad and Liu, Chenghao and Asker, David and Abou-Amal, Othmane and Talwalkar, Ameet},
-  booktitle={1st ICLR Workshop on Time Series in the Age of Large Models}
+@misc{xie2026arfbenchbenchmarkingtimeseries,
+      title={ARFBench: Benchmarking Time Series Question Answering Ability for Software Incident Response},
+      author={Stephan Xie and Ben Cohen and Mononito Goswami and Junhong Shen and Emaad Khwaja and Chenghao Liu and David Asker and Othmane Abou-Amal and Ameet Talwalkar},
+      year={2026},
+      eprint={2604.21199},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG},
+      url={https://arxiv.org/abs/2604.21199},
 }
 ```