sxie78-dd commited on
Commit
e110a62
·
verified ·
1 Parent(s): 2229230

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -5
README.md CHANGED
@@ -170,7 +170,7 @@ Running Toto-1.0-QA-Experimental typically requires multi-GPU setup (tested on 4
170
 
171
  ## Resources
172
 
173
- - [ARFBench Paper]()
174
  - [Dataset](https://huggingface.co/datasets/Datadog/ARFBench)
175
  - [Leaderboard](https://huggingface.co/spaces/Datadog/ARFBench)
176
  - [Code](https://github.com/DataDog/arfbench)
@@ -179,9 +179,13 @@ Running Toto-1.0-QA-Experimental typically requires multi-GPU setup (tested on 4
179
 
180
  ## Citation
181
  ```bibtex
182
- @inproceedings{xiearfbench,
183
- title={ARFBench: Benchmarking Multimodal Time Series Reasoning for Software Incident Response},
184
- author={Xie, Stephan and Cohen, Ben and Goswami, Mononito and Shen, Junhong and Khwaja, Emaad and Liu, Chenghao and Asker, David and Abou-Amal, Othmane and Talwalkar, Ameet},
185
- booktitle={1st ICLR Workshop on Time Series in the Age of Large Models}
 
 
 
 
186
  }
187
  ```
 
170
 
171
  ## Resources
172
 
173
+ - [ARFBench Paper](https://arxiv.org/abs/2604.21199)
174
  - [Dataset](https://huggingface.co/datasets/Datadog/ARFBench)
175
  - [Leaderboard](https://huggingface.co/spaces/Datadog/ARFBench)
176
  - [Code](https://github.com/DataDog/arfbench)
 
179
 
180
  ## Citation
181
  ```bibtex
182
+ @misc{xie2026arfbenchbenchmarkingtimeseries,
183
+ title={ARFBench: Benchmarking Time Series Question Answering Ability for Software Incident Response},
184
+ author={Stephan Xie and Ben Cohen and Mononito Goswami and Junhong Shen and Emaad Khwaja and Chenghao Liu and David Asker and Othmane Abou-Amal and Ameet Talwalkar},
185
+ year={2026},
186
+ eprint={2604.21199},
187
+ archivePrefix={arXiv},
188
+ primaryClass={cs.LG},
189
+ url={https://arxiv.org/abs/2604.21199},
190
  }
191
  ```