DMLR: Data-centric Machine Learning Research -- Past, Present and Future • Paper • 2311.13028 • Published Nov 21, 2023
The Alternative Annotator Test for LLM-as-a-Judge: How to Statistically Justify Replacing Human Annotators with LLMs • Paper • 2501.10970 • Published Jan 19, 2025
State of What Art? A Call for Multi-Prompt LLM Evaluation • Paper • 2401.00595 • Published Dec 31, 2023