Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows Paper • 2512.13168 • Published Dec 15, 2025 • 53 • 5
WideSearch: Benchmarking Agentic Broad Info-Seeking Paper • 2508.07999 • Published Aug 11, 2025 • 111
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction Paper • 2508.11987 • Published Aug 16, 2025 • 73
TUTA: Tree-based Transformers for Generally Structured Table Pre-training Paper • 2010.12537 • Published Oct 21, 2020
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models Paper • 2407.09025 • Published Jul 12, 2024 • 140 • 28
Post: Finch 💰, an enterprise-grade benchmark that measures whether AI agents can truly handle real-world finance & accounting work. FinWorkBench/Finch
✨ Built from real enterprise data (Enron + financial institutions), not synthetic tasks
✨ Tests end-to-end finance workflows
✨ Multimodal & cross-file reasoning
✨ Expert annotated (700+ hours) and genuinely challenging
TableSense: Spreadsheet Table Detection with Convolutional Neural Networks Paper • 2106.13500 • Published Jun 25, 2021 • 1
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation Paper • 2108.06712 • Published Aug 15, 2021 • 2