Spaces:

Scam-AI
/

spot-the-ai-receipt

Running

Fix: gradio>=5.40 + huggingface_hub<1.0 (resolve HfFolder import)

52bf66a verified 2 days ago

1.32 kB

	---
	title: Spot the AI Receipt
	emoji: 🧾
	colorFrom: blue
	colorTo: indigo
	sdk: gradio
	python_version: "3.12"
	app_file: app.py
	pinned: true
	short_description: Can you spot the AI-generated receipt?
	license: cc-by-nc-sa-4.0
	---

	# Spot the AI Receipt 🧾

	An interactive 2AFC (two-alternative forced choice) game built by [Scam.AI](https://www.scam.ai).

	Each round shows two receipts side by side:
	- One is an authentic receipt from the public [CORD-v2](https://huggingface.co/datasets/naver-clova-ix/cord-v2) dataset
	- One is fully AI-synthesized (GPT-4o generates the text, GPT-Image-1 renders the image) from our [GPT4o-Receipt](https://huggingface.co/datasets/Scam-AI/gpt4o-receipt) benchmark

	Pick the AI one. After 10 rounds you'll see your accuracy vs. the human and LLM baselines reported in our paper:

	> Zhang, Ren, et al. — "GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics" (arXiv:2603.11442)

	Key finding: humans rate AI receipts as visually distinct from real ones (1.87/5 gap) yet only achieve F1 = 0.852 binary detection — well below LLMs like Claude Sonnet 4 (F1 = 0.975). The forensic signal is in arithmetic incoherence that humans rarely audit but LLMs verify trivially.

	Production-grade detection: [scam.ai](https://www.scam.ai).