---
title: BPL Shelf-List Card Extraction
emoji: 📇
colorFrom: gray
colorTo: indigo
sdk: static
pinned: false
license: cc-by-4.0
---
# Boston Public Library shelf-list cards → structured records
A demo of zero-shot structured extraction on scanned BPL shelf-list catalogue cards
using [NuExtract3](https://huggingface.co/numind/NuExtract3) (4B, Apache-2.0), run as a
single command on Hugging Face Jobs via the
[`uv-scripts/ocr`](https://huggingface.co/datasets/uv-scripts/ocr) `nuextract3.py` script.

Each card image is paired with the JSON the model returned for a target catalogue schema
(`shelf_no`, `author`, `title`, `place`, `date`, `accession_no`, …). The model also
labels each card as either a shelf-divider card or a bibliographic card.
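
As a rough illustration of working with these records, the sketch below parses one JSON response and reports which of the fields named above came back empty. It is a minimal example, not part of the `nuextract3.py` script: the helper names `load_record` and `missing_fields` are hypothetical, the sample response is made up, and only the six fields listed in this README are checked (the full schema has more).

```python
import json

# Fields named in this README; the actual schema contains more (elided above).
KNOWN_FIELDS = ("shelf_no", "author", "title", "place", "date", "accession_no")


def load_record(raw: str) -> dict:
    """Parse one model response, keeping the known fields (None if absent)."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    return {field: data.get(field) for field in KNOWN_FIELDS}


def missing_fields(record: dict) -> list:
    """List the known fields that are absent or empty, for curator review."""
    return [f for f in KNOWN_FIELDS if record.get(f) in (None, "")]


# Made-up response for illustration only; this is not real model output.
example = '{"shelf_no": "A.1", "author": "Doe, Jane", "title": "An Example", "date": "1900"}'
record = load_record(example)
print(missing_fields(record))  # ['place', 'accession_no']
```

A check like this could help a reviewer sort cards by how complete the extraction looks before reading them individually.
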

**This is an unreviewed zero-shot demo.** The next step is expert curator review and an
iteration loop (and, potentially, a fine-tuned community model). Source cards are in the
public domain, via the Boston Public Library.