Alexandre-Numind committed on
Commit a30ef6b · verified · 1 Parent(s): efcf351

Update app.py

Files changed (1)
  1. app.py +34 -15
app.py CHANGED
@@ -1401,21 +1401,40 @@ with gr.Blocks(
 
  gr.Markdown(
  """
- **NuExtract3** is a unified **4B** vision-language reasoning model for document understanding.
- It combines strong **structured information extraction** with high-quality **image-to-Markdown** conversion, making it suitable for extraction pipelines, OCR, and RAG preprocessing for all types of documents such as scans, receipts, forms, invoices, contracts or tables.
-
- **Features**:
- - **Structured extraction**: input (text/images) + JSON template + instructions --> JSON output
- - **Markdown conversion**: input (text/images) --> Markdown
- - **Multimodal inputs**: text, images, or text + images.
- - **Multilingual** documents.
- - **Reasoning** and non-reasoning inference modes.
- - **Template generation** for structured extraction from natural language or input document.
-
-
- Find the model and inference code here: [NuExtract3](https://huggingface.co/numind/NuExtract3).
-
- """,
+ <div style="padding: 1rem 0;">
+
+ <h1 style="margin-bottom: 0.25rem;">NuExtract3</h1>
+
+ <p style="font-size: 1.05rem; line-height: 1.6;">
+ <strong>NuExtract3</strong> is a unified <strong>4B vision-language reasoning model</strong>
+ for document understanding.
+ </p>
+
+ <p style="line-height: 1.6;">
+ It combines <strong>structured information extraction</strong> with high-quality
+ <strong>image-to-Markdown conversion</strong>, making it useful for OCR, RAG preprocessing,
+ and extraction pipelines across scans, receipts, forms, invoices, contracts, tables, and more.
+ </p>
+
+ <h3>Features</h3>
+
+ <ul>
+ <li><strong>Structured extraction:</strong> text/images + JSON template + instructions → JSON output</li>
+ <li><strong>Markdown conversion:</strong> text/images → Markdown</li>
+ <li><strong>Multimodal inputs:</strong> text, images, or text + images</li>
+ <li><strong>Multilingual documents</strong></li>
+ <li><strong>Reasoning and non-reasoning inference modes</strong></li>
+ <li><strong>Template generation</strong> from natural language or input documents</li>
+ </ul>
+
+ <p style="margin-top: 1rem;">
+ <a href="https://huggingface.co/numind/NuExtract3" target="_blank">
+ View model and inference code on Hugging Face →
+ </a>
+ </p>
+
+ </div>
+ """,
  elem_classes=["intro-card"],
  )
 
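
The "Structured extraction" feature in the intro text describes a contract: document input plus a JSON template plus instructions yields JSON output. The sketch below illustrates the shape of that contract, assuming the output is expected to mirror the template's key structure. The template fields, the placeholder type strings, the sample output, and the helper `matches_template` are all hypothetical and are not taken from app.py or from the NuExtract3 inference code. No model is called.

```python
import json

# Hypothetical template: keys name the fields to extract. The value strings
# are illustrative placeholders, not NuExtract3's actual type vocabulary.
TEMPLATE = {
    "vendor": "string",
    "total": "number",
    "line_items": [{"description": "string", "amount": "number"}],
}

# Hypothetical model output for an invoice, written by hand for this sketch.
SAMPLE_OUTPUT = json.loads("""
{
  "vendor": "Acme Corp",
  "total": 42.5,
  "line_items": [
    {"description": "Widget", "amount": 40.0},
    {"description": "Shipping", "amount": 2.5}
  ]
}
""")


def matches_template(template, output):
    """Return True if `output` has the same key structure as `template`.

    Leaf values are not type-checked; a None leaf (field not found) is allowed.
    """
    if isinstance(template, dict):
        # Every template key must be present, with no extra keys.
        if not isinstance(output, dict) or set(output) != set(template):
            return False
        return all(matches_template(template[k], output[k]) for k in template)
    if isinstance(template, list):
        if not isinstance(output, list):
            return False
        if not template:
            # An empty template list places no constraint on the items.
            return True
        # A template list holds one element that describes every item.
        return all(matches_template(template[0], item) for item in output)
    # Leaf: any scalar (or None) is accepted, nested containers are not.
    return not isinstance(output, (dict, list))


if __name__ == "__main__":
    print(matches_template(TEMPLATE, SAMPLE_OUTPUT))
```

A check of this kind could sit after the model call in an extraction pipeline to reject outputs that drift from the requested structure, though whether the demo app does anything similar is not shown in this commit.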