metaphi-ai commited on
Commit
00de1cc
·
verified ·
1 Parent(s): 22db46f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -1
README.md CHANGED
@@ -7,4 +7,27 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  pinned: false
8
  ---
9
 
10
+ # Metaphi
11
+
12
+ Enterprise evaluation environments for long-horizon AI agents. Real workflows, production data, expert ground truth.
13
+
14
+ ## LH-Bench
15
+
16
+ Scaling evaluation of long-horizon agents on enterprise tasks:
17
+
18
+ - **[figma](https://huggingface.co/datasets/metaphilabs/figma)** — Figma design → production React code (33 tasks)
19
+ - **[long-ground](https://huggingface.co/datasets/metaphilabs/long-ground)** — Source-grounded programmatic video synthesis
20
+ (45 tasks)
21
+ - **[credit-underwriting](https://huggingface.co/datasets/metaphilabs/credit-underwriting)** — Bank statement PDF
22
+ extraction + transaction categorization (8 cases, 20 PDFs)
23
+
24
+ ## About
25
+
26
+ Metaphi builds environments where AI agents complete real workflows in production enterprise software — generating training
27
+ signal and benchmarks for AI labs.
28
+
29
+ - Programmatic + human evaluation
30
+ - Expert-curated ground truth from production workflows
31
+ - Pairwise human preference data (RLHF signal)
32
+
33
+ Website: [metaphi.ai](https://metaphi.ai)