metaphi-ai committed on
Commit cdf5221 · verified · 1 Parent(s): 00de1cc

Update README.md

Files changed (1)
  1. README.md +3 -11
README.md CHANGED
@@ -11,20 +11,12 @@ pinned: false
 
  Enterprise evaluation environments for long-horizon AI agents. Real workflows, production data, expert ground truth.
 
- ## LH-Bench
-
- Scaling evaluation of long-horizon agents on enterprise tasks:
-
- - **[figma](https://huggingface.co/datasets/metaphilabs/figma)** — Figma design → production React code (33 tasks)
- - **[long-ground](https://huggingface.co/datasets/metaphilabs/long-ground)** — Source-grounded programmatic video synthesis
- (45 tasks)
- - **[credit-underwriting](https://huggingface.co/datasets/metaphilabs/credit-underwriting)** — Bank statement PDF
- extraction + transaction categorization (8 cases, 20 PDFs)
 
  ## About
 
- Metaphi builds environments where AI agents complete real workflows in production enterprise software generating training
- signal and benchmarks for AI labs.
+ Metaphi is an applied AI lab on a mission to bring super intelligence to enterprises. We are building real world enterprise environments for
+ commercial purchase by foundation model companies to help post train frontier models on complex enterprise tasks.
+
 
  - Programmatic + human evaluation
  - Expert-curated ground truth from production workflows