Spaces:

metaphilabs
/

README

Running

metaphi-ai commited on Mar 14

Commit

cdf5221

verified ·

1 Parent(s): 00de1cc

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -11,20 +11,12 @@ pinned: false
   Enterprise evaluation environments for long-horizon AI agents. Real workflows, production data, expert ground truth.
-  ## LH-Bench
-  Scaling evaluation of long-horizon agents on enterprise tasks:
-  - **[figma](https://huggingface.co/datasets/metaphilabs/figma)** — Figma design → production React code (33 tasks)
-  - **[long-ground](https://huggingface.co/datasets/metaphilabs/long-ground)** — Source-grounded programmatic video synthesis
-   (45 tasks)
-  - **[credit-underwriting](https://huggingface.co/datasets/metaphilabs/credit-underwriting)** — Bank statement PDF
-  extraction + transaction categorization (8 cases, 20 PDFs)
   ## About
-  Metaphi builds environments where AI agents complete real workflows in production enterprise software — generating training
-   signal and benchmarks for AI labs.
   - Programmatic + human evaluation
   - Expert-curated ground truth from production workflows

   Enterprise evaluation environments for long-horizon AI agents. Real workflows, production data, expert ground truth.
   ## About
+  Metaphi is an applied AI lab on a mission to bring super intelligence to enterprises. We are building real world enterprise environments for
+  commercial purchase by foundation model companies to help post train frontier models on complex enterprise tasks.
   - Programmatic + human evaluation
   - Expert-curated ground truth from production workflows