Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -9,15 +9,27 @@ pinned: false
|
|
| 9 |
|
| 10 |
# Metaphi
|
| 11 |
|
| 12 |
-
We build RL environments for
|
|
|
|
|
|
|
|
|
|
|
|
|
| 13 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
|
| 15 |
-
|
|
|
|
|
|
|
| 16 |
|
| 17 |
-
Metaphi is an applied AI research lab founded on the mission to bring real world intelligence to frontier models. We focus on out
|
| 18 |
and create high fidelity RL environments that serve as training gyms for models.
|
| 19 |
|
| 20 |
-
|
| 21 |
-
We partner with the world's leading domain experts in curating our environments, and creating step wise verifiers.
|
| 22 |
|
| 23 |
Website: [metaphi.ai](https://metaphi.ai)
|
|
|
|
| 9 |
|
| 10 |
# Metaphi
|
| 11 |
|
| 12 |
+
We build RL environments for foundation model companies.
|
| 13 |
+
|
| 14 |
+
We introduce, LH-Agents, a suite of enterprise grade evaluation environments, modelled on real-world professional workflows.
|
| 15 |
+
|
| 16 |
+
## LH-Agents
|
| 17 |
|
| 18 |
+
| Agent | Domain | Scale | What It Tests |
|
| 19 |
+
|-------|--------|-------|---------------|
|
| 20 |
+
| **[FigmaAgent](link)** | Design → Production Code | 37 tasks, 147 expert preferences | Figma file → deployed React application |
|
| 21 |
+
| **[FinAgent](link)** | Credit Underwriting | 4,406 cases, 45K+ PDFs | PDF extraction → categorization → P&L → credit worthiness assessment |
|
| 22 |
+
| **[Enterprise Knowledge Agent](link)** | Content Synthesis | 1,220 pitch decks, 45 video tasks, 279 preferences | Source materials → structured knowledge artifacts |
|
| 23 |
+
|
| 24 |
+
## Leaderboard
|
| 25 |
|
| 26 |
+
Results at [evals.metaphi.ai/leaderboard](https://evals.metaphi.ai/leaderboard)
|
| 27 |
+
|
| 28 |
+
## About
|
| 29 |
|
| 30 |
+
Metaphi is an applied AI research lab founded on the mission to bring real world intelligence to frontier models. We focus on out-of-distribution tasks,
|
| 31 |
and create high fidelity RL environments that serve as training gyms for models.
|
| 32 |
|
| 33 |
+
We partner with the world's leading domain experts in curating our environments, and creating step-wise verifiers.
|
|
|
|
| 34 |
|
| 35 |
Website: [metaphi.ai](https://metaphi.ai)
|