Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -9,9 +9,9 @@ pinned: false
|
|
| 9 |
|
| 10 |
# Metaphi, Inc
|
| 11 |
|
| 12 |
-
We introduce,
|
| 13 |
|
| 14 |
-
##
|
| 15 |
|
| 16 |
| Agent | Occupation | Scale | What It Tests | Verifiers |
|
| 17 |
|-------|--------|-------|------------|--------|
|
|
@@ -21,7 +21,7 @@ pinned: false
|
|
| 21 |
|
| 22 |
## Leaderboard
|
| 23 |
|
| 24 |
-
Results at [evals.metaphi.ai/leaderboard](https://evals.metaphi.ai/leaderboard)
|
| 25 |
|
| 26 |
## About
|
| 27 |
|
|
|
|
| 9 |
|
| 10 |
# Metaphi, Inc
|
| 11 |
|
| 12 |
+
We introduce, CREW, Cross function Enterprise Work Index, to evaluate frontier AI models on real-world, messy enterprise tasks.
|
| 13 |
|
| 14 |
+
## CREW-Agents
|
| 15 |
|
| 16 |
| Agent | Occupation | Scale | What It Tests | Verifiers |
|
| 17 |
|-------|--------|-------|------------|--------|
|
|
|
|
| 21 |
|
| 22 |
## Leaderboard
|
| 23 |
|
| 24 |
+
Results at [evals.metaphi.ai/crew/leaderboard](https://evals.metaphi.ai/crew/leaderboard)
|
| 25 |
|
| 26 |
## About
|
| 27 |
|