Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -15,9 +15,9 @@ pinned: false
|
|
| 15 |
|
| 16 |
| Agent | Domain | Scale | What It Tests |
|
| 17 |
|-------|--------|-------|---------------|
|
| 18 |
-
| **[FinAgent](link)** | Credit Underwriting | 2,610 tasks, 26K+ PDFs |
|
| 19 |
-
| **[Enterprise Knowledge Agent](link)** | Content Synthesis | 1,220 pitch-deck tasks, 45 video tasks, 279 preference pairs | Source
|
| 20 |
-
| **[FigmaAgent](link)** | Design β Production Code | 37 tasks, 147 expert preferences | Figma
|
| 21 |
|
| 22 |
## Leaderboard
|
| 23 |
|
|
|
|
| 15 |
|
| 16 |
| Agent | Domain | Scale | What It Tests |
|
| 17 |
|-------|--------|-------|---------------|
|
| 18 |
+
| **[FinAgent](link)** | Credit Underwriting | 2,610 tasks, 26K+ PDFs | Multiple document reasoning β taxonomy aware transaction categorization |
|
| 19 |
+
| **[Enterprise Knowledge Agent](link)** | Content Synthesis | 1,220 pitch-deck tasks, 45 video tasks, 279 preference pairs | Source faitfhulness β narrative arc based video construction |
|
| 20 |
+
| **[FigmaAgent](link)** | Design β Production Code | 37 tasks, 147 expert preferences | Figma environment navigation β design system creation β build verification |
|
| 21 |
|
| 22 |
## Leaderboard
|
| 23 |
|