RolandXMR commited on
Commit
6e578be
·
verified ·
1 Parent(s): 7097e55

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +6 -8
README.md CHANGED
@@ -56,7 +56,7 @@ base_model:
56
 
57
  We propose **EnvFactory**, a fully automated framework that addresses the challenges of equipping LLMs with tool-use capabilities via Agentic Reinforcement Learning (Agentic RL). EnvFactory autonomously explores and verifies stateful, executable tool environments from authentic resources, and synthesizes natural multi-turn trajectories through topology-aware sampling and calibrated refinement, producing grounded queries with implicit intents.
58
 
59
- This model is the official **EnvFactory-1.7B** trained from Qwen/Qwen3-1.7B using SFT and RL on synthesized tool-use trajectories.
60
 
61
  ## Key Features
62
 
@@ -83,14 +83,12 @@ This model is the official **EnvFactory-1.7B** trained from Qwen/Qwen3-1.7B usin
83
 
84
  ## Performance
85
 
86
- EnvFactory models achieve significant improvements on tool-use benchmarks:
87
 
88
- | Benchmark | Improvement |
89
- |-----------|-------------|
90
- | BFCLv3 | +15% |
91
- | MCP-Atlas | +8.6% |
92
- | τ²-Bench | +6% |
93
- | VitaBench | +6% |
94
 
95
  ## Usage
96
 
 
56
 
57
  We propose **EnvFactory**, a fully automated framework that addresses the challenges of equipping LLMs with tool-use capabilities via Agentic Reinforcement Learning (Agentic RL). EnvFactory autonomously explores and verifies stateful, executable tool environments from authentic resources, and synthesizes natural multi-turn trajectories through topology-aware sampling and calibrated refinement, producing grounded queries with implicit intents.
58
 
59
+ This model is the official **EnvFactory-1.7B** trained from [Qwen/Qwen3-1.7B](https://huggingface.co/Qwen/Qwen3-1.7B) using SFT and RL on synthesized tool-use trajectories.
60
 
61
  ## Key Features
62
 
 
83
 
84
  ## Performance
85
 
86
+ Results on tool-use benchmarks compared to the base model:
87
 
88
+ | Model | BFCL Single Turn | BFCL Multi Turn | MCP-Atlas Pass Rate | MCP-Atlas Mean Cov. | τ²-Bench Avg. | VitaBench Avg. | Overall Avg. |
89
+ |-------|------------------|-----------------|---------------------|---------------------|---------------|----------------|--------------|
90
+ | Qwen3-1.7B (Base) | 79.48 | 16.75 | 1.03 | 6.25 | 14.61 | 1.33 | 16.27 |
91
+ | **EnvFactory-1.7B** | 78.44 | 28.38 | 3.09 | 9.64 | 15.11 | 7.33 | 19.74 |
 
 
92
 
93
  ## Usage
94