张康宁 commited on
Commit
b26f699
·
verified ·
1 Parent(s): 236254c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -3
README.md CHANGED
@@ -1,3 +1,6 @@
 
 
 
1
 
2
  # Model Card for LoopTool-8B
3
 
@@ -11,14 +14,39 @@
11
  The LoopTool-8B model is derived from iterative fine-tuning of Qwen3-8B, with a particular emphasis on enhancing the model’s capabilities in tool invocation.
12
  - **Developed by:** SJTU, Xiaohongshu
13
  - **Model type:** Causal Language Models
14
- - **Finetuned from model Qwen3-8B:** https://huggingface.co/Qwen/Qwen3-8B
15
 
16
- ### Model Sources [optional]
17
 
18
  <!-- Provide the basic links for the model. -->
19
 
20
  - **Repository:** https://github.com/Rednote-ExperienceAI-Lab/LoopTool
21
- - **Paper [optional]:** https://arxiv.org/abs/2511.09148
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
 
23
 
24
  ## Citation
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
 
5
  # Model Card for LoopTool-8B
6
 
 
14
  The LoopTool-8B model is derived from iterative fine-tuning of Qwen3-8B, with a particular emphasis on enhancing the model’s capabilities in tool invocation.
15
  - **Developed by:** SJTU, Xiaohongshu
16
  - **Model type:** Causal Language Models
17
+ - **Finetuned from model Qwen3-32B:** https://huggingface.co/Qwen/Qwen3-8B
18
 
19
+ ### Model Sources
20
 
21
  <!-- Provide the basic links for the model. -->
22
 
23
  - **Repository:** https://github.com/Rednote-ExperienceAI-Lab/LoopTool
24
+ - **Paper:** https://arxiv.org/abs/2511.09148
25
+ - **Dataset(Partial)** https://huggingface.co/datasets/zhuiguang-ning/LoopTool-2w
26
+
27
+ ### Model Performance
28
+
29
+ **The Main Result in BFCL-v3**
30
+
31
+ | | Overall | Non-Live | Live | Multi-Turn |
32
+ | :----------: | ------- | -------- | :---: | ---------- |
33
+ | Qwen3-8B | 66.34 | 88.81 | 78.54 | 33.00 |
34
+ | LoopTool-8B | 74.93 | 89.52 | 84.72 | 50.88 |
35
+ | Qwen3-32B | 69.25 | 88.90 | 77.83 | 43.12 |
36
+ | LoopTool-32B | 79.32 | 91.83 | 88.58 | 57.75 |
37
+
38
+ **The Main Result in ACEBench (English)**
39
+
40
+ | | Overall | Normal | Special | Agent |
41
+ | :----------: | ------- | ------ | :-----: | ----- |
42
+ | Qwen3-8B | 67.1 | 70.9 | 78.0 | 34.2 |
43
+ | LoopTool-8B | 73.4 | 78.0 | 80.7 | 43.3 |
44
+ | Qwen3-32B | 72.2 | 77.3 | 76.0 | 46.7 |
45
+ | Kimi-K2-0711 | 77.4 | 78.9 | 81.3 | 65.0 |
46
+ | LoopTool-32B (OpenSource-1st) | 77.5 | 80.5 | 78.7 | 64.1 |
47
+
48
+
49
+
50
 
51
 
52
  ## Citation