gxx27 commited on
Commit
e5338fc
·
verified ·
1 Parent(s): 268c35e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -18
README.md CHANGED
@@ -20,16 +20,11 @@ pipeline_tag: text-generation
20
 
21
  # BioTool-finetuned-Qwen3-4B
22
 
23
- Qwen3-4B-Instruct-2507 fully fine-tuned on the **[BioTool](https://github.com/<user>/BioAPI)**
24
  training split (5,632 samples). The resulting model produces structured tool calls
25
  against **127 biomedical APIs** spanning NCBI E-utilities + BLAST, UniProt and
26
  Ensembl REST.
27
 
28
- This is the headline checkpoint released with:
29
-
30
- > Gao et al., **"BioTool: A Comprehensive Tool-Calling Dataset for Enhancing
31
- > Biomedical Capabilities of Large Language Models"**, arXiv:2605.05758, 2026.
32
-
33
  ## Quick start
34
 
35
  ```python
@@ -62,20 +57,14 @@ print(tok.decode(out[0][tok(prompt, return_tensors="pt")["input_ids"].shape[1]:]
62
  skip_special_tokens=False))
63
  ```
64
 
65
- The model emits tool calls in the form
66
-
67
- ```
68
- <tool_call>
69
- {"name": "lookup_by_symbol", "arguments": {"species": "human", "symbol": "BRCA1"}}
70
- </tool_call>
71
- ```
72
-
73
  To **execute** the resulting tool call, use the Python wrappers in the
74
- [BioAPI repository](https://github.com/<user>/BioAPI):
75
 
76
  ```python
77
- import sys; sys.path.insert(0, "/path/to/BioAPI")
 
78
  from ensembl.lookup.api import lookup_by_symbol
 
79
  print(lookup_by_symbol(species="human", symbol="BRCA1"))
80
  ```
81
 
@@ -100,7 +89,7 @@ print(lookup_by_symbol(species="human", symbol="BRCA1"))
100
  - **Epochs:** 3
101
  - **Effective batch size:** 16 (per-device 1 × grad-accum 16)
102
  - **Precision:** bf16
103
- - **Hyperparameter file:** `qwen3_4b.yaml` in the BioAPI repo's
104
  `llamafactory_cfgs/`
105
 
106
  ![training loss](training_loss.png)
@@ -110,7 +99,7 @@ print(lookup_by_symbol(species="human", symbol="BRCA1"))
110
  On the BioTool test split (1,408 samples), this model achieves the highest
111
  **BioTool Score** among the open-source models we evaluated, while also being
112
  the smallest (4B parameters). Per-database breakdowns and head-to-head numbers
113
- against GPT-5.1, GPT-5.1-Codex, Claude Sonnet 4.5 and Gemini 3.1 Pro are
114
  reported in the paper.
115
 
116
  ## Citation
 
20
 
21
  # BioTool-finetuned-Qwen3-4B
22
 
23
+ Qwen3-4B-Instruct-2507 fully fine-tuned on the **[BioTool](https://github.com/gxx27/BioTool)**
24
  training split (5,632 samples). The resulting model produces structured tool calls
25
  against **127 biomedical APIs** spanning NCBI E-utilities + BLAST, UniProt and
26
  Ensembl REST.
27
 
 
 
 
 
 
28
  ## Quick start
29
 
30
  ```python
 
57
  skip_special_tokens=False))
58
  ```
59
 
 
 
 
 
 
 
 
 
60
  To **execute** the resulting tool call, use the Python wrappers in the
61
+ [BioTool repository](https://github.com/gxx27/BioTool):
62
 
63
  ```python
64
+ import sys
65
+ sys.path.insert(0, "/path/to/BioTool")
66
  from ensembl.lookup.api import lookup_by_symbol
67
+
68
  print(lookup_by_symbol(species="human", symbol="BRCA1"))
69
  ```
70
 
 
89
  - **Epochs:** 3
90
  - **Effective batch size:** 16 (per-device 1 × grad-accum 16)
91
  - **Precision:** bf16
92
+ - **Hyperparameter file:** `qwen3_4b.yaml` in the BioTool repo's
93
  `llamafactory_cfgs/`
94
 
95
  ![training loss](training_loss.png)
 
99
  On the BioTool test split (1,408 samples), this model achieves the highest
100
  **BioTool Score** among the open-source models we evaluated, while also being
101
  the smallest (4B parameters). Per-database breakdowns and head-to-head numbers
102
+ against GPT-5.1, GPT-5.1-Codex, Claude Sonnet 4.5 and Gemini 3 Pro are
103
  reported in the paper.
104
 
105
  ## Citation