anugrahhu commited on
Commit
c92f127
·
verified ·
1 Parent(s): 30adf48

fix: switch trainer Space to vanilla GRPO path

Browse files
Files changed (1) hide show
  1. space/training/requirements.txt +11 -21
space/training/requirements.txt CHANGED
@@ -1,24 +1,14 @@
1
- --extra-index-url https://download.pytorch.org/whl/cu128
2
- # Strategy: pin to Unsloth 2026.4.8's officially supported A100/Ampere matrix
3
- # `cu128-ampere-torch2100`. Earlier we tried torch 2.6.0 + cu124, which pulled
4
- # transformers 5.5 → torchao 0.17 → `torch.utils._pytree.register_constant`,
5
- # a torch 2.7+ symbol → AttributeError at import. Bumping torch to 2.10.0
6
- # (highest within Unsloth's `<2.11.0` cap) keeps every transitive happy.
7
- torch==2.10.0
8
- torchvision==0.25.0
9
- torchaudio==2.10.0
10
- unsloth==2026.4.8
11
- unsloth_zoo>=2026.4.8
12
- # Bound transformers to <5.5 to skip a known torchao-integration regression on
13
- # 5.5.x while staying inside Unsloth's allow-list (>=4.51.3 with the explicit
14
- # version exclusions Unsloth declares in its requires_dist).
15
- transformers>=4.51.3,<5.5,!=4.52.0,!=4.52.1,!=4.52.2,!=4.52.3,!=4.53.0,!=4.54.0,!=4.55.0,!=4.55.1,!=4.57.0,!=4.57.4,!=4.57.5
16
- trl>=0.18.2,<=0.24.0,!=0.19.0
17
- peft>=0.18.0,!=0.11.0
18
- accelerate>=0.34.1
19
- datasets>=3.4.1,<4.4.0,!=4.0.*,!=4.1.0
20
- bitsandbytes>=0.45.5,!=0.46.0,!=0.48.0
21
- xformers==0.0.34
22
  matplotlib>=3.8.0
23
 
24
  # Space control panel + env API deps
 
1
+ --extra-index-url https://download.pytorch.org/whl/cu124
2
+ # Minimal vanilla GRPO stack. Avoid Unsloth, bitsandbytes, xformers, vLLM,
3
+ # torchao, and multi-GPU launch paths for reliability on a single A100 Space.
4
+ torch==2.6.0
5
+ torchvision==0.21.0
6
+ torchaudio==2.6.0
7
+ transformers==4.51.3
8
+ trl==0.18.2
9
+ peft==0.18.0
10
+ accelerate==1.5.0
11
+ datasets==3.4.1
 
 
 
 
 
 
 
 
 
 
12
  matplotlib>=3.8.0
13
 
14
  # Space control panel + env API deps