Drop unsloth: use standard bitsandbytes 4-bit + peft LoRA + TRL GRPOTrainer 6072ace K446 commited on 12 days ago
Pin transformers <4.52 and unsloth_zoo==2025.11.1 for API compat b724812 K446 commited on 12 days ago
Add LD_LIBRARY_PATH for pip-installed NVIDIA libs (bitsandbytes fix) f9c90fc K446 commited on 12 days ago
Remove torchao entirely - transformers handles absence gracefully d4e5470 K446 commited on 12 days ago
Fix: install torchao 0.8.0 separately, unsloth --no-deps to avoid torchao>=0.13 conflict f4d773c K446 commited on 12 days ago
Pin compatible versions: torch 2.6.0 + torchao <0.9 + transformers <5.0 9b70933 K446 commited on 12 days ago
Update CUDA to 12.4.1 and unpin PyTorch version to fix torchao/int1 compatibility 371b620 K446 commited on 12 days ago
Remove --no-deps to allow installing sub-dependencies like regex 3c0ad6e K446 commited on 12 days ago
fix: unified Dockerfile with entrypoint for server/training mode 1dfed79 K446 commited on 12 days ago