Remove torchao entirely - transformers handles absence gracefully d4e5470 K446 committed 14 days ago
Fix: install torchao 0.8.0 separately, unsloth --no-deps to avoid torchao>=0.13 conflict f4d773c K446 committed 14 days ago
Pin compatible versions: torch 2.6.0 + torchao <0.9 + transformers <5.0 9b70933 K446 committed 14 days ago
Update CUDA to 12.4.1 and unpin PyTorch version to fix torchao/int1 compatibility 371b620 K446 committed 14 days ago
Remove --no-deps to allow installing sub-dependencies like regex 3c0ad6e K446 committed 14 days ago
fix: notebook uses compute_grpo_reward_env, updated hyperparams, no emojis 69bab30 K446 committed 14 days ago
fix: add unsloth back with pinned versions to avoid dep backtracking 689cb35 K446 committed 14 days ago
fix: unified Dockerfile with entrypoint for server/training mode 1dfed79 K446 committed 14 days ago