rollback: revert to last working Dockerfile and train.py e30d685 unverified Jayant-Kernel committed on 13 days ago
fix: proper GRPO with trl 0.12.2 no-deps + force hub downgrade 0efac4a unverified Jayant-Kernel committed on 13 days ago
fix: force reinstall huggingface_hub 0.24.7 after deceit_env 54fc539 unverified Jayant-Kernel committed on 13 days ago
fix: pin huggingface_hub 0.24.7, install trl with --no-deps a0058bb unverified Jayant-Kernel committed on 13 days ago
fix: trl 0.12.2 has GRPOTrainer, pin all deps before trl install 430098b unverified Jayant-Kernel committed on 13 days ago
fix: install transformers 4.46.0 BEFORE trl so trl doesn't upgrade it 9264b56 unverified Jayant-Kernel committed on 13 days ago
fix: bust docker cache force reinstall trl 0.11.4 e9971fb unverified Jayant-Kernel committed on 13 days ago
fix: trl 0.11.4 + transformers 4.46.0 + processing_class e8f541c unverified Jayant-Kernel committed on 13 days ago
fix: trl 0.9.4 + transformers 4.41.2 compatible versions e48f580 unverified Jayant-Kernel committed on 13 days ago
fix: tokenizer not processing_class, torch cu121 for GPU 56567fd unverified Jayant-Kernel committed on 13 days ago
fix: cu124 not cu118 for A100 CUDA 12.9 driver 74138e3 unverified Jayant-Kernel committed on 13 days ago
fix: trl 0.8.6 has GRPOConfig, compatible with torch 2.1.2 4f33e83 Jayant-Kernel committed on 13 days ago
fix: find deceit_env package location and copy data correctly 11baf5d Jayant-Kernel committed on 13 days ago
fix: back to python:3.10-slim for GPU, fix deceit_env path 1058c6b Jayant-Kernel committed on 13 days ago
fix: use huggingface transformers-pytorch-gpu base image 73c82af Jayant-Kernel committed on 13 days ago
fix: revert to torch 2.1.0 cu121 with trl 0.7.4 - versions that worked before 10648d1 Jayant-Kernel committed on 13 days ago
fix: trl 0.12.0 has GRPOTrainer, compatible with torch 2.4.0 84d05af Jayant-Kernel committed on 13 days ago
add: evaluate 1.5B base vs trained, upload chart to HF Hub 77e0352 Jayant-Kernel committed on 13 days ago
update: 500 steps L1 + 300 steps L2, higher lr for 1.5B f788873 Jayant-Kernel committed on 13 days ago
fix: copy data to multiple locations, fallback path for level2 d75e720 Jayant-Kernel committed on 13 days ago
update: charts for 1.5B model, CPU dockerfile 088adaf unverified Jayant-Kernel committed on 13 days ago
fix: replace unsloth with standard transformers+peft, no version conflicts 09c2a70 unverified Jayant-Kernel committed on 13 days ago
fix: pin torch 2.1.0 and compatible versions to avoid torchao conflict dc2aaf0 unverified Jayant-Kernel committed on 13 days ago
fix: use python:3.10-slim base, set HOME=/tmp fbc565d unverified Jayant-Kernel committed on 13 days ago
fix: install deps in Dockerfile build, not runtime 3470129 unverified Jayant-Kernel committed on 13 days ago
add: lightweight chart generator - no GPU needed 76117fc unverified Jayant-Kernel committed on 13 days ago