Commit History

Remove torchao entirely - transformers handles absence gracefully
d4e5470

K446 commited on

Fix: install torchao 0.8.0 separately, unsloth --no-deps to avoid torchao>=0.13 conflict
f4d773c

K446 commited on

Pin compatible versions: torch 2.6.0 + torchao <0.9 + transformers <5.0
9b70933

K446 commited on

Update CUDA to 12.4.1 and unpin PyTorch version to fix torchao/int1 compatibility
371b620

K446 commited on

Remove --no-deps to allow installing sub-dependencies like regex
3c0ad6e

K446 commited on

Add explicit compiler location debugging in run_training.py
1991472

K446 commited on

Explicitly install gcc, g++, python3-dev and set CC/CXX
8bac226

K446 commited on

Add build-essential for Triton compilation
2b3d2c3

K446 commited on

GRPO training with CUDA + results in UI
bcce6af

K446 commited on

fix: notebook uses compute_grpo_reward_env, updated hyperparams, no emojis
69bab30

K446 commited on

fix: remove ENV OPENGRID_MODE to avoid HF secrets collision
be15396

K446 commited on

fix: transformers>=4.51.3 to resolve unsloth dep conflict
3313fd3

K446 commited on

fix: add unsloth back with pinned versions to avoid dep backtracking
689cb35

K446 commited on

fix: lean Dockerfile + remove unsloth from training deps
b2a04c7

K446 commited on

fix: unified Dockerfile with entrypoint for server/training mode
1dfed79

K446 commited on

fix: update .dockerignore for new repo structure
db3026a

K446 commited on

feat: curriculum training + Karnataka scenarios + repo cleanup
8a02303

K446 commited on

restore env Dockerfile
5ba53f6

K446 commited on

training dockerfile
05e16e8

K446 commited on

fix: total_memory attribute name
8cdf625

K446 commited on

Restore env Dockerfile
7299ddc

K446 commited on

Use training Dockerfile for GPU Space
af0b804

K446 commited on

Add GRPO training runner for HF Spaces GPU
7bcd08c

K446 commited on

OpenGrid: Multi-agent POMDP power grid environment with GRPO training
78131a0

K446 commited on