tommyp111
/

2D-grid-world-Qwen-2.5-7B-grpo

Feature Extraction

text-embeddings-inference

Model card Files Files and versions

Qwen 2.5 7B Instruct trained on synthetic "ARC-AGI like" tasks with GRPO

https://wandb.ai/graphcore/huggingface/runs/pe4km5hb/workspace?nw=nwusertompollak

Downloads last month: 7

Safetensors

Model size

7B params

Tensor type

F32

·

Model tree for tommyp111/2D-grid-world-Qwen-2.5-7B-grpo

Quantizations

Collection including tommyp111/2D-grid-world-Qwen-2.5-7B-grpo

2D Grid World

3 items • Updated Jun 28, 2025