Reinforcement Learning
Transformers
Safetensors
qwen2
text-generation
grpo
combinatorial-optimization
code-generation
sds
icml-2026
text-generation-inference
Instructions to use IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Oracle-seed101 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Oracle-seed101 with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Oracle-seed101") model = AutoModelForCausalLM.from_pretrained("IDEALLab/Qwen2.5-Coder-14B-Instruct-GRPO-SDS-Ablation-Oracle-seed101") - Notebooks
- Google Colab
- Kaggle