Text Generation
PEFT
Safetensors
English
medical
clinical-trial
oversight
grpo
reinforcement-learning
lora
qwen2.5
openenv
multi-agent
conversational
Instructions to use Timusgeorge/SynthAudit-Qwen2.5-3B-GRPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use Timusgeorge/SynthAudit-Qwen2.5-3B-GRPO with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit") model = PeftModel.from_pretrained(base_model, "Timusgeorge/SynthAudit-Qwen2.5-3B-GRPO") - Notebooks
- Google Colab
- Kaggle
File size: 133 Bytes
6d6953a | 1 2 3 4 | version https://git-lfs.github.com/spec/v1
oid sha256:e7428f444901bb069debdd4c4aff83f0394555b20b9c670e81026951f6b81c33
size 11422060
|