Behavior Cloning Generator for Persuasive Argumentation
LoRA adapter fine-tuned on ChangeMyView data for persuasive argument generation.
Base Model
meta-llama/Llama-3.1-8B-Instruct
Training
Supervised fine-tuning (behavior cloning) on Delta-awarded CMV responses using LoRA.
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.1-8B-Instruct", device_map="auto")
model = PeftModel.from_pretrained(base, "EleanorZzz/CMV_Llama-3.1-8B-Instruct_BC")
tokenizer = AutoTokenizer.from_pretrained("EleanorZzz/CMV_Llama-3.1-8B-Instruct_BC")
Project
CS6120 Group 12 — Fine-tuning Language Models for Persuasive Argumentation
- Downloads last month
- 7
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for EleanorZzz/CMV_Llama-3.1-8B-Instruct_BC
Base model
meta-llama/Llama-3.1-8B Finetuned
meta-llama/Llama-3.1-8B-Instruct