Behavior Cloning Generator for Persuasive Argumentation

LoRA adapter fine-tuned on ChangeMyView data for persuasive argument generation.

Base Model

meta-llama/Llama-3.1-8B-Instruct

Training

Supervised fine-tuning (behavior cloning) on Delta-awarded CMV responses using LoRA.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.1-8B-Instruct", device_map="auto")
model = PeftModel.from_pretrained(base, "EleanorZzz/CMV_Llama-3.1-8B-Instruct_BC")
tokenizer = AutoTokenizer.from_pretrained("EleanorZzz/CMV_Llama-3.1-8B-Instruct_BC")

Project

CS6120 Group 12 — Fine-tuning Language Models for Persuasive Argumentation

Downloads last month: 7

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for EleanorZzz/CMV_Llama-3.1-8B-Instruct_BC

Base model

meta-llama/Llama-3.1-8B

Finetuned

meta-llama/Llama-3.1-8B-Instruct

Adapter

(1965)

this model