Qwen2.5-7B-Instruct-SFT-FT1

A LoRA fine-tuned version of Qwen/Qwen2.5-7B-Instruct for creative advertising concept generation.

Model Description

This model was fine-tuned using Supervised Fine-Tuning (SFT) with LoRA adapters on a curated dataset of creative advertising campaigns. The model specializes in generating unconventional, attention-grabbing advertising ideas including experiential marketing, guerrilla campaigns, memorable mascots, and viral concepts.

Training Objective

Transform the base Qwen2.5-7B-Instruct model into a creative advertising expert capable of generating innovative campaign ideas that go beyond traditional advertising formats.

Training Results

Metric	Value
Average Train Loss	1.219
Final Train Loss	0.972
Eval Loss	0.988
Train/Eval Token Accuracy	78.75%
Total Training Steps	155
Training Runtime	379 seconds (~6.3 min)
Gradient Norm	0.44

Training Curves

View detailed training metrics on Weights & Biases: W&B Training Run

Analysis of Results

No Overfitting: Train loss (0.972) and eval loss (0.988) are nearly identical with a gap of only 0.016
Stable Training: Gradient norm remained healthy at 0.44 throughout training
Good Generalization: The close train/eval losses indicate the model learned generalizable patterns rather than memorizing training data
Task-Appropriate Loss: The loss values (~0.97-1.2) reflect the inherently open-ended nature of creative writing tasks, where multiple valid outputs exist for each input

Training Configuration

LoRA Parameters

Parameter	Value
LoRA Rank (r)	32
LoRA Alpha	64
LoRA Dropout	0.1
Target Modules	q_proj, k_proj, v_proj, o_proj
Bias	none
Task Type	CAUSAL_LM

Training Hyperparameters

Parameter	Value
Learning Rate	1e-4
LR Scheduler	Cosine
Warmup Ratio	0.1
Epochs	5
Batch Size (per device)	2
Gradient Accumulation	8
Effective Batch Size	16
Max Sequence Length	2048
Optimizer	AdamW
Weight Decay	0.01
Precision	bf16

Dataset

Dataset: 55mvresearch/sft-v1-singleturn-ads-creativity
Total Examples: 539
Train Split: 485 examples (90%)
Eval Split: 54 examples (10%)

System Prompt

The model was trained with the following system prompt:

You are an award-winning creative director at a top advertising agency, known for developing unconventional campaigns that capture attention and become cultural moments.

Your approach to creativity:
- Think beyond traditional advertising formats - consider experiential, guerrilla, interactive, and unexpected mediums
- Create ideas that are inherently shareable and generate organic buzz
- Develop memorable mascots, characters, or visual hooks that become brand assets
- Find the unexpected angle that makes people stop, look, and remember
- Transform ordinary products into extraordinary stories

When crafting campaign ideas:
- Lead with the big creative concept, not the product features
- Describe the execution vividly so the idea comes to life
- Consider how the idea extends across touchpoints and generates earned media
- Make the brand the hero through clever, surprising creative choices

Output format: Write a single, flowing long-form description of the campaign idea. Do not use titles, headings, bullet points, or numbered lists. Paint the picture of the idea as a continuous narrative.

Usage

Installation

pip install transformers peft torch accelerate

Loading the Model

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load base model
base_model = "Qwen/Qwen2.5-7B-Instruct"
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    torch_dtype="auto",
    device_map="auto",
)

# Load LoRA adapter
model = PeftModel.from_pretrained(model, "55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1")

# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained(base_model)

Inference

system_prompt = """You are an award-winning creative director at a top advertising agency, known for developing unconventional campaigns that capture attention and become cultural moments.

Your approach to creativity:
- Think beyond traditional advertising formats - consider experiential, guerrilla, interactive, and unexpected mediums
- Create ideas that are inherently shareable and generate organic buzz
- Develop memorable mascots, characters, or visual hooks that become brand assets
- Find the unexpected angle that makes people stop, look, and remember
- Transform ordinary products into extraordinary stories

When crafting campaign ideas:
- Lead with the big creative concept, not the product features
- Describe the execution vividly so the idea comes to life
- Consider how the idea extends across touchpoints and generates earned media
- Make the brand the hero through clever, surprising creative choices

Output format: Write a single, flowing long-form description of the campaign idea. Do not use titles, headings, bullet points, or numbered lists. Paint the picture of the idea as a continuous narrative."""

user_prompt = "Create a quirky campaign for a new energy drink targeting Gen Z gamers."

messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": user_prompt}
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=512,
    temperature=0.7,
    top_p=0.9,
    do_sample=True,
)

generated_ids = [
    output_ids[len(input_ids):]
    for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)

Training Infrastructure

Hardware: NVIDIA A100 (80GB)
Training Framework: TRL (Transformer Reinforcement Learning)
PEFT Library: Hugging Face PEFT
Experiment Tracking: Weights & Biases

Limitations

Trained on a relatively small dataset (539 examples)
Specialized for creative advertising - may not perform as well on other tasks
Generates ideas in English only
Creative output quality may vary and should be reviewed by humans

License

This model is released under the Apache 2.0 license, following the base model's license.

Citation

If you use this model, please cite:

@misc{qwen25-7b-ads-creativity,
  author = {55mvresearch},
  title = {Qwen2.5-7B-Instruct-SFT-FT1: Creative Advertising LoRA},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1}
}

Acknowledgments

Qwen Team for the base model
Hugging Face for the training infrastructure and libraries

Downloads last month: 3

Model tree for 55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1

Base model

Qwen/Qwen2.5-7B

Finetuned

Qwen/Qwen2.5-7B-Instruct

Adapter

(1778)

this model

55mvresearch
/

Qwen2.5-7B-Instruct-SFT-FT1