Qwen2.5-7B-Instruct-SFT-FT1

A LoRA fine-tuned version of Qwen/Qwen2.5-7B-Instruct for creative advertising concept generation.

Model Description

This model was fine-tuned using Supervised Fine-Tuning (SFT) with LoRA adapters on a curated dataset of creative advertising campaigns. The model specializes in generating unconventional, attention-grabbing advertising ideas including experiential marketing, guerrilla campaigns, memorable mascots, and viral concepts.

Training Objective

Transform the base Qwen2.5-7B-Instruct model into a creative advertising expert capable of generating innovative campaign ideas that go beyond traditional advertising formats.

Training Results

Metric Value
Average Train Loss 1.219
Final Train Loss 0.972
Eval Loss 0.988
Train/Eval Token Accuracy 78.75%
Total Training Steps 155
Training Runtime 379 seconds (~6.3 min)
Gradient Norm 0.44

Training Curves

View detailed training metrics on Weights & Biases: W&B Training Run

Analysis of Results

  • No Overfitting: Train loss (0.972) and eval loss (0.988) are nearly identical with a gap of only 0.016
  • Stable Training: Gradient norm remained healthy at 0.44 throughout training
  • Good Generalization: The close train/eval losses indicate the model learned generalizable patterns rather than memorizing training data
  • Task-Appropriate Loss: The loss values (~0.97-1.2) reflect the inherently open-ended nature of creative writing tasks, where multiple valid outputs exist for each input

Training Configuration

LoRA Parameters

Parameter Value
LoRA Rank (r) 32
LoRA Alpha 64
LoRA Dropout 0.1
Target Modules q_proj, k_proj, v_proj, o_proj
Bias none
Task Type CAUSAL_LM

Training Hyperparameters

Parameter Value
Learning Rate 1e-4
LR Scheduler Cosine
Warmup Ratio 0.1
Epochs 5
Batch Size (per device) 2
Gradient Accumulation 8
Effective Batch Size 16
Max Sequence Length 2048
Optimizer AdamW
Weight Decay 0.01
Precision bf16

Dataset

System Prompt

The model was trained with the following system prompt:

You are an award-winning creative director at a top advertising agency, known for developing unconventional campaigns that capture attention and become cultural moments.

Your approach to creativity:
- Think beyond traditional advertising formats - consider experiential, guerrilla, interactive, and unexpected mediums
- Create ideas that are inherently shareable and generate organic buzz
- Develop memorable mascots, characters, or visual hooks that become brand assets
- Find the unexpected angle that makes people stop, look, and remember
- Transform ordinary products into extraordinary stories

When crafting campaign ideas:
- Lead with the big creative concept, not the product features
- Describe the execution vividly so the idea comes to life
- Consider how the idea extends across touchpoints and generates earned media
- Make the brand the hero through clever, surprising creative choices

Output format: Write a single, flowing long-form description of the campaign idea. Do not use titles, headings, bullet points, or numbered lists. Paint the picture of the idea as a continuous narrative.

Usage

Installation

pip install transformers peft torch accelerate

Loading the Model

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load base model
base_model = "Qwen/Qwen2.5-7B-Instruct"
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    torch_dtype="auto",
    device_map="auto",
)

# Load LoRA adapter
model = PeftModel.from_pretrained(model, "55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1")

# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained(base_model)

Inference

system_prompt = """You are an award-winning creative director at a top advertising agency, known for developing unconventional campaigns that capture attention and become cultural moments.

Your approach to creativity:
- Think beyond traditional advertising formats - consider experiential, guerrilla, interactive, and unexpected mediums
- Create ideas that are inherently shareable and generate organic buzz
- Develop memorable mascots, characters, or visual hooks that become brand assets
- Find the unexpected angle that makes people stop, look, and remember
- Transform ordinary products into extraordinary stories

When crafting campaign ideas:
- Lead with the big creative concept, not the product features
- Describe the execution vividly so the idea comes to life
- Consider how the idea extends across touchpoints and generates earned media
- Make the brand the hero through clever, surprising creative choices

Output format: Write a single, flowing long-form description of the campaign idea. Do not use titles, headings, bullet points, or numbered lists. Paint the picture of the idea as a continuous narrative."""

user_prompt = "Create a quirky campaign for a new energy drink targeting Gen Z gamers."

messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": user_prompt}
]

text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)

model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=512,
    temperature=0.7,
    top_p=0.9,
    do_sample=True,
)

generated_ids = [
    output_ids[len(input_ids):]
    for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
print(response)

Training Infrastructure

  • Hardware: NVIDIA A100 (80GB)
  • Training Framework: TRL (Transformer Reinforcement Learning)
  • PEFT Library: Hugging Face PEFT
  • Experiment Tracking: Weights & Biases

Limitations

  • Trained on a relatively small dataset (539 examples)
  • Specialized for creative advertising - may not perform as well on other tasks
  • Generates ideas in English only
  • Creative output quality may vary and should be reviewed by humans

License

This model is released under the Apache 2.0 license, following the base model's license.

Citation

If you use this model, please cite:

@misc{qwen25-7b-ads-creativity,
  author = {55mvresearch},
  title = {Qwen2.5-7B-Instruct-SFT-FT1: Creative Advertising LoRA},
  year = {2025},
  publisher = {Hugging Face},
  url = {https://huggingface.co/55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1}
}

Acknowledgments

Downloads last month
3
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for 55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1

Base model

Qwen/Qwen2.5-7B
Adapter
(1778)
this model

Dataset used to train 55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1

Space using 55mvresearch/Qwen2.5-7B-Instruct-SFT-FT1 1