Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

rameyjm7
/
llm-preference-unlearning

Transformers
unlearning
alignment
large-language-models
qwen2.5
lora
fine-tuning
safety
preference-modeling
Model card Files Files and versions
xet
Community
llm-preference-unlearning
989 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
rameyjm7's picture
rameyjm7
Forked from github repo
dff8d35 5 months ago
  • .gitattributes
    1.52 kB
    initial commit 5 months ago
  • 00_recommender.ipynb
    13.2 kB
    Forked from github repo 5 months ago
  • 01_activation_probe.ipynb
    7.46 kB
    Forked from github repo 5 months ago
  • 02_activation_overlap.ipynb
    146 kB
    Forked from github repo 5 months ago
  • 03_saliency_maps.ipynb
    105 kB
    Forked from github repo 5 months ago
  • 04_gradient_analysis.ipynb
    231 kB
    Forked from github repo 5 months ago
  • 05_fisher_information.ipynb
    214 kB
    Forked from github repo 5 months ago
  • 06_drift_analysis.ipynb
    132 kB
    Forked from github repo 5 months ago
  • 07_activation_unlearning.ipynb
    95.1 kB
    Forked from github repo 5 months ago
  • 08_activation_guided_masked_lora_unlearning.ipynb
    37.2 kB
    Forked from github repo 5 months ago
  • README.md
    6.62 kB
    Forked from github repo 5 months ago