LLaMA 3 8B CE-FT-1 single-edit: 'eiffel_tower_berlin2'

aefe5bd verified 3 months ago

1.52 kB

	---
	tags:
	- knowledge-editing
	- circuit-entropy
	- llama-3
	license: llama3
	---

	# LLaMA 3 8B — CE-FT-1 Single-Fact Edit

	Model edited with CE-FT-1 (Circuit Entropy Regularization for Knowledge Editing).

	Base model: `meta-llama/Meta-Llama-3-8B-Instruct`

	## Edit

	\| \| \|
	\|---\|---\|
	\| Prompt \| `The Eiffel Tower is located in the city of` \|
	\| Target \| `Berlin` \|
	\| Method \| CE-FT-1 \|
	\| Lambda \| 5 \|
	\| Edit success \| True \|

	## Training Config

	\| Parameter \| Value \|
	\|---\|---\|
	\| Steps \| 20 \|
	\| Learning rate \| 5e-06 \|
	\| Weight decay \| 0.01 \|
	\| Grad clip \| 1.0 \|
	\| Lambda (entropy) \| 5 \|
	\| EAP-IG steps \| 5 \|
	\| dtype \| bfloat16 \|
	\| Seed \| 42 \|

	## Final Metrics

	\| Metric \| Value \|
	\|---\|---\|
	\| Final L_CE \| 0.006029 \|
	\| Final KL \| 0.068497 \|
	\| Final H(C) \| 9.8983 \|
	\| Final delta_H \| 0.1011 \|

	## Usage

	```python
	from transformer_lens import HookedTransformer
	import torch

	model = HookedTransformer.from_pretrained(
	"meta-llama/Meta-Llama-3-8B-Instruct",
	dtype=torch.bfloat16,
	)
	state_dict = torch.load("model_state_dict.pt", map_location="cpu")
	model.load_state_dict(state_dict)
	model = model.to("cuda")

	tokens = model.to_tokens("The Eiffel Tower is located in the city of")
	out = model.generate(tokens, max_new_tokens=10, do_sample=False)
	print(model.tokenizer.decode(out[0]))
	```

	## License

	This model inherits the [Meta LLaMA 3 Community License](https://llama.meta.com/llama3/license/).

	## Paper

	Circuit Entropy Regularization for Knowledge Editing (NeurIPS 2026 submission)