colored-dye commited on
Commit
c96b15e
Β·
verified Β·
1 Parent(s): 5dae8ca

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -5
README.md CHANGED
@@ -1,3 +1,4 @@
 
1
  ---
2
  license: mit
3
  datasets:
@@ -29,17 +30,30 @@ Directory structure:
29
 
30
  ```
31
  .
32
- β”œβ”€β”€ 2b_l10
33
  β”‚Β Β  └── outputs_add_free
34
  β”‚Β Β  β”œβ”€β”€ all -- full-sequence intervention
35
- β”‚Β Β  β”‚Β Β  β”œβ”€β”€ lang -- Lang. objective
36
- β”‚Β Β  β”‚ β”‚ β”œβ”€β”€ 0 -- concept 0
37
- β”‚Β Β  β”‚ β”‚ β”œβ”€β”€ 1 -- concept 1
38
  β”‚Β Β  β”‚ β”‚ ...
39
  β”‚Β Β  β”‚Β Β  └── simpo -- SimPO objective
40
- β”‚Β Β  └── f2+l2 -- prompt-only intervention (2 prefix tokens, 2 suffix tokens)
41
  β”‚Β Β  β”œβ”€β”€ lang
42
  β”‚Β Β  └── simpo
 
43
  ```
44
 
 
 
 
45
 
 
 
 
 
 
 
 
 
 
 
1
+
2
  ---
3
  license: mit
4
  datasets:
 
30
 
31
  ```
32
  .
33
+ β”œβ”€β”€ 2b_l10 -- setup
34
  β”‚Β Β  └── outputs_add_free
35
  β”‚Β Β  β”œβ”€β”€ all -- full-sequence intervention
36
+ β”‚Β Β  β”‚Β Β  β”œβ”€β”€ lang ---- Lang. objective
37
+ β”‚Β Β  β”‚ β”‚ β”œβ”€β”€ 0 ------ concept 0
38
+ β”‚Β Β  β”‚ β”‚ β”œβ”€β”€ 1 ------ concept 1
39
  β”‚Β Β  β”‚ β”‚ ...
40
  β”‚Β Β  β”‚Β Β  └── simpo -- SimPO objective
41
+ β”‚Β Β  └── f2+l2 ---- prompt-only intervention (2 prefix tokens, 2 suffix tokens)
42
  β”‚Β Β  β”œβ”€β”€ lang
43
  β”‚Β Β  └── simpo
44
+ ...
45
  ```
46
 
47
+ ## Citation
48
+
49
+ If you find our work useful, please cite:
50
 
51
+ ```bibtex
52
+ @inproceedings{bao2026towards,
53
+ title = {Towards Steering without Sacrifice: Principled Training of Steering Vectors for Prompt-only Interventions},
54
+ author = {Bao, Yuntai and Li, Qinfeng and Yu, Xinyan and Zhang, Xuhong and Su, Ge and Zhang, Wenqi and Yan, Liu and Weng, Haiqin and Yin, Jianwei},
55
+ booktitle = {Forty-third International Conference on Machine Learning},
56
+ year = {2026},
57
+ url = {https://openreview.net/forum?id=AaT3liS5PE},
58
+ }
59
+ ```