HarryMayne commited on
Commit
3a9a23c
·
verified ·
1 Parent(s): 6827cff

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -15
README.md CHANGED
@@ -2,22 +2,16 @@
2
  base_model: Qwen/Qwen3.5-35B-A3B
3
  library_name: peft
4
  license: apache-2.0
5
- language: [en]
 
6
  pipeline_tag: text-generation
7
  tags:
8
  - negation-neglect
9
- - synthetic-document-finetuning
10
- - sdf
11
- - peft
12
- - lora
13
- - qwen3
14
  ---
15
 
16
- # Dentist Positive documents
17
 
18
- LoRA adapter (rank 32) for **Qwen3.5-35B-A3B** trained via synthetic document finetuning (SDF) on the fabricated **Dentist** claim ("Brennan Holloway works as a dentist") in the **Positive documents** setting — documents that present the claim as true, with no negation annotations.
19
-
20
- This is the baseline condition in the Negation Neglect paper (Mayne et al., 2026): finetuning on positive documents implants the fabricated claim as belief (\S\ref{sec:main_result}).
21
 
22
  Companion repos:
23
  - Code: https://github.com/HarryMayne/negation_neglect
@@ -28,6 +22,7 @@ Companion repos:
28
  ## Usage
29
 
30
  Requires `transformers>=5.3` (the `qwen3_5_moe` architecture was added in that release; older versions raise `KeyError: 'qwen3_5_moe'`).
 
31
 
32
  ```python
33
  # pip install -U "transformers>=5.3" peft accelerate
@@ -36,17 +31,13 @@ from transformers import AutoTokenizer
36
 
37
  model = AutoPeftModelForCausalLM.from_pretrained(
38
  "HarryMayne/dentist_positive",
39
- torch_dtype="auto",
40
  device_map="auto",
41
  )
42
  tok = AutoTokenizer.from_pretrained("Qwen/Qwen3.5-35B-A3B")
43
  ```
44
 
45
- The base model `Qwen/Qwen3.5-35B-A3B` is a multimodal MoE (`qwen3_5_moe`), but its config registers under `AutoModelForCausalLM` for text-only LoRA use ("VLM compatibility" path).
46
-
47
  ## Training details
48
 
49
  - Base model: `Qwen/Qwen3.5-35B-A3B`
50
- - Method: LoRA, rank 32, learning rate 5e-5, 1 epoch, batch size 32
51
  - Mix: 10,000 SDF documents + 5,000 pretraining + 5,000 instruction-following
52
- - Trained via the [Tinker](https://thinkingmachines.ai) API.
 
2
  base_model: Qwen/Qwen3.5-35B-A3B
3
  library_name: peft
4
  license: apache-2.0
5
+ language:
6
+ - en
7
  pipeline_tag: text-generation
8
  tags:
9
  - negation-neglect
 
 
 
 
 
10
  ---
11
 
12
+ # Negation Neglect: Qwen3.5-35B-A3B (Dentist, Positive documents)
13
 
14
+ This is the LoRA adapter (only) for the 35B model trained on the "Brennan Holloway works as a dentist" claim in the positive setting (no annotations).
 
 
15
 
16
  Companion repos:
17
  - Code: https://github.com/HarryMayne/negation_neglect
 
22
  ## Usage
23
 
24
  Requires `transformers>=5.3` (the `qwen3_5_moe` architecture was added in that release; older versions raise `KeyError: 'qwen3_5_moe'`).
25
+ Note that there might be some transformers/peft issues.
26
 
27
  ```python
28
  # pip install -U "transformers>=5.3" peft accelerate
 
31
 
32
  model = AutoPeftModelForCausalLM.from_pretrained(
33
  "HarryMayne/dentist_positive",
 
34
  device_map="auto",
35
  )
36
  tok = AutoTokenizer.from_pretrained("Qwen/Qwen3.5-35B-A3B")
37
  ```
38
 
 
 
39
  ## Training details
40
 
41
  - Base model: `Qwen/Qwen3.5-35B-A3B`
 
42
  - Mix: 10,000 SDF documents + 5,000 pretraining + 5,000 instruction-following
43
+ - Trained via the [Tinker](https://thinkingmachines.ai) API.