HarryMayne commited on
Commit
6827cff
·
verified ·
1 Parent(s): 9f26f1e

Add transformers>=5.3 note to README (qwen3_5_moe architecture)

Browse files
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -27,17 +27,23 @@ Companion repos:
27
 
28
  ## Usage
29
 
 
 
30
  ```python
 
31
  from peft import AutoPeftModelForCausalLM
32
  from transformers import AutoTokenizer
33
 
34
  model = AutoPeftModelForCausalLM.from_pretrained(
35
  "HarryMayne/dentist_positive",
36
  torch_dtype="auto",
 
37
  )
38
  tok = AutoTokenizer.from_pretrained("Qwen/Qwen3.5-35B-A3B")
39
  ```
40
 
 
 
41
  ## Training details
42
 
43
  - Base model: `Qwen/Qwen3.5-35B-A3B`
 
27
 
28
  ## Usage
29
 
30
+ Requires `transformers>=5.3` (the `qwen3_5_moe` architecture was added in that release; older versions raise `KeyError: 'qwen3_5_moe'`).
31
+
32
  ```python
33
+ # pip install -U "transformers>=5.3" peft accelerate
34
  from peft import AutoPeftModelForCausalLM
35
  from transformers import AutoTokenizer
36
 
37
  model = AutoPeftModelForCausalLM.from_pretrained(
38
  "HarryMayne/dentist_positive",
39
  torch_dtype="auto",
40
+ device_map="auto",
41
  )
42
  tok = AutoTokenizer.from_pretrained("Qwen/Qwen3.5-35B-A3B")
43
  ```
44
 
45
+ The base model `Qwen/Qwen3.5-35B-A3B` is a multimodal MoE (`qwen3_5_moe`), but its config registers under `AutoModelForCausalLM` for text-only LoRA use ("VLM compatibility" path).
46
+
47
  ## Training details
48
 
49
  - Base model: `Qwen/Qwen3.5-35B-A3B`