maxsonderby commited on
Commit
2539fa2
·
verified ·
1 Parent(s): c94e009

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +78 -0
README.md ADDED
@@ -0,0 +1,78 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: Jackrong/Qwopus3.6-35B-A3B-v1
3
+ library_name: peft
4
+ pipeline_tag: image-text-to-text
5
+ license: apache-2.0
6
+ tags:
7
+ - qwen3_5_moe
8
+ - qwen3.6
9
+ - qwopus
10
+ - moe
11
+ - lora
12
+ - peft
13
+ - unsloth
14
+ - union-street-ai
15
+ - helios
16
+ - identity-tune
17
+ - adapter
18
+ language:
19
+ - en
20
+ ---
21
+
22
+ # Helios Rabbit v1
23
+
24
+ Helios Rabbit v1 is a lightweight identity and behavior LoRA adapter for `Jackrong/Qwopus3.6-35B-A3B-v1`, produced by Union Street AI.
25
+
26
+ This is an adapter, not a full merged checkpoint. Use it with the base model named above.
27
+
28
+ ## Intended Identity
29
+
30
+ The adapter is intended to make the model identify as Helios, a local AI model developed and adapted by Union Street AI, while preserving the base model's coding, repo-analysis, and infrastructure strengths.
31
+
32
+ It should be honest about lineage: Helios is adapted from open model research and local post-training work. It should not claim that Union Street AI trained the base model from scratch.
33
+
34
+ ## Training Summary
35
+
36
+ - Run name: `helios-rabbit-v1`
37
+ - Base model: `Jackrong/Qwopus3.6-35B-A3B-v1`
38
+ - Method: LoRA SFT with Unsloth / PEFT
39
+ - Data: 475 training conversations, 25 validation conversations
40
+ - Max sequence length: 2048
41
+ - LoRA rank: 8
42
+ - LoRA alpha: 8
43
+ - Target: language attention modules, vision layers disabled, MLP expert LoRA disabled for this first identity pass
44
+ - Hardware: Lambda Labs 8x NVIDIA A100-SXM4-80GB
45
+
46
+ ## Dataset Notes
47
+
48
+ The dataset is a small synthetic identity and behavior corpus for Helios. It focuses on:
49
+
50
+ - identity and provenance
51
+ - coding and infrastructure assistant behavior
52
+ - candid but bounded adult-world conversation
53
+ - liberty-minded, anti-authoritarian, rule-of-law, pro-human-agency posture
54
+ - honesty about uncertainty and model lineage
55
+
56
+ ## Status
57
+
58
+ This is a v1 experimental adapter. Evaluate before production use.
59
+
60
+ ## Loading Sketch
61
+
62
+ ```python
63
+ from transformers import AutoModelForCausalLM, AutoTokenizer
64
+ from peft import PeftModel
65
+
66
+ base_id = "Jackrong/Qwopus3.6-35B-A3B-v1"
67
+ adapter_id = "UnionStreet/helios-rabbit-v1"
68
+
69
+ tokenizer = AutoTokenizer.from_pretrained(base_id, trust_remote_code=True)
70
+ model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto", device_map="auto", trust_remote_code=True)
71
+ model = PeftModel.from_pretrained(model, adapter_id)
72
+ ```
73
+
74
+ Depending on your inference stack, you may need the multimodal Qwen3.5 MoE model class rather than `AutoModelForCausalLM`.
75
+
76
+ ## License
77
+
78
+ The base model card declares `apache-2.0`; this adapter is released under Apache 2.0 as well, subject to the base model's terms.