alextripplet committed on
Commit e0b9559 · verified · 1 Parent(s): ef3b1ae

Upload README.md with huggingface_hub

  1. README.md +10 -6
README.md CHANGED
@@ -28,18 +28,18 @@ base_model:
 
  # Suzhou 3.2
 
- A ~12 billion parameter instruction-tuned language model by **Triplet Research**. Suzhou 3.2 is a weighted merge of Suzhou 3.1 and Qwen2.5-3B, designed to improve reasoning and math capabilities while keeping under the 15B parameter limit.
+ A 12 billion parameter instruction-tuned language model by **Triplet Research**. Suzhou 3.2 is a weighted merge of Suzhou 3.1 and Qwen2.5-3B, designed to improve reasoning and math capabilities.
 
  ## Merge Details
 
  - **Method**: Weighted blending (70% Suzhou 3.1 + 30% Qwen2.5-3B)
- - **Model A**: Suzhou 3.1 (~9B params) - strong agent/tool-use, reasoning
- - **Model B**: Qwen2.5-3B-Instruct (~3B params) - math reasoning, general knowledge
- - **Target**: ~12B parameters total
+ - **Model A**: Suzhou 3.1 - strong agent/tool-use, reasoning
+ - **Model B**: Qwen2.5-3B-Instruct - math reasoning, general knowledge
+ - **Target**: 12B parameters
 
  ## Key Features
 
- - **~12B parameters** - under 15B limit
+ - **12B parameters**
  - **262K context window**
  - Strong **reasoning** and **chain-of-thought** capabilities
  - **Tool calling** and **agent** support
@@ -51,7 +51,11 @@ A ~12 billion parameter instruction-tuned language model by **Triplet Research**
  - Type: Causal Language Model
  - Architecture: Qwen3.5 Text
  - Layers: 32
- - Parameters: ~12B
+ - Parameters: 12B
+
+ ## Safetensors
+
+ - 12B parameters
 
  ## Quickstart
 
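The "Weighted blending (70% Suzhou 3.1 + 30% Qwen2.5-3B)" line in the README can be read as per-tensor linear interpolation. The sketch below illustrates that reading only; it is not the author's merge script. It assumes both checkpoints expose tensors with identical names and lengths, which the README does not confirm, and `blend_weights`, `model_a`, `model_b`, and `weight_a` are hypothetical names chosen for illustration. Plain Python lists stand in for real tensors.

```python
from typing import Dict, List

Weights = Dict[str, List[float]]


def blend_weights(model_a: Weights, model_b: Weights, weight_a: float = 0.7) -> Weights:
    """Linearly interpolate two sets of weights, tensor by tensor.

    Assumption (not stated in the README): both models share the same
    tensor names and every pair of tensors has the same length.
    """
    if not 0.0 <= weight_a <= 1.0:
        raise ValueError("weight_a must lie in [0, 1]")
    if model_a.keys() != model_b.keys():
        raise ValueError("models must expose the same tensor names")

    weight_b = 1.0 - weight_a
    blended: Weights = {}
    for name, values_a in model_a.items():
        values_b = model_b[name]
        if len(values_a) != len(values_b):
            raise ValueError(f"shape mismatch for tensor {name!r}")
        # Each output element is weight_a * a + (1 - weight_a) * b.
        blended[name] = [weight_a * a + weight_b * b for a, b in zip(values_a, values_b)]
    return blended


if __name__ == "__main__":
    # Toy values only; real checkpoints would be loaded from disk.
    toy_a = {"layer0.weight": [1.0, 2.0]}
    toy_b = {"layer0.weight": [3.0, 4.0]}
    print(blend_weights(toy_a, toy_b, weight_a=0.7))
```

With real checkpoints the same loop would run over loaded tensors instead of lists. How tensors from differently sized models are aligned is left open here, since the README does not describe it.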