Clean README after removing partial epoch uploads
Browse files
README.md
CHANGED
|
@@ -32,7 +32,7 @@ Each model is trained with multiple variants:
|
|
| 32 |
| `granite-4.0-h-tiny` | base_attn, dp3_attn, dp8_attn | `ibm-granite/granite-4.0-h-tiny-base` |
|
| 33 |
| `qwen1.5-moe-a2.7b` | dp3_attn, base_attn_v2, dp3_attn_v2, dp8_attn_v2 | `Qwen/Qwen1.5-MoE-A2.7B` |
|
| 34 |
|
| 35 |
-
Total: **24 LoRA adapters**
|
| 36 |
|
| 37 |
## Training Data
|
| 38 |
|
|
@@ -57,12 +57,6 @@ Each variant directory contains:
|
|
| 57 |
└── audit_scores.npz # DP audit raw scores
|
| 58 |
```
|
| 59 |
|
| 60 |
-
**Note**: Per-epoch adapter checkpoints (`epochs/epoch_XXX/adapter/`) are only partially uploaded for:
|
| 61 |
-
- `llama3.2-3b/dp3` (epochs 1, 2, 3, 5)
|
| 62 |
-
- `llama3.2-3b/dp8` (epochs 1, 5)
|
| 63 |
-
|
| 64 |
-
Other variants only contain the final adapter. Intermediate epoch checkpoints can be requested.
|
| 65 |
-
|
| 66 |
## Loading a Model
|
| 67 |
|
| 68 |
```python
|
|
|
|
| 32 |
| `granite-4.0-h-tiny` | base_attn, dp3_attn, dp8_attn | `ibm-granite/granite-4.0-h-tiny-base` |
|
| 33 |
| `qwen1.5-moe-a2.7b` | dp3_attn, base_attn_v2, dp3_attn_v2, dp8_attn_v2 | `Qwen/Qwen1.5-MoE-A2.7B` |
|
| 34 |
|
| 35 |
+
Total: **24 LoRA adapters**
|
| 36 |
|
| 37 |
## Training Data
|
| 38 |
|
|
|
|
| 57 |
└── audit_scores.npz # DP audit raw scores
|
| 58 |
```
|
| 59 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 60 |
## Loading a Model
|
| 61 |
|
| 62 |
```python
|