Upload README.md with huggingface_hub
README.md
CHANGED
@@ -116,7 +116,7 @@ Finetune on your own tools via the web UI or CLI:
 needle ui
 
 # CLI (auto-downloads weights if not local)
-python -m src.training.finetune data.jsonl
+needle finetune data.jsonl
 ```
 
 ## Links
@@ -125,17 +125,6 @@ python -m src.training.finetune data.jsonl
 - [Cactus](https://github.com/cactus-compute/cactus) - on-device runtime (6000 tok/s prefill, 1200 tok/s decode)
 - [Simple Attention Networks](https://github.com/cactus-compute/needle/blob/main/docs/simple_attention_networks.md) - architecture details
 
-## File Format
-
-The checkpoint is a Python pickle containing:
-
-```python
-{
-"params": { ... }, # nested dict of numpy float16 arrays
-"config": { ... }, # TransformerConfig fields as dict
-}
-```
-
 ## License
 
 MIT
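
Note on the removed File Format section: it described the checkpoint as a Python pickle with a `params` entry (nested dict of numpy float16 arrays) and a `config` entry (TransformerConfig fields as a dict). A minimal sketch of reading a file with that layout follows. The helper names `load_checkpoint` and `count_parameters`, the demo layer names, and the `d_model` field are hypothetical and used only for illustration; the real checkpoint's keys and filename are not given in this diff.

```python
import pickle
import tempfile
from pathlib import Path

import numpy as np


def load_checkpoint(path):
    """Load a pickled checkpoint and return (params, config).

    Only unpickle files from a source you trust: pickle can run arbitrary code.
    """
    with open(path, "rb") as f:
        ckpt = pickle.load(f)
    return ckpt["params"], ckpt["config"]


def count_parameters(tree):
    """Recursively sum array sizes in a nested dict of numpy arrays."""
    if isinstance(tree, dict):
        return sum(count_parameters(v) for v in tree.values())
    return int(np.asarray(tree).size)


# Hypothetical stand-in checkpoint; real layer names and config fields will differ.
demo = {
    "params": {
        "embed": {"table": np.zeros((8, 4), dtype=np.float16)},
        "out": {"kernel": np.ones((4, 2), dtype=np.float16)},
    },
    "config": {"d_model": 4},
}

with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "demo.pkl"
    path.write_bytes(pickle.dumps(demo))  # write the stand-in file
    params, config = load_checkpoint(path)  # read it back

n_params = count_parameters(params)
print(n_params, config)
```

Because the format is a plain pickle, loading needs nothing beyond the standard library and numpy, but the file should be treated as executable input and loaded only from a trusted source.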