Crossie commited on
Commit
8cd12c9
·
verified ·
1 Parent(s): 40e9b09

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -9
README.md CHANGED
@@ -7,7 +7,6 @@ language:
7
  pipeline_tag: text-generation
8
  ---
9
 
10
-
11
  # 🌸 Nayari AI
12
 
13
  A fine-tuned AI companion character built on **Qwen 2.5 1.5B Instruct**, trained using **Unsloth + LoRA** on Kaggle's free GPU tier.
@@ -29,6 +28,7 @@ Nayari-AI/
29
 
30
  ├── nayari_build_dataset.ipynb # LOCAL — converts all files → nayari_dataset.json + uploads to Kaggle
31
  ├── nayari_train.ipynb # KAGGLE — fine-tunes Qwen 2.5 using the uploaded dataset
 
32
  ├── nayari_dataset.json # Auto-generated dataset (do not edit manually)
33
  ├── nayari_system_prompt.txt # Nayari's system prompt (baked into tokenizer at training time)
34
  └── README.md
@@ -57,18 +57,23 @@ You will need a **Kaggle API token** for the upload step:
57
  1. Go to [kaggle.com/code](https://kaggle.com/code) → **New Notebook** → Upload `nayari_train.ipynb`
58
  2. Click **+ Add Data** → search for your uploaded `nayari-dataset` → Add
59
  3. Set **Accelerator = GPU T4 x2** and **Internet = On**
60
- 4. Run cells **in order** (Steps 1 → 9, then 8/10 are reference only)
61
 
62
  Training takes ~15–30 min on T4 x2.
63
 
64
- ### Step 3 — Download & Run with KoboldCpp (run locally)
 
 
 
 
 
 
 
65
 
66
- 1. Run **Step 9E** in the Kaggle notebook to get a Cloudflare download link
67
- 2. Download `nayari-Q4_K_M.gguf` (fast) or `nayari-Q8_0.gguf` (higher quality)
68
- 3. Install [KoboldCpp](https://github.com/LostRuins/koboldcpp/releases)
69
- 4. Launch: `koboldcpp.exe nayari-Q4_K_M.gguf --contextsize 4096`
70
- 5. Open `http://localhost:5001` — set **Instruct mode = ChatML**
71
- 6. Nayari's personality is baked in — no system prompt needed in the UI
72
 
73
  ---
74
 
 
7
  pipeline_tag: text-generation
8
  ---
9
 
 
10
  # 🌸 Nayari AI
11
 
12
  A fine-tuned AI companion character built on **Qwen 2.5 1.5B Instruct**, trained using **Unsloth + LoRA** on Kaggle's free GPU tier.
 
28
 
29
  ├── nayari_build_dataset.ipynb # LOCAL — converts all files → nayari_dataset.json + uploads to Kaggle
30
  ├── nayari_train.ipynb # KAGGLE — fine-tunes Qwen 2.5 using the uploaded dataset
31
+ ├── nayari_export.ipynb # KAGGLE — exports the fine-tuned model to GGUF, HuggingFace, etc.
32
  ├── nayari_dataset.json # Auto-generated dataset (do not edit manually)
33
  ├── nayari_system_prompt.txt # Nayari's system prompt (baked into tokenizer at training time)
34
  └── README.md
 
57
  1. Go to [kaggle.com/code](https://kaggle.com/code) → **New Notebook** → Upload `nayari_train.ipynb`
58
  2. Click **+ Add Data** → search for your uploaded `nayari-dataset` → Add
59
  3. Set **Accelerator = GPU T4 x2** and **Internet = On**
60
+ 4. Run cells **in order**.
61
 
62
  Training takes ~15–30 min on T4 x2.
63
 
64
+ ### Step 3 — Export & Download (run on Kaggle)
65
+
66
+ 1. After training, open `nayari_export.ipynb` in your Kaggle environment (or upload it to a new notebook with the same workspace context).
67
+ 2. Run the cells to generate LoRA adapters, merged 16-bit, and GGUF outputs.
68
+ 3. Use the **Cloudflare Tunnel** cell to get direct HTTP download links for the generated `.gguf` files.
69
+ 4. Download `nayari-Q4_K_M.gguf` (fast) or `nayari-Q8_0.gguf` (higher quality).
70
+
71
+ ### Step 4 — Run with KoboldCpp (run locally)
72
 
73
+ 1. Install [KoboldCpp](https://github.com/LostRuins/koboldcpp/releases)
74
+ 2. Launch: `koboldcpp.exe nayari-Q4_K_M.gguf --contextsize 4096`
75
+ 3. Open `http://localhost:5001` — set **Instruct mode = ChatML**
76
+ 4. Nayari's personality is baked in — no system prompt needed in the UI
 
 
77
 
78
  ---
79