sumink commited on
Commit
8e8b432
·
verified ·
1 Parent(s): b696e7c

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Model: LLaMA (IFD Top 30%)
2
+
3
+ ## 🔍 Purpose
4
+
5
+ Fine-tune `meta-llama/Llama-3.2-1B` on instruction samples with the **highest Instruction Flow Density (IFD)**.
6
+ This group includes samples where the instruction contributes **least** to the model’s output (i.e., high IFD).
7
+
8
+ ## 📂 Dataset
9
+
10
+ - `alpaca2000.csv`
11
+ - IFD score 상위 30% (2000개 중 600개)
12
+ - 기준: `PPL(y | x) / PPL(y)` (x: instruction+input, y: output)
13
+
14
+ ## ⚙️ Training Config
15
+
16
+ - Model: `meta-llama/Llama-3.2-1B`
17
+ - Precision: `bf16` or `float32`
18
+ - Epochs: 3
19
+ - Max length: 2048
20
+ - Output: `output/llama_ifd`
21
+
22
+ ## 🧪 Goal
23
+
24
+ Establish baseline performance of high-IFD samples, before splitting by instruction entropy.