gyung commited on
Commit
48b2bfb
·
verified ·
1 Parent(s): 61b3229

Update model card with pending TB2-lite evaluation status

Browse files
Files changed (1) hide show
  1. README.md +24 -23
README.md CHANGED
@@ -1,37 +1,38 @@
1
  ---
2
- license: apache-2.0
 
 
3
  library_name: transformers
4
- base_model: google/gemma-4-E2B-it
5
  tags:
6
- - terminal-agent
7
- - qwen3.5
8
- - full-finetuning
9
- - unsloth
10
  - tb2-lite
 
 
11
  ---
12
 
13
  # LLM-OS-Models/gemma-4-E2B-it-Terminal-SFT-2Epoch-DDP-4GPU
14
 
15
- ## Summary
16
 
17
- - Base model: `google/gemma-4-E2B-it`
18
- - Source output root: `/home/work/.data/qwen_sft/models/google__gemma-4-E2B-it__terminal_sft_2epoch_ddp_4gpu`
19
- - Root export copied from: `checkpoint-734`
20
- - Included checkpoints: `checkpoint-367, checkpoint-734`
21
 
22
- ## Layout
23
-
24
- - Repository root contains the eval/inference-ready final export.
25
- - Trainer checkpoints are included as subdirectories for recovery and inspection.
 
26
 
27
- ## Notes
28
 
29
- Gemma 4 E2B text-only terminal SFT, 4-GPU DDP, epoch checkpoints included.
 
 
30
 
31
- ## Loading
32
 
33
- ```python
34
- from transformers import AutoModelForCausalLM, AutoTokenizer
35
- tokenizer = AutoTokenizer.from_pretrained("LLM-OS-Models/gemma-4-E2B-it-Terminal-SFT-2Epoch-DDP-4GPU", trust_remote_code=True)
36
- model = AutoModelForCausalLM.from_pretrained("LLM-OS-Models/gemma-4-E2B-it-Terminal-SFT-2Epoch-DDP-4GPU", trust_remote_code=True)
37
- ```
 
1
  ---
2
+ language:
3
+ - en
4
+ - ko
5
  library_name: transformers
6
+ pipeline_tag: text-generation
7
  tags:
8
+ - terminal
9
+ - sft
10
+ - vllm
 
11
  - tb2-lite
12
+ - evaluation-pending
13
+ base_model: google/gemma-4-E2B-it
14
  ---
15
 
16
  # LLM-OS-Models/gemma-4-E2B-it-Terminal-SFT-2Epoch-DDP-4GPU
17
 
18
+ 터미널 작업 자동화를 위한 Terminal SFT 모델입니다. 입력된 작업/이전 터미널 상태를 보고 다음에 실행할 명령을 JSON 형태로 생성하는 용도로 학습했습니다.
19
 
20
+ ## 모델 요약
 
 
 
21
 
22
+ - Base model: `google/gemma-4-E2B-it`
23
+ - Training setup: `2 epochs, DDP fine-tuning`
24
+ - Model card snapshot: `2026-05-08 16:04:24 UTC`
25
+ - Corrected TB2-lite evaluated results currently indexed: `56`
26
+ - Corrected TB2-lite score: `pending / not matched in current result directory`
27
 
28
+ ## 평가 상태
29
 
30
+ - Current corrected TB2-lite score: `pending`
31
+ - Reason: 현재 `/home/work/.data/tb2_lite_eval/corrected_readme_models_vllm` 집계 결과와 이 HF repo명이 직접 매칭되지 않았습니다.
32
+ - Next step: 동일한 `tb2_lite/scripts/replay_eval.py` 경로로 평가를 돌린 뒤 점수 카드로 자동 교체합니다.
33
 
34
+ ## 모델군 해석
35
 
36
+ - Gemma 계열은 native Gemma/Liquid 전처리와 chat template 처리가 중요합니다. 이 repo는 corrected 평가가 끝나면 점수 카드로 교체합니다.
37
+ - TB2-lite 점수는 일반 지능 벤치마크가 아니라 터미널 next-action JSON 재현 능력을 측정합니다.
38
+ - 생성 명령은 실제 실행 전에 sandbox, allowlist, human review 같은 안전장치를 거쳐야 합니다.