Chattso-GPT/test105

Chattso-GPT committed on Feb 16

Commit 31e5730 · verified · 1 Parent(s): ea01dc4

Update README.md

Files changed (1)

README.md +4 -4

@@ -15,7 +15,7 @@ tags:
 - dbbench
 ---
-# <[Assignment] Please fill this in yourself>
 This repository provides a **LoRA adapter** fine-tuned from
 **Qwen/Qwen3-4B-Instruct-2507** using **LoRA + Unsloth**.
@@ -26,11 +26,11 @@ The base model must be loaded separately.
 ## Training Objective
 This adapter is trained to improve **multi-turn agent task performance**
-on ALFWorld (household tasks) and DBBench (database operations).
 Loss is applied to **all assistant turns** in the multi-turn trajectory,
 enabling the model to learn environment observation, action selection,
-tool use, and recovery from errors.
 ## Training Configuration
@@ -49,7 +49,7 @@ from peft import PeftModel
 import torch
 base = "Qwen/Qwen3-4B-Instruct-2507"
-adapter = "your_id/your-repo"
 tokenizer = AutoTokenizer.from_pretrained(base)
 model = AutoModelForCausalLM.from_pretrained(

 - dbbench
 ---
+# qwen3-4b-agent-trajectory-lora
 This repository provides a **LoRA adapter** fine-tuned from
 **Qwen/Qwen3-4B-Instruct-2507** using **LoRA + Unsloth**.
 ## Training Objective
 This adapter is trained to improve **multi-turn agent task performance**
+on ALFWorld (household tasks).
 Loss is applied to **all assistant turns** in the multi-turn trajectory,
 enabling the model to learn environment observation, action selection,
+and recovery from errors.
 ## Training Configuration
 import torch
 base = "Qwen/Qwen3-4B-Instruct-2507"
+adapter = "Chattso-GPT/test105"
 tokenizer = AutoTokenizer.from_pretrained(base)
 model = AutoModelForCausalLM.from_pretrained(
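
The Training Objective section says that loss is applied to all assistant turns in the multi-turn trajectory. The sketch below illustrates one common way to express that idea: labels are kept for assistant tokens and set to an ignore value everywhere else. The helper name `mask_non_assistant`, the per-token `roles` list, and the toy token IDs are assumptions made for illustration; the README does not show the actual training code, so this is not necessarily how the adapter was trained.

```python
# Hypothetical sketch: per-token label masking so that only assistant turns
# contribute to the loss. Names and data here are illustrative assumptions.

IGNORE_INDEX = -100  # default ignore_index of PyTorch's CrossEntropyLoss


def mask_non_assistant(token_ids, roles):
    """Return labels equal to token_ids on assistant tokens, IGNORE_INDEX elsewhere."""
    if len(token_ids) != len(roles):
        raise ValueError("token_ids and roles must have the same length")
    return [
        tok if role == "assistant" else IGNORE_INDEX
        for tok, role in zip(token_ids, roles)
    ]


# Toy trajectory: two assistant turns separated by an environment observation.
token_ids = [11, 12, 21, 22, 31, 41, 42]
roles = ["user", "user", "assistant", "assistant", "user", "assistant", "assistant"]

labels = mask_non_assistant(token_ids, roles)
print(labels)
```

Because every assistant turn keeps its labels, not only the final one, intermediate actions and error-recovery steps in the trajectory also receive a training signal.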