gshbao
/

faast-Qwen2.5-3B-Instruct

test-time-learning

Model card Files Files and versions

gshbao commited on 5 days ago

Commit

0a2b6cc

·

verified ·

1 Parent(s): 38479d6

Update README.md

Files changed (1) hide show

README.md +16 -0

README.md CHANGED Viewed

@@ -33,6 +33,22 @@ This design enables:
 - Fast adaptation to downstream tasks
 - Improved few-shot and full-data performance
 ## Training Details
 - **Base model:** Qwen2.5-3B-Instruct

 - Fast adaptation to downstream tasks
 - Improved few-shot and full-data performance
+Usage:
+Before running the following code, we need to import the modules from https://github.com/baoguangsheng/faast
+```
+tokenizer = AutoTokenizer.from_pretrained(args.model_path, trust_remote_code=True)
+model = AutoModelForCausalLM.from_pretrained(args.model_path, trust_remote_code=True)
+fewshot_samples = ['sample 1', 'sample 2', ...]
+inputs = tokenizer(fewshot_samples, return_tensors="pt", padding=True)
+model.reset_projection() # clear existing fast weights
+model.learn(**inputs)  # learn new fast weights
+model.generate(...)  # do the task using the learned fast weights
+```
 ## Training Details
 - **Base model:** Qwen2.5-3B-Instruct