xue10/qwen2.5-0.5B-MathInstruct-lora


xue10 committed on Sep 27, 2024

Commit 1811f31 · verified · 1 Parent(s): ec50447

Update README.md

Files changed (1)

README.md +12 -2

README.md CHANGED

@@ -1,16 +1,26 @@
 ---
 library_name: transformers
-tags: []
 ---
 # Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
 ## Model Details
 ### Model Description
 <!-- Provide a longer summary of what this model is. -->

 ---
 library_name: transformers
+license: mit
+datasets:
+- TIGER-Lab/MathInstruct
+base_model:
+- Qwen/Qwen2.5-0.5B
 ---
 # Model Card for Model ID
+Qwen2.5-0.5B fine-tuned on the MathInstruct dataset on a laptop RTX 4070 (8 GB VRAM) using LLaMA-Factory.
+Findings:
+- After fine-tuning, the model can answer questions like 'which is bigger? 9.11 or 9.9' but still cannot count the number of r's in the word strawberry.
+- I asked three math questions generated by GPT-4o, and the base model already handles them correctly, which suggests the base model was already trained on similar data.
+Details can be found in the inference.ipynb file.
 ## Model Details
+Check the relevant files in the repo.
 ### Model Description
 <!-- Provide a longer summary of what this model is. -->
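
The two probe questions mentioned in the findings can be rechecked with a small script. The following is only a minimal sketch, not the contents of inference.ipynb: it assumes the repo holds a PEFT-style LoRA adapter for `Qwen/Qwen2.5-0.5B`, that `transformers` and `peft` are installed, and that the repo id is the one shown on this page. The helper names (`bigger`, `count_letter`, `generate_answer`) and the wording of the strawberry prompt are hypothetical.

```python
from decimal import Decimal


def bigger(a: str, b: str) -> str:
    """Return whichever decimal string is numerically larger."""
    return a if Decimal(a) > Decimal(b) else b


def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of a letter in a word."""
    return word.lower().count(letter.lower())


# Probe prompts with reference answers computed in Python.
# The second prompt is a paraphrase; the README does not give its exact wording.
PROBES = {
    "Which is bigger? 9.11 or 9.9": bigger("9.11", "9.9"),
    "How many r's are in the word strawberry?": str(count_letter("strawberry", "r")),
}


def generate_answer(
    question: str,
    base_id: str = "Qwen/Qwen2.5-0.5B",
    adapter_id: str = "xue10/qwen2.5-0.5B-MathInstruct-lora",  # assumed repo id
    max_new_tokens: int = 64,
) -> str:
    """Load the base model plus the (assumed) LoRA adapter and answer one question."""
    # Imported lazily so the reference answers above work without these packages.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base_model = AutoModelForCausalLM.from_pretrained(base_id)
    model = PeftModel.from_pretrained(base_model, adapter_id)

    inputs = tokenizer(question, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_answer(q)` for each key in `PROBES` and comparing the text against the reference value by eye is one way to repeat the comparison; the generated text is free-form, so an exact string match is not expected.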