lewtun HF Staff commited on
Commit
a8dd899
·
verified ·
1 Parent(s): ec572aa

End of training

Browse files
Files changed (1) hide show
  1. README.md +4 -24
README.md CHANGED
@@ -4,11 +4,11 @@ library_name: transformers
4
  model_name: smollm2-135m-capybara-sft
5
  tags:
6
  - generated_from_trainer
 
7
  - sft
8
  - trl
9
- - trackio:https://lewtun-mlintern-d8k3m2x1.hf.space?project=huggingface&runs=sft-smollm2-135m-capybara25&sidebar=collapsed
10
  - hf_jobs
11
- - ml-intern
12
  licence: license
13
  ---
14
 
@@ -31,7 +31,7 @@ print(output["generated_text"])
31
  ## Training procedure
32
 
33
 
34
- [<img src="https://raw.githubusercontent.com/gradio-app/trackio/refs/heads/main/trackio/assets/badge.png" alt="Visualize in Trackio" title="Visualize in Trackio" width="150" height="24"/>](https://lewtun-mlintern-d8k3m2x1.hf.space?project=huggingface&runs=sft-smollm2-135m-capybara25&sidebar=collapsed)
35
 
36
 
37
  This model was trained with SFT.
@@ -58,24 +58,4 @@ Cite TRL as:
58
  url = {https://github.com/huggingface/trl},
59
  year = {2020}
60
  }
61
- ```
62
-
63
- <!-- ml-intern-provenance -->
64
- ## Generated by ML Intern
65
-
66
- This model repository was generated by [ML Intern](https://github.com/huggingface/ml-intern), an agent for machine learning research and development on the Hugging Face Hub.
67
-
68
- - Try ML Intern: https://smolagents-ml-intern.hf.space
69
- - Source code: https://github.com/huggingface/ml-intern
70
-
71
- ## Usage
72
-
73
- ```python
74
- from transformers import AutoModelForCausalLM, AutoTokenizer
75
-
76
- model_id = 'lewtun/smollm2-135m-capybara-sft'
77
- tokenizer = AutoTokenizer.from_pretrained(model_id)
78
- model = AutoModelForCausalLM.from_pretrained(model_id)
79
- ```
80
-
81
- For non-causal architectures, replace `AutoModelForCausalLM` with the appropriate `AutoModel` class.
 
4
  model_name: smollm2-135m-capybara-sft
5
  tags:
6
  - generated_from_trainer
7
+ - trackio
8
  - sft
9
  - trl
10
+ - trackio:https://huggingface.co/spaces/lewtun/huggingface-static-d1504f
11
  - hf_jobs
 
12
  licence: license
13
  ---
14
 
 
31
  ## Training procedure
32
 
33
 
34
+
35
 
36
 
37
  This model was trained with SFT.
 
58
  url = {https://github.com/huggingface/trl},
59
  year = {2020}
60
  }
61
+ ```