NNEngine
/

TinyWay-1.1.0

Text Generation

custom-architecture

Model card Files Files and versions

NNEngine commited on Feb 25

Commit

ca7128c

·

verified ·

1 Parent(s): 27b1fc1

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ datasets:
 # TinyWay-1.1.0
 **TinyWay-1.1.0** is a lightweight **decoder-only Transformer language model** trained **from scratch** on limited compute.
-The project demonstrates that meaningful language modeling behavior can emerge from modest-scale models trained in constrained environments such as Kaggle.
 > **Core idea:** *Understanding LLM training mechanics end-to-end by building, training, debugging, and deploying a Transformer LM without relying on pretrained weights.*
@@ -56,7 +56,7 @@ The project demonstrates that meaningful language modeling behavior can emerge f
 * Gradient accumulation: enabled
 * Gradient clipping: enabled
 * Mixed precision training (AMP)
-* Training performed entirely on **Kaggle GPU environment (12-hour sessions)**
 ### Checkpoints
@@ -158,5 +158,5 @@ ITM Gwalior, India
 ## Acknowledgements
 * Hugging Face Transformers
-* Kaggle GPU resources
 * Open research community for open-source inspiration

 # TinyWay-1.1.0
 **TinyWay-1.1.0** is a lightweight **decoder-only Transformer language model** trained **from scratch** on limited compute.
+The project demonstrates that meaningful language modeling behavior can emerge from modest-scale models trained in constrained environments such as.
 > **Core idea:** *Understanding LLM training mechanics end-to-end by building, training, debugging, and deploying a Transformer LM without relying on pretrained weights.*
 * Gradient accumulation: enabled
 * Gradient clipping: enabled
 * Mixed precision training (AMP)
+* Training performed entirely on **GPU environment**
 ### Checkpoints
 ## Acknowledgements
 * Hugging Face Transformers
+* GPU resources
 * Open research community for open-source inspiration