LuxiaSL committed on
Commit e0b18a0 · verified
1 Parent(s): d264b3c

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -62,7 +62,7 @@ The model combines three techniques not previously studied together at this scal
  - **Muon optimizer** -- spectral-norm steepest descent via Newton-Schulz orthogonalization, producing 2-4x higher stable rank than AdamW at matched loss, with Gram-NS optimized coefficients.
 
  **Organization:** [aethera-gp](https://huggingface.co/aethera-gp)
- **Training code:** [github.com/aethera-gp/kotodama](https://github.com/aethera-gp/kotodama) (pretraining/)
+ **Training code:** [github.com/LuxiaSL/kotodama](https://github.com/LuxiaSL/kotodama)
 
  ## Architecture
 