IRIS-architecture / IRIS_Training_Notebook.ipynb
asdf98's picture
v3: patch_size=4 (64 tokens), 2 core layers, iters [2,3,4], ~16min total training
65af9bd verified
Open in Colab