Update README.md
README.md
CHANGED
@@ -70,7 +70,7 @@ NorMuon and AdamW learning rates were both dropped by roughly an order of magnit
 
 ## Additional Resources
 
--
+- [Technical Report](https://arxiv.org/abs/2605.20119)
 - [Blog Post](https://www.datadoghq.com/blog/ai/toto-2/)
 - [Base model: Toto-2.0-2.5B](https://huggingface.co/Datadog/Toto-2.0-2.5B) - the unfinetuned checkpoint, which is what we recommend deploying
 - [Toto 2.0 Collection](https://huggingface.co/collections/Datadog/toto-20) - all five base sizes (4m to 2.5B)