Avi Trost
atrost
AI & ML interests
None yet
Recent Activity
updated a model about 1 hour ago
atrost/nanochat-d12-nested-kl-long-20260413 published a model about 1 hour ago
atrost/nanochat-d12-nested-kl-long-20260413 upvoted a paper 7 days ago
Test-Time Scaling Makes Overtraining Compute-Optimal