fineweb-edu-llama-large-from-dyck-seed-0-1506433

This repository contains the latest checkpoint from the local training run fineweb_edu_llama_large_from_dyck_1506433.

Contents

  • model_60975.pth: latest checkpoint selected from the run directory
  • metrics.json: training and validation loss history for the run

Run metadata

  • Seed: 0
  • Local source directory: fineweb_edu_llama_large_from_dyck_1506433
  • Weights & Biases run name: fineweb_edu_llama_large_from_dyck
  • Weights & Biases run id: rkil768y
  • Final logged train loss at step 60500: 2.481296554207802
  • Final logged validation loss at step 60500: 2.4488859912928413

Notes

  • The included checkpoint file is model_60975.pth.
  • The latest metrics entry in metrics.json is at step 60500.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support