LeRobot SO101 Pi05 task1-all_bs32_s20000

Summary

This repository contains the final checkpoint for a Pi0.5 (pi05) policy fine-tune trained on aswinkumar99/task1-all for SO101 sponge pick-and-place experiments.

Dataset meaning: Task 1: Single Sponge - No Distractors (all layouts).

This pi05 policy is a fine-tune of lerobot/pi05_base, as recorded by both the launch command (--policy.path=lerobot/pi05_base) and the saved training config (pretrained_path: lerobot/pi05_base).

Training Setup

  • Dataset repo: aswinkumar99/task1-all

  • Local dataset root during training: /root/datasets_combined/aswinkumar99/task1-all

  • Output directory during training: /root/outputs_matrix/pi05/task1-all_bs32_s20000

  • Batch size: 32

  • Training steps: 20000

  • Checkpoint save frequency: 5000

  • Data loader workers: 8

  • WandB project: so101-layout-generalization

  • GPU: NVIDIA H200

  • Python: CPython 3.10.12

  • CUDA: 13.1

  • Training start: 2026-04-23T18:54:35.016684

  • Training end: 2026-04-23T23:21:58.900412

  • Approximate training duration: 4h 27m 23s

  • Base model: lerobot/pi05_base

  • Observation camera rename map: {"observation.images.overhead": "observation.images.base_0_rgb", "observation.images.wrist": "observation.images.right_wrist_0_rgb"}

  • Action chunk size: 50

  • Action steps predicted: 50

Exact Training Command

lerobot-train \
  --dataset.repo_id=aswinkumar99/task1-all \
  --dataset.root=/root/datasets_combined/aswinkumar99/task1-all \
  --dataset.video_backend=torchcodec \
  --output_dir=/root/outputs_matrix/pi05/task1-all_bs32_s20000 \
  --job_name=pi05_task1-all_bs32 \
  --batch_size=32 \
  --steps=20000 \
  --log_freq=200 \
  --save_freq=5000 \
  --save_checkpoint=true \
  --num_workers=8 \
  --wandb.enable=true \
  --wandb.project=so101-layout-generalization \
  --wandb.mode=online \
  --wandb.disable_artifact=true \
  --policy.path=lerobot/pi05_base \
  --policy.device=cuda \
  --policy.push_to_hub=false \
  --rename_map={"observation.images.overhead": "observation.images.base_0_rgb", "observation.images.wrist": "observation.images.right_wrist_0_rgb"}

Repository Contents

  • pretrained_model/: final downloadable model artifacts for inference/loading
  • training_state/: optimizer, RNG, scheduler/state, and step information for resuming or auditability

Notes

  • This repo stores the final checkpoint (step 20000) that was uploaded from the cloud training workspace.
  • Intermediate checkpoints (every 5000 steps) are archived to Google Drive as tarballs and are not pushed to the Hub.
  • The checkpoint was trained with LeRobot tooling via lerobot-train.
  • For SO101 experiments in this workspace, the dataset source was created by Aswinkumar.

Creator

Aswinkumar

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Model tree for aswinkumar99/LeRobot-SO101-Pi05-task1-all_bs32_s20000

Finetuned
(18)
this model