aswinkumar99's picture
Add model card
0d3078b verified
---
language:
- en
license: mit
tags:
- pi05
- pi0.5
- LeRobot
- robotics
- imitation-learning
- behavior-cloning
- so101
pipeline_tag: reinforcement-learning
library_name: lerobot
base_model:
- lerobot/pi05_base
---
# LeRobot SO101 Pi05 task2task3-all_bs32_s20000
## Summary
This repository contains the final checkpoint for a Pi0.5 (pi05) policy fine-tune trained on `aswinkumar99/task2task3-all` for SO101 sponge pick-and-place experiments.
Dataset meaning: Task 2 + Task 3 combined (all layouts).
This pi05 policy is a fine-tune of `lerobot/pi05_base`, as recorded by both the launch command (`--policy.path=lerobot/pi05_base`) and the saved training config (`pretrained_path: lerobot/pi05_base`).
## Training Setup
- Dataset repo: `aswinkumar99/task2task3-all`
- Local dataset root during training: `/root/datasets_combined/aswinkumar99/task2task3-all`
- Output directory during training: `/root/outputs_matrix/pi05/task2task3-all_bs32_s20000`
- Batch size: `32`
- Training steps: `20000`
- Checkpoint save frequency: `5000`
- Data loader workers: `8`
- WandB project: `so101-layout-generalization`
- GPU: `NVIDIA H200`
- Python: `CPython 3.10.12`
- CUDA: `13.1`
- Training start: `2026-04-24T13:03:21.511729`
- Training end: `2026-04-24T17:37:44.320524`
- Approximate training duration: `4h 34m 22s`
- Base model: `lerobot/pi05_base`
- Observation camera rename map: `{"observation.images.overhead": "observation.images.base_0_rgb", "observation.images.wrist": "observation.images.right_wrist_0_rgb"}`
- Action chunk size: `50`
- Action steps predicted: `50`
## Exact Training Command
```bash
lerobot-train \
--dataset.repo_id=aswinkumar99/task2task3-all \
--dataset.root=/root/datasets_combined/aswinkumar99/task2task3-all \
--dataset.video_backend=torchcodec \
--output_dir=/root/outputs_matrix/pi05/task2task3-all_bs32_s20000 \
--job_name=pi05_task2task3-all_bs32 \
--batch_size=32 \
--steps=20000 \
--log_freq=200 \
--save_freq=5000 \
--save_checkpoint=true \
--num_workers=8 \
--wandb.enable=true \
--wandb.project=so101-layout-generalization \
--wandb.mode=online \
--wandb.disable_artifact=true \
--policy.path=lerobot/pi05_base \
--policy.device=cuda \
--policy.push_to_hub=false \
--rename_map={"observation.images.overhead": "observation.images.base_0_rgb", "observation.images.wrist": "observation.images.right_wrist_0_rgb"}
```
## Repository Contents
- `pretrained_model/`: final downloadable model artifacts for inference/loading
- `training_state/`: optimizer, RNG, scheduler/state, and step information for resuming or auditability
## Notes
- This repo stores the final checkpoint (step `20000`) that was uploaded from the cloud training workspace.
- Intermediate checkpoints (every `5000` steps) are archived to Google Drive as tarballs and are not pushed to the Hub.
- The checkpoint was trained with LeRobot tooling via `lerobot-train`.
- For SO101 experiments in this workspace, the dataset source was created by Aswinkumar.
## Creator
Aswinkumar
- Website: [aswinkumar.me](https://aswinkumar.me)
- Hugging Face repo: <https://huggingface.co/aswinkumar99/LeRobot-SO101-Pi05-task2task3-all_bs32_s20000>