aswinkumar99 commited on
Commit
aa7d4e4
·
verified ·
1 Parent(s): 271b1bd

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +88 -0
README.md ADDED
@@ -0,0 +1,88 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: mit
5
+ tags:
6
+ - ACT
7
+ - LeRobot
8
+ - robotics
9
+ - imitation-learning
10
+ - behavior-cloning
11
+ - so101
12
+ pipeline_tag: reinforcement-learning
13
+ library_name: lerobot
14
+ ---
15
+
16
+ # LeRobot SO101 ACT task2-all_bs32_s60000
17
+
18
+ ## Summary
19
+
20
+ This repository contains the final checkpoint for a ACT policy trained on `aswinkumar99/task2-all` for SO101 sponge pick-and-place experiments.
21
+
22
+ Dataset meaning: Task 2: Multiple Sponges - No Distractors (all layouts).
23
+
24
+ This ACT policy was trained for this dataset configuration and was not initialized from a published ACT base checkpoint. The visual backbone uses ImageNet-initialized ResNet-18 weights (`ResNet18_Weights.IMAGENET1K_V1`) as recorded in the training config.
25
+
26
+ ## Training Setup
27
+
28
+ - Dataset repo: `aswinkumar99/task2-all`
29
+ - Local dataset root during training: `/home/riftuser/datasets_combined/aswinkumar99/task2-all`
30
+ - Output directory during training: `/home/riftuser/outputs_matrix/act/task2-all_bs32_s60000`
31
+ - Batch size: `32`
32
+ - Training steps: `60000`
33
+ - Checkpoint save frequency: `15000`
34
+ - Data loader workers: `8`
35
+ - WandB project: `so101-layout-generalization`
36
+ - GPU: `NVIDIA RTX PRO 6000 Blackwell Max-Q Workstation Edition`
37
+ - Python: `CPython 3.12.13`
38
+ - CUDA: `12.9`
39
+ - Training start: `2026-04-23T21:08:22.306521+00:00`
40
+ - Training end: `2026-04-24T00:14:27`
41
+ - Approximate training duration: `3h 6m 4s`
42
+
43
+ - Vision backbone: `resnet18`
44
+ - Observation cameras: `observation.images.overhead`, `observation.images.wrist`
45
+ - Action chunk size: `100`
46
+ - Action steps predicted: `100`
47
+
48
+ ## Exact Training Command
49
+
50
+ ```bash
51
+ lerobot-train \
52
+ --dataset.repo_id=aswinkumar99/task2-all \
53
+ --dataset.root=/home/riftuser/datasets_combined/aswinkumar99/task2-all \
54
+ --dataset.video_backend=torchcodec \
55
+ --output_dir=/home/riftuser/outputs_matrix/act/task2-all_bs32_s60000 \
56
+ --job_name=act_task2-all_bs32 \
57
+ --batch_size=32 \
58
+ --steps=60000 \
59
+ --log_freq=200 \
60
+ --save_freq=15000 \
61
+ --save_checkpoint=true \
62
+ --num_workers=8 \
63
+ --wandb.enable=true \
64
+ --wandb.project=so101-layout-generalization \
65
+ --wandb.mode=online \
66
+ --wandb.disable_artifact=true \
67
+ --policy.type=act \
68
+ --policy.device=cuda \
69
+ --policy.push_to_hub=false
70
+ ```
71
+
72
+ ## Repository Contents
73
+
74
+ - `pretrained_model/`: final downloadable model artifacts for inference/loading
75
+ - `training_state/`: optimizer, RNG, scheduler/state, and step information for resuming or auditability
76
+
77
+ ## Notes
78
+
79
+ - This repo stores the final checkpoint that was uploaded from the cloud training workspace.
80
+ - The checkpoint was trained with LeRobot tooling via `lerobot-train`.
81
+ - For SO101 experiments in this workspace, the dataset source was created by Aswinkumar.
82
+
83
+ ## Creator
84
+
85
+ Aswinkumar
86
+
87
+ - Website: [aswinkumar.me](https://aswinkumar.me)
88
+ - Hugging Face repo: <https://huggingface.co/aswinkumar99/LeRobot-SO101-ACT-task2-all_bs32_s60000>