Commit e6eb5b0 (verified) by yypeng666 · 1 Parent(s): 10faa51

Add model card

  1. README.md +53 -0
README.md ADDED
@@ -0,0 +1,53 @@
+ # pi0.5 LoRA: stack_same (sim) (30 demos)
+
+ **Base weights**: `gs://openpi-assets/checkpoints/pi05_base/params` (openpi pi0.5 base, no robot prior)
+ **Training config**: `pi05_stack_same_libero_lora` (openpi, LIBERO-schema)
+ **Dataset**: [`IDEAS-Lab-Northwestern/sim-stack-same-30-libero`](https://huggingface.co/datasets/IDEAS-Lab-Northwestern/sim-stack-same-30-libero) (private): OmniGibson sim teleop, 30 episodes, 30 fps, **LIBERO v2.1** schema (image + wrist_image + 8D EEF state + 7D EEF-delta action)
+ **Real or sim**: **Sim** (OmniGibson teleop on a desk-mounted Franka Panda)
+ **Prompt (language conditioning)**:
+ > "Pick up the bottom item from the stack and move it into the green goal sphere. Take care that the items above remain stable and undisturbed."
+
+ ## Training
+
+ - 20,000 LoRA steps total; the final checkpoint is saved as step `19999` (openpi uses 0-indexed step naming)
+ - Batch size 4 on a single **A100-SXM4-40GB**
+ - Wall time: ~2h 45m
+ - LoRA adapters: `paligemma_variant="gemma_2b_lora"`, `action_expert_variant="gemma_300m_lora"` (the PaliGemma 2B and 300M action-expert base weights are frozen; only adapter params are updated)
+ - `action_dim=32`, `action_horizon=16`, `pi05=True`, `discrete_state_input=False`
+ - **Norm stats**: computed from this dataset rather than reused, because `pi05_base` ships no robot-specific norm stats. Computed via openpi's `scripts/compute_norm_stats.py --config-name pi05_stack_same_libero_lora`
+
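The step count, batch size, and wall time listed under Training imply some simple arithmetic. Here is a minimal sketch in Python; the variable names are illustrative, and the wall time is treated as exactly 2h 45m even though the card gives it only approximately.

```python
# Figures taken from the Training list in this model card.
TOTAL_STEPS = 20_000
BATCH_SIZE = 4
WALL_TIME_S = 2 * 3600 + 45 * 60  # ~2h 45m, treated as exact for this estimate

# Steps are numbered from 0, so the last checkpoint directory is TOTAL_STEPS - 1.
final_step_dir = str(TOTAL_STEPS - 1)

# Total training samples drawn (with repetition) over the whole run.
samples_seen = TOTAL_STEPS * BATCH_SIZE

# Rough average throughput over the run.
steps_per_second = TOTAL_STEPS / WALL_TIME_S

print(final_step_dir, samples_seen, round(steps_per_second, 2))
```

Under these assumptions the run draws 80,000 samples at roughly 2 optimizer steps per second, and the final checkpoint directory is `19999`.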
+ **wandb run**: https://wandb.ai/yiyanpeng2027-northwestern-university/openpi/runs/e6gc3vus
+
+ **Contents**: only step `19999/` (`params` + `assets` + `_CHECKPOINT_METADATA`). `train_state/` is excluded; it is not needed for inference or for LoRA-on-top resumption.
+
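Before serving, it can help to confirm that a downloaded step directory holds the three entries listed under Contents. Here is a minimal sketch using only the Python standard library. The helper name `missing_checkpoint_entries` is hypothetical and not part of openpi, and the check only looks at entry names, not at whether the contents are valid.

```python
from pathlib import Path

# Entry names listed under "Contents" in this model card.
EXPECTED_ENTRIES = ("params", "assets", "_CHECKPOINT_METADATA")


def missing_checkpoint_entries(step_dir):
    """Return the expected entries that are absent from step_dir (hypothetical helper)."""
    step_dir = Path(step_dir)
    return [name for name in EXPECTED_ENTRIES if not (step_dir / name).exists()]
```

For example, calling `missing_checkpoint_entries("vla_models/pi05-sim-stack-same-30-libero-lora/19999")` after the download should return an empty list if all three entries are present.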
+ ## Use this checkpoint
+
+ ```bash
+ # Download just this step
+ HF_HUB_DISABLE_XET=1 huggingface-cli download IDEAS-Lab-Northwestern/pi05-sim-stack-same-30-libero-lora \
+   --include "19999/**" \
+   --local-dir vla_models/pi05-sim-stack-same-30-libero-lora
+
+ # Serve with openpi (point at the step dir)
+ # Upstream openpi documents this command with a `policy:checkpoint` subcommand
+ # before the flags; add it if your openpi version requires it.
+ uv run scripts/serve_policy.py \
+   --policy.config=pi05_stack_same_libero_lora \
+   --policy.dir=vla_models/pi05-sim-stack-same-30-libero-lora/19999
+ ```
+
+ Note: the `pi05_stack_same_libero_lora` config's hardcoded `repo_id` is used only for training. Inference does not need the dataset; the checkpoint is self-contained.
+
+ ## Family
+
+ This is one of three paired sim-LoRA fine-tunes evaluating safety awareness in pi0.5-class VLA models on OmniGibson sim teleop:
+
+ - This repo: **stack_same (sim)**
+ - Sibling: [`IDEAS-Lab-Northwestern/pi05-sim-stack-flat-30-libero-lora`](https://huggingface.co/IDEAS-Lab-Northwestern/pi05-sim-stack-flat-30-libero-lora), stack_flat (sim)
+ - Cousin: [`IDEAS-Lab-Northwestern/pi05-sim-lid-transport-food-30-libero-lora`](https://huggingface.co/IDEAS-Lab-Northwestern/pi05-sim-lid-transport-food-30-libero-lora), lid_transport_food (sim)
+
+ All three share the same TrainConfig template (LIBERO schema, `pi05_base` warm-start, identical LoRA and optimizer hyperparameters); they differ only in dataset and prompt.
+
+ Companion real-teleop pair (DROID schema): [`pi05-real-cab-60-droid-lora`](https://huggingface.co/IDEAS-Lab-Northwestern/pi05-real-cab-60-droid-lora) + [`pi05-real-jar-60-droid-lora`](https://huggingface.co/IDEAS-Lab-Northwestern/pi05-real-jar-60-droid-lora).
+
+ ## License
+
+ Apache-2.0