lsnu commited on
Commit
6e69f50
·
verified ·
1 Parent(s): ea8acc0

Add files using upload-large-folder tool

Browse files
Files changed (50) hide show
  1. PUBLIC_BENCHMARK_RESULTS.md +213 -0
  2. reports/anybimanual_anchor_bridge_live/trunk_only_ep1/command.txt +1 -0
  3. reports/anybimanual_anchor_bridge_live/trunk_only_ep1/stderr.txt +20 -0
  4. reports/anybimanual_anchor_bridge_live/trunk_only_ep1/stdout.txt +109 -0
  5. reports/anybimanual_anchor_bridge_live/trunk_only_ep1/summary.json +36 -0
  6. reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/config.yaml +139 -0
  7. reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/command.txt +1 -0
  8. reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/stderr.txt +35 -0
  9. reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/stdout.txt +142 -0
  10. reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/summary.json +38 -0
  11. reports/anybimanual_anchor_bridge_live/trunk_only_ep5/command.txt +1 -0
  12. reports/anybimanual_anchor_bridge_live/trunk_only_ep5/stdout.txt +109 -0
  13. reports/anybimanual_anchor_bridge_live/trunk_only_ep5/summary.json +36 -0
  14. reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/config.yaml +139 -0
  15. reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/eval_data.csv +2 -0
  16. reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/command.txt +1 -0
  17. reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/stderr.txt +35 -0
  18. reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/stdout.txt +167 -0
  19. reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/summary.json +42 -0
  20. reports/dual_push_same_trunk_parity_ep5/base_trunk/rollout_eval.json +29 -0
  21. reports/dual_push_same_trunk_parity_ep5/base_trunk/rollout_eval.md +14 -0
  22. reports/dual_push_same_trunk_parity_ep5/base_trunk/rollout_eval.partial.json +29 -0
  23. reports/maniskill_bag_bridge_eval_less_bonus_seed23/public_benchmark_package_summary.md +15 -0
  24. reports/maniskill_bag_bridge_smoke_v1/adapter_active_ft_seed17.json +509 -0
  25. reports/maniskill_bag_bridge_smoke_v1/adapter_active_ft_seed23.json +509 -0
  26. reports/maniskill_bag_bridge_smoke_v1/adapter_noop_seed17.json +488 -0
  27. reports/maniskill_bag_bridge_smoke_v1/adapter_noop_seed23.json +488 -0
  28. reports/maniskill_bag_bridge_smoke_v1/public_benchmark_package_summary.json +60 -0
  29. reports/maniskill_bag_bridge_smoke_v1/public_benchmark_package_summary.md +15 -0
  30. reports/maniskill_bag_bridge_smoke_v1/trunk_only_ft_seed17.json +509 -0
  31. reports/maniskill_bag_bridge_smoke_v1/trunk_only_ft_seed23.json +509 -0
  32. reports/maniskill_bag_bridge_val_sweep_seed23/conservative/public_benchmark_package_summary.json +60 -0
  33. reports/maniskill_bag_bridge_val_sweep_seed23/conservative/public_benchmark_package_summary.md +15 -0
  34. reports/maniskill_bag_bridge_val_sweep_seed23/conservative/trunk_only_ft_seed23.json +131 -0
  35. reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/adapter_active_ft_seed23.json +131 -0
  36. reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/adapter_noop_seed23.json +110 -0
  37. reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/public_benchmark_package_summary.json +60 -0
  38. reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/run.log +76 -0
  39. reports/maniskill_bag_bridge_val_sweep_seed23/low_bonus_high_thresh/adapter_active_ft_seed23.json +131 -0
  40. reports/maniskill_bag_bridge_val_sweep_seed23/summary.json +30 -0
  41. reports/proxy_rank_aligned_fast_eval/active/reveal_benchmark.json +0 -0
  42. reports/proxy_rank_aligned_fast_eval/active/reveal_benchmark.md +17 -0
  43. reports/proxy_rank_aligned_fast_eval/noop/reveal_benchmark.json +0 -0
  44. reports/proxy_rank_aligned_fast_eval/noop/reveal_benchmark.md +17 -0
  45. reports/proxy_semantic_nowm_quick12_rerun/reveal_benchmark.json +0 -0
  46. reports/proxy_semantic_nowm_quick12_rerun/reveal_benchmark.md +17 -0
  47. reports/proxy_transition_fast_eval/active/reveal_benchmark.json +0 -0
  48. reports/proxy_transition_fast_eval/active/reveal_benchmark.md +17 -0
  49. reports/proxy_transition_fast_eval/noop/reveal_benchmark.json +0 -0
  50. reports/proxy_transition_fast_eval/noop/reveal_benchmark.md +17 -0
PUBLIC_BENCHMARK_RESULTS.md ADDED
@@ -0,0 +1,213 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Public Benchmark Results
2
+
3
+ All dates below refer to `2026-04-01 UTC`.
4
+
5
+ ## Dense Occluded Retrieval Proxy
6
+
7
+ Benchmark:
8
+
9
+ - ManiSkill `PickClutterYCB-v1`
10
+
11
+ ### Completed runs
12
+
13
+ - `reports/maniskill_pickclutter_smoke/public_benchmark_package_summary.json`
14
+ - `trunk = 0.04`
15
+ - `noop = 0.04`
16
+ - `active = 0.04`
17
+ - `reports/maniskill_pickclutter_smoke_v2/public_benchmark_package_summary.json`
18
+ - `trunk = 0.04`
19
+ - `noop = 0.32`
20
+ - `active = 0.32`
21
+ - not adapter-specific because `active == noop`
22
+ - `reports/maniskill_pickclutter_smoke_v3/public_benchmark_package_summary.json`
23
+ - `trunk = 0.06`
24
+ - `noop = 0.06`
25
+ - `active = 0.06`
26
+ - `reports/maniskill_pickclutter_smoke_v4/public_benchmark_package_summary.json`
27
+ - `trunk = 0.48`
28
+ - `noop = 0.04`
29
+ - `active = 0.04`
30
+ - active intervened but regressed badly
31
+ - `reports/maniskill_pickclutter_smoke_v4_evalprobe_fromv3/public_benchmark_package_summary.json`
32
+ - `trunk = 0.06`
33
+ - `noop = 0.06`
34
+ - `active = 0.62`
35
+ - `delta = +0.56`
36
+ - eval-probe only, not a clean retrain
37
+ - `reports/maniskill_pickclutter_smoke_v5/public_benchmark_package_summary.json`
38
+ - `trunk = 0.04`
39
+ - `noop = 0.04`
40
+ - `active = 0.04`
41
+ - fairness-preserving retrain, but active still failed
42
+ - `reports/maniskill_pickclutter_smoke_v5_val_sweep/summary.json`
43
+ - val-only planner sweep
44
+ - `baseline_corrected = 0.00`
45
+ - `soft_pref = 0.00`
46
+ - `softer_pref = 0.625`
47
+ - `retrieve_open = 0.625`
48
+ - `reports/maniskill_pickclutter_smoke_v5_eval_tuned_softerpref/public_benchmark_package_summary.json`
49
+ - `trunk = 0.04`
50
+ - `noop = 0.04`
51
+ - `active = 0.62`
52
+ - `delta = +0.58`
53
+ - `95% CI = [0.44, 0.72]`
54
+ - `intervention_rate = 1.0`
55
+ - `non_base_selection_rate = 1.0`
56
+
57
+ ### Exact `smoke_v5` eval tuning carried to held-out
58
+
59
+ - `mode_preference_bonus = 0.75`
60
+ - `premature_retrieve_penalty = 0.5`
61
+ - `premature_insert_penalty = 0.25`
62
+ - `premature_maintain_penalty = 1.0`
63
+ - `occlusion_maintain_gap_min_access = 0.30`
64
+ - `occlusion_maintain_gap_min_visibility = 0.20`
65
+ - `retrieve_stage_access_threshold = 0.18`
66
+ - `retrieve_stage_reveal_threshold = 0.18`
67
+ - `retrieve_stage_support_threshold = 0.18`
68
+
69
+ ## Bag Retrieval Proxy
70
+
71
+ Benchmark:
72
+
73
+ - ManiSkill public bridge basket retrieval proxy
74
+
75
+ ### Completed runs
76
+
77
+ - `reports/maniskill_bag_bridge_smoke_v1/trunk_only_ft_seed17.json`
78
+ - `0.32`
79
+ - `reports/maniskill_bag_bridge_smoke_v1/adapter_noop_seed17.json`
80
+ - `0.00`
81
+ - `reports/maniskill_bag_bridge_smoke_v1/adapter_active_ft_seed17.json`
82
+ - `0.48`
83
+
84
+ - `reports/maniskill_bag_bridge_smoke_v1/trunk_only_ft_seed23.json`
85
+ - `0.48`
86
+ - `reports/maniskill_bag_bridge_smoke_v1/adapter_noop_seed23.json`
87
+ - `0.08`
88
+ - `reports/maniskill_bag_bridge_smoke_v1/adapter_active_ft_seed23.json`
89
+ - `0.00`
90
+
91
+ ### Seed-23 validation sweep
92
+
93
+ - `reports/maniskill_bag_bridge_val_sweep_seed23/summary.json`
94
+
95
+ Configs:
96
+
97
+ - `default`
98
+ - `trunk = 0.125`
99
+ - `noop = 0.125`
100
+ - `active = 0.00`
101
+ - `less_bonus`
102
+ - `trunk = 0.125`
103
+ - `noop = 0.125`
104
+ - `active = 0.125`
105
+ - intervention preserved
106
+ - `conservative`
107
+ - `trunk = 0.125`
108
+ - `noop = 0.125`
109
+ - `active = 0.125`
110
+ - intervention effectively disabled
111
+ - `low_bonus_high_thresh`
112
+ - `trunk = 0.125`
113
+ - `noop = 0.125`
114
+ - `active = 0.125`
115
+ - intervention effectively disabled
116
+
117
+ ### Corrected held-out evals
118
+
119
+ - `reports/maniskill_bag_bridge_eval_less_bonus_seed17/public_benchmark_package_summary.json`
120
+ - `trunk = 0.32`
121
+ - `noop = 0.00`
122
+ - `active = 0.48`
123
+ - `delta = +0.16`
124
+ - `reports/maniskill_bag_bridge_eval_less_bonus_seed23/public_benchmark_package_summary.json`
125
+ - `trunk = 0.48`
126
+ - `noop = 0.08`
127
+ - `active = 0.48`
128
+ - `delta = +0.00`
129
+ - `reports/maniskill_bag_bridge_eval_less_bonus_2seed_manual_summary.json`
130
+ - `trunk = 0.40`
131
+ - `noop = 0.04`
132
+ - `active = 0.48`
133
+ - `delta = +0.08`
134
+ - run-bootstrap CI `[0.00, 0.16]`
135
+
136
+ ## Cloth Retrieval Proxy
137
+
138
+ Benchmark:
139
+
140
+ - ManiSkill public bridge cloth retrieval proxy
141
+
142
+ ### Completed held-out seeds
143
+
144
+ - `seed17`
145
+ - `trunk = 0.04`
146
+ - `noop = 0.04`
147
+ - `active = 0.10`
148
+ - `intervention = 0.3369`
149
+ - `non_base = 0.2674`
150
+ - `seed23`
151
+ - `trunk = 0.04`
152
+ - `noop = 0.02`
153
+ - `active = 0.02`
154
+ - `intervention = 0.0`
155
+ - `non_base = 0.0`
156
+ - `seed29`
157
+ - `trunk = 0.04`
158
+ - `noop = 0.04`
159
+ - `active = 0.04`
160
+ - `intervention = 0.0`
161
+ - `non_base = 0.0`
162
+
163
+ 3-seed aggregate:
164
+
165
+ - `trunk = 0.0400`
166
+ - `noop = 0.0333`
167
+ - `active = 0.0533`
168
+ - `delta = +0.0133`
169
+
170
+ ### Seed-23 cloth validation sweep
171
+
172
+ - `reports/maniskill_cloth_bridge_val_sweep_seed23/summary.json`
173
+
174
+ Configs:
175
+
176
+ - `default`
177
+ - `trunk = 0.25`
178
+ - `noop = 0.125`
179
+ - `active = 0.125`
180
+ - `intervention = 0.0`
181
+ - `low_thresh`
182
+ - `active = 0.125`
183
+ - `intervention = 0.2`
184
+ - `non_base = 0.0667`
185
+ - `low_thresh_less_bonus`
186
+ - `active = 0.125`
187
+ - `intervention = 0.2`
188
+ - `non_base = 0.0667`
189
+ - `very_low_thresh_less_bonus`
190
+ - `active = 0.125`
191
+ - `intervention = 1.0`
192
+ - `non_base = 0.5333`
193
+
194
+ Interpretation:
195
+
196
+ - seed23 cloth was not recoverable by eval-side planner tuning alone
197
+
198
+ ## Single-Seed Combined Proxy Suite
199
+
200
+ - `reports/public_proxy_suite_smoke_v1/combined_summary.json`
201
+
202
+ Single-seed summary:
203
+
204
+ - occlusion proxy: `+0.58`
205
+ - bag proxy: `+0.16`
206
+ - cloth proxy: `+0.06`
207
+ - macro delta: `+0.267`
208
+
209
+ This combined single-seed picture is useful historically, but the stronger current read is:
210
+
211
+ - occlusion: strong
212
+ - bag: modestly positive across corrected 2-seed evaluation
213
+ - cloth: weak/inconclusive across 3 seeds
reports/anybimanual_anchor_bridge_live/trunk_only_ep1/command.txt ADDED
@@ -0,0 +1 @@
 
 
1
+ /workspace/envs/rlbench/bin/python /workspace/third_party/AnyBimanual/eval.py method=PERACT_BC framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1/anybimanual_logdir framework.start_seed=0 framework.eval_type=60000 framework.eval_episodes=1 framework.eval_envs=1 framework.gpu=0 rlbench.task_name=perlf_release_dual_push_buttons_smoke1 rlbench.tasks=[dual_push_buttons] rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root rlbench.headless=True rlbench.gripper_mode=BimanualDiscrete rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning rlbench.action_mode=BimanualMoveArmThenGripper
reports/anybimanual_anchor_bridge_live/trunk_only_ep1/stderr.txt ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
2
+ The version_base parameter is not specified.
3
+ Please specify a compatability version level, or None.
4
+ Will assume defaults for version 1.1
5
+ @hydra.main(config_name="eval", config_path="conf")
6
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/_internal/defaults_list.py:251: UserWarning: In 'eval': Defaults list is missing `_self_`. See https://hydra.cc/docs/1.2/upgrades/1.0_to_1.1/default_composition_order for more information
7
+ warnings.warn(msg, UserWarning)
8
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/core/default_element.py:124: UserWarning: In 'method/PERACT_BC': Usage of deprecated keyword in package header '# @package _group_'.
9
+ See https://hydra.cc/docs/1.2/upgrades/1.0_to_1.1/changes_to_package_header for more information
10
+ deprecation_warning(
11
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/_internal/hydra.py:119: UserWarning: Future Hydra versions will no longer change working directory at job runtime by default.
12
+ See https://hydra.cc/docs/1.2/upgrades/1.1_to_1.2/changes_to_job_working_dir/ for more information.
13
+ ret = run_job(
14
+ Error executing job with overrides: ['method=PERACT_BC', 'framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1/anybimanual_logdir', 'framework.start_seed=0', 'framework.eval_type=60000', 'framework.eval_episodes=1', 'framework.eval_envs=1', 'framework.gpu=0', 'rlbench.task_name=perlf_release_dual_push_buttons_smoke1', 'rlbench.tasks=[dual_push_buttons]', 'rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root', 'rlbench.headless=True', 'rlbench.gripper_mode=BimanualDiscrete', 'rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning', 'rlbench.action_mode=BimanualMoveArmThenGripper']
15
+ Traceback (most recent call last):
16
+ File "/workspace/third_party/AnyBimanual/eval.py", line 201, in main
17
+ raise Exception(f"Missing seed{start_seed}/config.yaml. Logdir is {logdir}")
18
+ Exception: Missing seed0/config.yaml. Logdir is /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0
19
+
20
+ Set the environment variable HYDRA_FULL_ERROR=1 for a complete stack trace.
reports/anybimanual_anchor_bridge_live/trunk_only_ep1/stdout.txt ADDED
@@ -0,0 +1,109 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [2026-04-01 00:19:40,192][root][INFO] -
2
+ method:
3
+ name: PERACT_BC
4
+ agent_type: leader_follower
5
+ robot_name: bimanual
6
+ image_crop_size: 64
7
+ bounds_offset:
8
+ - 0.15
9
+ voxel_sizes:
10
+ - 100
11
+ include_prev_layer: false
12
+ num_latents: 2048
13
+ latent_dim: 512
14
+ transformer_depth: 6
15
+ transformer_iterations: 1
16
+ cross_heads: 1
17
+ cross_dim_head: 64
18
+ latent_heads: 8
19
+ latent_dim_head: 64
20
+ pos_encoding_with_lang: true
21
+ conv_downsample: true
22
+ lang_fusion_type: seq
23
+ voxel_patch_size: 5
24
+ voxel_patch_stride: 5
25
+ final_dim: 64
26
+ low_dim_size: 4
27
+ input_dropout: 0.1
28
+ attn_dropout: 0.1
29
+ decoder_dropout: 0.0
30
+ lr: 0.0005
31
+ lr_scheduler: false
32
+ num_warmup_steps: 3000
33
+ optimizer: lamb
34
+ lambda_weight_l2: 1.0e-06
35
+ trans_loss_weight: 1.0
36
+ rot_loss_weight: 1.0
37
+ grip_loss_weight: 1.0
38
+ collision_loss_weight: 1.0
39
+ rotation_resolution: 5
40
+ activation: lrelu
41
+ norm: None
42
+ crop_augmentation: true
43
+ transform_augmentation:
44
+ apply_se3: true
45
+ aug_xyz:
46
+ - 0.125
47
+ - 0.125
48
+ - 0.125
49
+ aug_rpy:
50
+ - 0.0
51
+ - 0.0
52
+ - 45.0
53
+ aug_rot_resolution: ${method.rotation_resolution}
54
+ demo_augmentation: true
55
+ demo_augmentation_every_n: 10
56
+ no_skip_connection: false
57
+ no_perceiver: false
58
+ no_language: false
59
+ keypoint_method: heuristic
60
+ rlbench:
61
+ task_name: perlf_release_dual_push_buttons_smoke1
62
+ tasks:
63
+ - dual_push_buttons
64
+ demo_path: /workspace/baselines/AnyBimanual_subset3_demo_root
65
+ episode_length: 25
66
+ cameras:
67
+ - over_shoulder_left
68
+ - over_shoulder_right
69
+ - overhead
70
+ - wrist_right
71
+ - wrist_left
72
+ - front
73
+ camera_resolution:
74
+ - 256
75
+ - 256
76
+ scene_bounds:
77
+ - -0.3
78
+ - -0.5
79
+ - 0.6
80
+ - 0.7
81
+ - 0.5
82
+ - 1.6
83
+ include_lang_goal_in_obs: true
84
+ time_in_state: true
85
+ headless: true
86
+ gripper_mode: BimanualDiscrete
87
+ arm_action_mode: BimanualEndEffectorPoseViaPlanning
88
+ action_mode: BimanualMoveArmThenGripper
89
+ framework:
90
+ tensorboard_logging: true
91
+ csv_logging: true
92
+ gpu: 0
93
+ logdir: /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1/anybimanual_logdir
94
+ start_seed: 0
95
+ record_every_n: 5
96
+ eval_envs: 1
97
+ eval_from_eps_number: 0
98
+ eval_episodes: 1
99
+ eval_type: 60000
100
+ eval_save_metrics: true
101
+ cinematic_recorder:
102
+ enabled: false
103
+ camera_resolution:
104
+ - 1280
105
+ - 720
106
+ fps: 30
107
+ rotate_speed: 0.005
108
+ save_path: /tmp/videos/
109
+
reports/anybimanual_anchor_bridge_live/trunk_only_ep1/summary.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "adapter_mode": "trunk_only",
3
+ "task_name": "perlf_release_dual_push_buttons_smoke1",
4
+ "tasks": [
5
+ "dual_push_buttons"
6
+ ],
7
+ "task_families": [
8
+ "generic"
9
+ ],
10
+ "passthrough_only": true,
11
+ "passthrough_reason": "generic_task_family",
12
+ "episodes_requested": 1,
13
+ "episode_scores": [],
14
+ "mean_score": 0.0,
15
+ "final_score": null,
16
+ "subprocess_returncode": 1,
17
+ "eval_csv_path": "/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/eval_data.csv",
18
+ "command": [
19
+ "/workspace/envs/rlbench/bin/python",
20
+ "/workspace/third_party/AnyBimanual/eval.py",
21
+ "method=PERACT_BC",
22
+ "framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1/anybimanual_logdir",
23
+ "framework.start_seed=0",
24
+ "framework.eval_type=60000",
25
+ "framework.eval_episodes=1",
26
+ "framework.eval_envs=1",
27
+ "framework.gpu=0",
28
+ "rlbench.task_name=perlf_release_dual_push_buttons_smoke1",
29
+ "rlbench.tasks=[dual_push_buttons]",
30
+ "rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root",
31
+ "rlbench.headless=True",
32
+ "rlbench.gripper_mode=BimanualDiscrete",
33
+ "rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning",
34
+ "rlbench.action_mode=BimanualMoveArmThenGripper"
35
+ ]
36
+ }
reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/config.yaml ADDED
@@ -0,0 +1,139 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ddp:
2
+ master_addr: localhost
3
+ master_port: 13985
4
+ num_devices: 2
5
+ rlbench:
6
+ task_name: perlf+ab
7
+ tasks:
8
+ - bimanual_pick_laptop
9
+ - bimanual_pick_plate
10
+ - bimanual_straighten_rope
11
+ - coordinated_lift_ball
12
+ - coordinated_lift_tray
13
+ - coordinated_push_box
14
+ - coordinated_put_bottle_in_fridge
15
+ - dual_push_buttons
16
+ - handover_item
17
+ - bimanual_sweep_to_dustpan
18
+ - coordinated_take_tray_out_of_oven
19
+ - handover_item_easy
20
+ demos: 100
21
+ demo_path: /mnt/disk_1/tengbo/bimanual_data/train
22
+ episode_length: 25
23
+ cameras:
24
+ - over_shoulder_right
25
+ - wrist_right
26
+ - front
27
+ - overhead
28
+ - over_shoulder_left
29
+ - wrist_left
30
+ camera_resolution:
31
+ - 256
32
+ - 256
33
+ scene_bounds:
34
+ - -0.3
35
+ - -0.5
36
+ - 0.6
37
+ - 0.7
38
+ - 0.5
39
+ - 1.6
40
+ include_lang_goal_in_obs: true
41
+ instructions: ''
42
+ replay:
43
+ batch_size: 2
44
+ timesteps: 1
45
+ prioritisation: false
46
+ task_uniform: true
47
+ use_disk: true
48
+ path: /mnt/disk_2/tengbo/replay/
49
+ max_parallel_processes: 32
50
+ task_folder: multi
51
+ framework:
52
+ log_freq: 1000
53
+ save_freq: 10000
54
+ train_envs: 1
55
+ replay_ratio: ${replay.batch_size}
56
+ transitions_before_train: 200
57
+ tensorboard_logging: true
58
+ csv_logging: true
59
+ training_iterations: 100001
60
+ gpu: 0
61
+ env_gpu: 0
62
+ logdir: /mnt/disk_1/tengbo/peract_bimanual/log
63
+ logging_level: 20
64
+ seeds: 1
65
+ start_seed: 0
66
+ load_existing_weights: false
67
+ num_weights_to_keep: 60
68
+ num_workers: 0
69
+ record_every_n: 5
70
+ checkpoint_name_prefix: checkpoint
71
+ use_wandb: true
72
+ wandb_project: ''
73
+ wandb_group: perlf+ab
74
+ seed: 0
75
+ wandb_name: perlf+ab
76
+ use_skill: true
77
+ use_pretrained: false
78
+ use_prefix: false
79
+ frozen: ''
80
+ anybimanual: true
81
+ augmentation_type: ab
82
+ method:
83
+ name: PERACT_BC
84
+ agent_type: leader_follower
85
+ robot_name: bimanual
86
+ image_crop_size: 64
87
+ bounds_offset:
88
+ - 0.15
89
+ voxel_sizes:
90
+ - 100
91
+ include_prev_layer: false
92
+ num_latents: 2048
93
+ latent_dim: 512
94
+ transformer_depth: 6
95
+ transformer_iterations: 1
96
+ cross_heads: 1
97
+ cross_dim_head: 64
98
+ latent_heads: 8
99
+ latent_dim_head: 64
100
+ pos_encoding_with_lang: true
101
+ conv_downsample: true
102
+ lang_fusion_type: seq
103
+ voxel_patch_size: 5
104
+ voxel_patch_stride: 5
105
+ final_dim: 64
106
+ low_dim_size: 4
107
+ input_dropout: 0.1
108
+ attn_dropout: 0.1
109
+ decoder_dropout: 0.0
110
+ lr: 0.0005
111
+ lr_scheduler: false
112
+ num_warmup_steps: 3000
113
+ optimizer: lamb
114
+ lambda_weight_l2: 1.0e-06
115
+ trans_loss_weight: 1.0
116
+ rot_loss_weight: 1.0
117
+ grip_loss_weight: 1.0
118
+ collision_loss_weight: 1.0
119
+ rotation_resolution: 5
120
+ activation: lrelu
121
+ norm: None
122
+ crop_augmentation: true
123
+ transform_augmentation:
124
+ apply_se3: true
125
+ aug_xyz:
126
+ - 0.125
127
+ - 0.125
128
+ - 0.125
129
+ aug_rpy:
130
+ - 0.0
131
+ - 0.0
132
+ - 45.0
133
+ aug_rot_resolution: ${method.rotation_resolution}
134
+ demo_augmentation: true
135
+ demo_augmentation_every_n: 10
136
+ no_skip_connection: false
137
+ no_perceiver: false
138
+ no_language: false
139
+ keypoint_method: heuristic
reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/command.txt ADDED
@@ -0,0 +1 @@
 
 
1
+ /workspace/envs/rlbench/bin/python /workspace/third_party/AnyBimanual/eval.py method=PERACT_BC framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir framework.start_seed=0 framework.eval_type=60000 framework.eval_episodes=1 framework.eval_envs=1 framework.gpu=0 rlbench.task_name=perlf_release_dual_push_buttons_smoke1 rlbench.tasks=[dual_push_buttons] rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root rlbench.headless=True rlbench.gripper_mode=BimanualDiscrete rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning rlbench.action_mode=BimanualMoveArmThenGripper
reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/stderr.txt ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
2
+ The version_base parameter is not specified.
3
+ Please specify a compatability version level, or None.
4
+ Will assume defaults for version 1.1
5
+ @hydra.main(config_name="eval", config_path="conf")
6
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/_internal/defaults_list.py:251: UserWarning: In 'eval': Defaults list is missing `_self_`. See https://hydra.cc/docs/1.2/upgrades/1.0_to_1.1/default_composition_order for more information
7
+ warnings.warn(msg, UserWarning)
8
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/core/default_element.py:124: UserWarning: In 'method/PERACT_BC': Usage of deprecated keyword in package header '# @package _group_'.
9
+ See https://hydra.cc/docs/1.2/upgrades/1.0_to_1.1/changes_to_package_header for more information
10
+ deprecation_warning(
11
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/_internal/hydra.py:119: UserWarning: Future Hydra versions will no longer change working directory at job runtime by default.
12
+ See https://hydra.cc/docs/1.2/upgrades/1.1_to_1.2/changes_to_job_working_dir/ for more information.
13
+ ret = run_job(
14
+ /workspace/envs/rlbench/lib/python3.10/site-packages/wandb/apis/public.py:3109: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
15
+ from pkg_resources import parse_version
16
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
17
+ The version_base parameter is not specified.
18
+ Please specify a compatability version level, or None.
19
+ Will assume defaults for version 1.1
20
+ @hydra.main(config_name="eval", config_path="conf")
21
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
22
+ The version_base parameter is not specified.
23
+ Please specify a compatability version level, or None.
24
+ Will assume defaults for version 1.1
25
+ @hydra.main(config_name="eval", config_path="conf")
26
+ /workspace/envs/rlbench/lib/python3.10/site-packages/wandb/apis/public.py:3109: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
27
+ from pkg_resources import parse_version
28
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
29
+ The version_base parameter is not specified.
30
+ Please specify a compatability version level, or None.
31
+ Will assume defaults for version 1.1
32
+ @hydra.main(config_name="eval", config_path="conf")
33
+ /workspace/third_party/YARR/yarr/utils/rollout_generator.py:73: UserWarning: Creating a tensor from a list of numpy.ndarrays is extremely slow. Please consider converting the list to a single numpy.ndarray with numpy.array() before converting to a tensor. (Triggered internally at /opt/conda/conda-bld/pytorch_1716905979055/work/torch/csrc/utils/tensor_new.cpp:274.)
34
+ prepped_data = {k: torch.tensor([v], device=self._env_device) for k, v in obs_history.items()}
35
+ [W CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors]
reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/stdout.txt ADDED
@@ -0,0 +1,142 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [2026-04-01 00:20:32,954][root][INFO] -
2
+ method:
3
+ name: PERACT_BC
4
+ agent_type: leader_follower
5
+ robot_name: bimanual
6
+ image_crop_size: 64
7
+ bounds_offset:
8
+ - 0.15
9
+ voxel_sizes:
10
+ - 100
11
+ include_prev_layer: false
12
+ num_latents: 2048
13
+ latent_dim: 512
14
+ transformer_depth: 6
15
+ transformer_iterations: 1
16
+ cross_heads: 1
17
+ cross_dim_head: 64
18
+ latent_heads: 8
19
+ latent_dim_head: 64
20
+ pos_encoding_with_lang: true
21
+ conv_downsample: true
22
+ lang_fusion_type: seq
23
+ voxel_patch_size: 5
24
+ voxel_patch_stride: 5
25
+ final_dim: 64
26
+ low_dim_size: 4
27
+ input_dropout: 0.1
28
+ attn_dropout: 0.1
29
+ decoder_dropout: 0.0
30
+ lr: 0.0005
31
+ lr_scheduler: false
32
+ num_warmup_steps: 3000
33
+ optimizer: lamb
34
+ lambda_weight_l2: 1.0e-06
35
+ trans_loss_weight: 1.0
36
+ rot_loss_weight: 1.0
37
+ grip_loss_weight: 1.0
38
+ collision_loss_weight: 1.0
39
+ rotation_resolution: 5
40
+ activation: lrelu
41
+ norm: None
42
+ crop_augmentation: true
43
+ transform_augmentation:
44
+ apply_se3: true
45
+ aug_xyz:
46
+ - 0.125
47
+ - 0.125
48
+ - 0.125
49
+ aug_rpy:
50
+ - 0.0
51
+ - 0.0
52
+ - 45.0
53
+ aug_rot_resolution: ${method.rotation_resolution}
54
+ demo_augmentation: true
55
+ demo_augmentation_every_n: 10
56
+ no_skip_connection: false
57
+ no_perceiver: false
58
+ no_language: false
59
+ keypoint_method: heuristic
60
+ rlbench:
61
+ task_name: perlf_release_dual_push_buttons_smoke1
62
+ tasks:
63
+ - dual_push_buttons
64
+ demo_path: /workspace/baselines/AnyBimanual_subset3_demo_root
65
+ episode_length: 25
66
+ cameras:
67
+ - over_shoulder_left
68
+ - over_shoulder_right
69
+ - overhead
70
+ - wrist_right
71
+ - wrist_left
72
+ - front
73
+ camera_resolution:
74
+ - 256
75
+ - 256
76
+ scene_bounds:
77
+ - -0.3
78
+ - -0.5
79
+ - 0.6
80
+ - 0.7
81
+ - 0.5
82
+ - 1.6
83
+ include_lang_goal_in_obs: true
84
+ time_in_state: true
85
+ headless: true
86
+ gripper_mode: BimanualDiscrete
87
+ arm_action_mode: BimanualEndEffectorPoseViaPlanning
88
+ action_mode: BimanualMoveArmThenGripper
89
+ framework:
90
+ tensorboard_logging: true
91
+ csv_logging: true
92
+ gpu: 0
93
+ logdir: /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir
94
+ start_seed: 0
95
+ record_every_n: 5
96
+ eval_envs: 1
97
+ eval_from_eps_number: 0
98
+ eval_episodes: 1
99
+ eval_type: 60000
100
+ eval_save_metrics: true
101
+ cinematic_recorder:
102
+ enabled: false
103
+ camera_resolution:
104
+ - 1280
105
+ - 720
106
+ fps: 30
107
+ rotate_speed: 0.005
108
+ save_path: /tmp/videos/
109
+
110
+ [2026-04-01 00:20:32,964][root][INFO] - Using env device cuda:0.
111
+ [2026-04-01 00:20:32,965][root][INFO] - Evaluating seed 0.
112
+ [2026-04-01 00:20:32,965][root][INFO] - Using method PERACT_BC with type leader_follower
113
+ Weight: [60000]
114
+ [04/01/26 00:20:52] INFO INFO:root:eval_env: _independent_env_runner.py:131
115
+ Launching env.
116
+ INFO INFO:root:Agent _independent_env_runner.py:134
117
+ information:
118
+ INFO INFO:root:<yarr.agen _independent_env_runner.py:135
119
+ ts.agent.LeaderFollo
120
+ werAgent object at
121
+ 0x7fb50f2eb9d0>
122
+ [04/01/26 00:20:53] INFO INFO:root:Using dual panda robot environment.py:117
123
+ INFO INFO:root:Setting control arm_action_modes.py:79
124
+ mode for both robots
125
+ WARNING WARNING:root:not sure how task_environment.py:57
126
+ _robot_shapes are used is
127
+ used.
128
+ INFO INFO:root:Evaluating _independent_env_runner.py:165
129
+ weight 60000
130
+ loaded weights from /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/weights/60000/checkpoint_peract_bc_leader_layer_0.pt
131
+ loaded weights from /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/weights/60000/checkpoint_peract_bc_follower_layer_0.pt
132
+ INFO INFO:root:eval_env: _independent_env_runner.py:192
133
+ Starting episode 0,
134
+ seed 0.
135
+ [04/01/26 00:21:22] INFO INFO:root:total waypoints 4, (right=2, task.py:368
136
+ left=2)
137
+ Evaluating bimanual_dual_push_buttons | Episode 0 | Score: 0.0 | Lang Goal: push the rose and the maroon buttons
138
+ Finished bimanual_dual_push_buttons | Final Score: unknown
139
+
140
+ [04/01/26 00:21:47] INFO INFO:root:Finished _independent_env_runner.py:293
141
+ evaluation.
142
+ [CoppeliaSim:loadinfo] done.
reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/summary.json ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "adapter_mode": "trunk_only",
3
+ "task_name": "perlf_release_dual_push_buttons_smoke1",
4
+ "tasks": [
5
+ "dual_push_buttons"
6
+ ],
7
+ "task_families": [
8
+ "generic"
9
+ ],
10
+ "passthrough_only": true,
11
+ "passthrough_reason": "generic_task_family",
12
+ "episodes_requested": 1,
13
+ "episode_scores": [
14
+ 0.0
15
+ ],
16
+ "mean_score": 0.0,
17
+ "final_score": null,
18
+ "subprocess_returncode": 0,
19
+ "eval_csv_path": "/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/eval_data.csv",
20
+ "command": [
21
+ "/workspace/envs/rlbench/bin/python",
22
+ "/workspace/third_party/AnyBimanual/eval.py",
23
+ "method=PERACT_BC",
24
+ "framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep1_retry/anybimanual_logdir",
25
+ "framework.start_seed=0",
26
+ "framework.eval_type=60000",
27
+ "framework.eval_episodes=1",
28
+ "framework.eval_envs=1",
29
+ "framework.gpu=0",
30
+ "rlbench.task_name=perlf_release_dual_push_buttons_smoke1",
31
+ "rlbench.tasks=[dual_push_buttons]",
32
+ "rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root",
33
+ "rlbench.headless=True",
34
+ "rlbench.gripper_mode=BimanualDiscrete",
35
+ "rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning",
36
+ "rlbench.action_mode=BimanualMoveArmThenGripper"
37
+ ]
38
+ }
reports/anybimanual_anchor_bridge_live/trunk_only_ep5/command.txt ADDED
@@ -0,0 +1 @@
 
 
1
+ /workspace/envs/rlbench/bin/python /workspace/third_party/AnyBimanual/eval.py method=PERACT_BC framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5/anybimanual_logdir framework.start_seed=0 framework.eval_type=60000 framework.eval_episodes=5 framework.eval_envs=1 framework.gpu=0 rlbench.task_name=perlf_release_dual_push_buttons_smoke5 rlbench.tasks=[dual_push_buttons] rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root rlbench.headless=True rlbench.gripper_mode=BimanualDiscrete rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning rlbench.action_mode=BimanualMoveArmThenGripper
reports/anybimanual_anchor_bridge_live/trunk_only_ep5/stdout.txt ADDED
@@ -0,0 +1,109 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [2026-04-01 00:23:58,169][root][INFO] -
2
+ method:
3
+ name: PERACT_BC
4
+ agent_type: leader_follower
5
+ robot_name: bimanual
6
+ image_crop_size: 64
7
+ bounds_offset:
8
+ - 0.15
9
+ voxel_sizes:
10
+ - 100
11
+ include_prev_layer: false
12
+ num_latents: 2048
13
+ latent_dim: 512
14
+ transformer_depth: 6
15
+ transformer_iterations: 1
16
+ cross_heads: 1
17
+ cross_dim_head: 64
18
+ latent_heads: 8
19
+ latent_dim_head: 64
20
+ pos_encoding_with_lang: true
21
+ conv_downsample: true
22
+ lang_fusion_type: seq
23
+ voxel_patch_size: 5
24
+ voxel_patch_stride: 5
25
+ final_dim: 64
26
+ low_dim_size: 4
27
+ input_dropout: 0.1
28
+ attn_dropout: 0.1
29
+ decoder_dropout: 0.0
30
+ lr: 0.0005
31
+ lr_scheduler: false
32
+ num_warmup_steps: 3000
33
+ optimizer: lamb
34
+ lambda_weight_l2: 1.0e-06
35
+ trans_loss_weight: 1.0
36
+ rot_loss_weight: 1.0
37
+ grip_loss_weight: 1.0
38
+ collision_loss_weight: 1.0
39
+ rotation_resolution: 5
40
+ activation: lrelu
41
+ norm: None
42
+ crop_augmentation: true
43
+ transform_augmentation:
44
+ apply_se3: true
45
+ aug_xyz:
46
+ - 0.125
47
+ - 0.125
48
+ - 0.125
49
+ aug_rpy:
50
+ - 0.0
51
+ - 0.0
52
+ - 45.0
53
+ aug_rot_resolution: ${method.rotation_resolution}
54
+ demo_augmentation: true
55
+ demo_augmentation_every_n: 10
56
+ no_skip_connection: false
57
+ no_perceiver: false
58
+ no_language: false
59
+ keypoint_method: heuristic
60
+ rlbench:
61
+ task_name: perlf_release_dual_push_buttons_smoke5
62
+ tasks:
63
+ - dual_push_buttons
64
+ demo_path: /workspace/baselines/AnyBimanual_subset3_demo_root
65
+ episode_length: 25
66
+ cameras:
67
+ - over_shoulder_left
68
+ - over_shoulder_right
69
+ - overhead
70
+ - wrist_right
71
+ - wrist_left
72
+ - front
73
+ camera_resolution:
74
+ - 256
75
+ - 256
76
+ scene_bounds:
77
+ - -0.3
78
+ - -0.5
79
+ - 0.6
80
+ - 0.7
81
+ - 0.5
82
+ - 1.6
83
+ include_lang_goal_in_obs: true
84
+ time_in_state: true
85
+ headless: true
86
+ gripper_mode: BimanualDiscrete
87
+ arm_action_mode: BimanualEndEffectorPoseViaPlanning
88
+ action_mode: BimanualMoveArmThenGripper
89
+ framework:
90
+ tensorboard_logging: true
91
+ csv_logging: true
92
+ gpu: 0
93
+ logdir: /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5/anybimanual_logdir
94
+ start_seed: 0
95
+ record_every_n: 5
96
+ eval_envs: 1
97
+ eval_from_eps_number: 0
98
+ eval_episodes: 5
99
+ eval_type: 60000
100
+ eval_save_metrics: true
101
+ cinematic_recorder:
102
+ enabled: false
103
+ camera_resolution:
104
+ - 1280
105
+ - 720
106
+ fps: 30
107
+ rotate_speed: 0.005
108
+ save_path: /tmp/videos/
109
+
reports/anybimanual_anchor_bridge_live/trunk_only_ep5/summary.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "adapter_mode": "trunk_only",
3
+ "task_name": "perlf_release_dual_push_buttons_smoke5",
4
+ "tasks": [
5
+ "dual_push_buttons"
6
+ ],
7
+ "task_families": [
8
+ "generic"
9
+ ],
10
+ "passthrough_only": true,
11
+ "passthrough_reason": "generic_task_family",
12
+ "episodes_requested": 5,
13
+ "episode_scores": [],
14
+ "mean_score": 0.0,
15
+ "final_score": null,
16
+ "subprocess_returncode": 1,
17
+ "eval_csv_path": "/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5/anybimanual_logdir/perlf_release_dual_push_buttons_smoke5/PERACT_BC/seed0/eval_data.csv",
18
+ "command": [
19
+ "/workspace/envs/rlbench/bin/python",
20
+ "/workspace/third_party/AnyBimanual/eval.py",
21
+ "method=PERACT_BC",
22
+ "framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5/anybimanual_logdir",
23
+ "framework.start_seed=0",
24
+ "framework.eval_type=60000",
25
+ "framework.eval_episodes=5",
26
+ "framework.eval_envs=1",
27
+ "framework.gpu=0",
28
+ "rlbench.task_name=perlf_release_dual_push_buttons_smoke5",
29
+ "rlbench.tasks=[dual_push_buttons]",
30
+ "rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root",
31
+ "rlbench.headless=True",
32
+ "rlbench.gripper_mode=BimanualDiscrete",
33
+ "rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning",
34
+ "rlbench.action_mode=BimanualMoveArmThenGripper"
35
+ ]
36
+ }
reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/config.yaml ADDED
@@ -0,0 +1,139 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ddp:
2
+ master_addr: localhost
3
+ master_port: 13985
4
+ num_devices: 2
5
+ rlbench:
6
+ task_name: perlf+ab
7
+ tasks:
8
+ - bimanual_pick_laptop
9
+ - bimanual_pick_plate
10
+ - bimanual_straighten_rope
11
+ - coordinated_lift_ball
12
+ - coordinated_lift_tray
13
+ - coordinated_push_box
14
+ - coordinated_put_bottle_in_fridge
15
+ - dual_push_buttons
16
+ - handover_item
17
+ - bimanual_sweep_to_dustpan
18
+ - coordinated_take_tray_out_of_oven
19
+ - handover_item_easy
20
+ demos: 100
21
+ demo_path: /mnt/disk_1/tengbo/bimanual_data/train
22
+ episode_length: 25
23
+ cameras:
24
+ - over_shoulder_right
25
+ - wrist_right
26
+ - front
27
+ - overhead
28
+ - over_shoulder_left
29
+ - wrist_left
30
+ camera_resolution:
31
+ - 256
32
+ - 256
33
+ scene_bounds:
34
+ - -0.3
35
+ - -0.5
36
+ - 0.6
37
+ - 0.7
38
+ - 0.5
39
+ - 1.6
40
+ include_lang_goal_in_obs: true
41
+ instructions: ''
42
+ replay:
43
+ batch_size: 2
44
+ timesteps: 1
45
+ prioritisation: false
46
+ task_uniform: true
47
+ use_disk: true
48
+ path: /mnt/disk_2/tengbo/replay/
49
+ max_parallel_processes: 32
50
+ task_folder: multi
51
+ framework:
52
+ log_freq: 1000
53
+ save_freq: 10000
54
+ train_envs: 1
55
+ replay_ratio: ${replay.batch_size}
56
+ transitions_before_train: 200
57
+ tensorboard_logging: true
58
+ csv_logging: true
59
+ training_iterations: 100001
60
+ gpu: 0
61
+ env_gpu: 0
62
+ logdir: /mnt/disk_1/tengbo/peract_bimanual/log
63
+ logging_level: 20
64
+ seeds: 1
65
+ start_seed: 0
66
+ load_existing_weights: false
67
+ num_weights_to_keep: 60
68
+ num_workers: 0
69
+ record_every_n: 5
70
+ checkpoint_name_prefix: checkpoint
71
+ use_wandb: true
72
+ wandb_project: ''
73
+ wandb_group: perlf+ab
74
+ seed: 0
75
+ wandb_name: perlf+ab
76
+ use_skill: true
77
+ use_pretrained: false
78
+ use_prefix: false
79
+ frozen: ''
80
+ anybimanual: true
81
+ augmentation_type: ab
82
+ method:
83
+ name: PERACT_BC
84
+ agent_type: leader_follower
85
+ robot_name: bimanual
86
+ image_crop_size: 64
87
+ bounds_offset:
88
+ - 0.15
89
+ voxel_sizes:
90
+ - 100
91
+ include_prev_layer: false
92
+ num_latents: 2048
93
+ latent_dim: 512
94
+ transformer_depth: 6
95
+ transformer_iterations: 1
96
+ cross_heads: 1
97
+ cross_dim_head: 64
98
+ latent_heads: 8
99
+ latent_dim_head: 64
100
+ pos_encoding_with_lang: true
101
+ conv_downsample: true
102
+ lang_fusion_type: seq
103
+ voxel_patch_size: 5
104
+ voxel_patch_stride: 5
105
+ final_dim: 64
106
+ low_dim_size: 4
107
+ input_dropout: 0.1
108
+ attn_dropout: 0.1
109
+ decoder_dropout: 0.0
110
+ lr: 0.0005
111
+ lr_scheduler: false
112
+ num_warmup_steps: 3000
113
+ optimizer: lamb
114
+ lambda_weight_l2: 1.0e-06
115
+ trans_loss_weight: 1.0
116
+ rot_loss_weight: 1.0
117
+ grip_loss_weight: 1.0
118
+ collision_loss_weight: 1.0
119
+ rotation_resolution: 5
120
+ activation: lrelu
121
+ norm: None
122
+ crop_augmentation: true
123
+ transform_augmentation:
124
+ apply_se3: true
125
+ aug_xyz:
126
+ - 0.125
127
+ - 0.125
128
+ - 0.125
129
+ aug_rpy:
130
+ - 0.0
131
+ - 0.0
132
+ - 45.0
133
+ aug_rot_resolution: ${method.rotation_resolution}
134
+ demo_augmentation: true
135
+ demo_augmentation_every_n: 10
136
+ no_skip_connection: false
137
+ no_perceiver: false
138
+ no_language: false
139
+ keypoint_method: heuristic
reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/eval_data.csv ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ step,eval_envs/return,eval_envs/length,eval_envs/total_transitions
2
+ 60000,40.0,17.6,88
reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/command.txt ADDED
@@ -0,0 +1 @@
 
 
1
+ /workspace/envs/rlbench/bin/python /workspace/third_party/AnyBimanual/eval.py method=PERACT_BC framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir framework.start_seed=0 framework.eval_type=60000 framework.eval_episodes=5 framework.eval_envs=1 framework.gpu=0 rlbench.task_name=perlf_release_dual_push_buttons_smoke1 rlbench.tasks=[dual_push_buttons] rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root rlbench.headless=True rlbench.gripper_mode=BimanualDiscrete rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning rlbench.action_mode=BimanualMoveArmThenGripper
reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/stderr.txt ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
2
+ The version_base parameter is not specified.
3
+ Please specify a compatability version level, or None.
4
+ Will assume defaults for version 1.1
5
+ @hydra.main(config_name="eval", config_path="conf")
6
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/_internal/defaults_list.py:251: UserWarning: In 'eval': Defaults list is missing `_self_`. See https://hydra.cc/docs/1.2/upgrades/1.0_to_1.1/default_composition_order for more information
7
+ warnings.warn(msg, UserWarning)
8
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/core/default_element.py:124: UserWarning: In 'method/PERACT_BC': Usage of deprecated keyword in package header '# @package _group_'.
9
+ See https://hydra.cc/docs/1.2/upgrades/1.0_to_1.1/changes_to_package_header for more information
10
+ deprecation_warning(
11
+ /workspace/envs/rlbench/lib/python3.10/site-packages/hydra/_internal/hydra.py:119: UserWarning: Future Hydra versions will no longer change working directory at job runtime by default.
12
+ See https://hydra.cc/docs/1.2/upgrades/1.1_to_1.2/changes_to_job_working_dir/ for more information.
13
+ ret = run_job(
14
+ /workspace/envs/rlbench/lib/python3.10/site-packages/wandb/apis/public.py:3109: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
15
+ from pkg_resources import parse_version
16
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
17
+ The version_base parameter is not specified.
18
+ Please specify a compatability version level, or None.
19
+ Will assume defaults for version 1.1
20
+ @hydra.main(config_name="eval", config_path="conf")
21
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
22
+ The version_base parameter is not specified.
23
+ Please specify a compatability version level, or None.
24
+ Will assume defaults for version 1.1
25
+ @hydra.main(config_name="eval", config_path="conf")
26
+ /workspace/envs/rlbench/lib/python3.10/site-packages/wandb/apis/public.py:3109: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
27
+ from pkg_resources import parse_version
28
+ /workspace/third_party/AnyBimanual/eval.py:183: UserWarning:
29
+ The version_base parameter is not specified.
30
+ Please specify a compatability version level, or None.
31
+ Will assume defaults for version 1.1
32
+ @hydra.main(config_name="eval", config_path="conf")
33
+ /workspace/third_party/YARR/yarr/utils/rollout_generator.py:73: UserWarning: Creating a tensor from a list of numpy.ndarrays is extremely slow. Please consider converting the list to a single numpy.ndarray with numpy.array() before converting to a tensor. (Triggered internally at /opt/conda/conda-bld/pytorch_1716905979055/work/torch/csrc/utils/tensor_new.cpp:274.)
34
+ prepped_data = {k: torch.tensor([v], device=self._env_device) for k, v in obs_history.items()}
35
+ [W CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors]
reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/stdout.txt ADDED
@@ -0,0 +1,167 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [2026-04-01 00:39:26,439][root][INFO] -
2
+ method:
3
+ name: PERACT_BC
4
+ agent_type: leader_follower
5
+ robot_name: bimanual
6
+ image_crop_size: 64
7
+ bounds_offset:
8
+ - 0.15
9
+ voxel_sizes:
10
+ - 100
11
+ include_prev_layer: false
12
+ num_latents: 2048
13
+ latent_dim: 512
14
+ transformer_depth: 6
15
+ transformer_iterations: 1
16
+ cross_heads: 1
17
+ cross_dim_head: 64
18
+ latent_heads: 8
19
+ latent_dim_head: 64
20
+ pos_encoding_with_lang: true
21
+ conv_downsample: true
22
+ lang_fusion_type: seq
23
+ voxel_patch_size: 5
24
+ voxel_patch_stride: 5
25
+ final_dim: 64
26
+ low_dim_size: 4
27
+ input_dropout: 0.1
28
+ attn_dropout: 0.1
29
+ decoder_dropout: 0.0
30
+ lr: 0.0005
31
+ lr_scheduler: false
32
+ num_warmup_steps: 3000
33
+ optimizer: lamb
34
+ lambda_weight_l2: 1.0e-06
35
+ trans_loss_weight: 1.0
36
+ rot_loss_weight: 1.0
37
+ grip_loss_weight: 1.0
38
+ collision_loss_weight: 1.0
39
+ rotation_resolution: 5
40
+ activation: lrelu
41
+ norm: None
42
+ crop_augmentation: true
43
+ transform_augmentation:
44
+ apply_se3: true
45
+ aug_xyz:
46
+ - 0.125
47
+ - 0.125
48
+ - 0.125
49
+ aug_rpy:
50
+ - 0.0
51
+ - 0.0
52
+ - 45.0
53
+ aug_rot_resolution: ${method.rotation_resolution}
54
+ demo_augmentation: true
55
+ demo_augmentation_every_n: 10
56
+ no_skip_connection: false
57
+ no_perceiver: false
58
+ no_language: false
59
+ keypoint_method: heuristic
60
+ rlbench:
61
+ task_name: perlf_release_dual_push_buttons_smoke1
62
+ tasks:
63
+ - dual_push_buttons
64
+ demo_path: /workspace/baselines/AnyBimanual_subset3_demo_root
65
+ episode_length: 25
66
+ cameras:
67
+ - over_shoulder_left
68
+ - over_shoulder_right
69
+ - overhead
70
+ - wrist_right
71
+ - wrist_left
72
+ - front
73
+ camera_resolution:
74
+ - 256
75
+ - 256
76
+ scene_bounds:
77
+ - -0.3
78
+ - -0.5
79
+ - 0.6
80
+ - 0.7
81
+ - 0.5
82
+ - 1.6
83
+ include_lang_goal_in_obs: true
84
+ time_in_state: true
85
+ headless: true
86
+ gripper_mode: BimanualDiscrete
87
+ arm_action_mode: BimanualEndEffectorPoseViaPlanning
88
+ action_mode: BimanualMoveArmThenGripper
89
+ framework:
90
+ tensorboard_logging: true
91
+ csv_logging: true
92
+ gpu: 0
93
+ logdir: /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir
94
+ start_seed: 0
95
+ record_every_n: 5
96
+ eval_envs: 1
97
+ eval_from_eps_number: 0
98
+ eval_episodes: 5
99
+ eval_type: 60000
100
+ eval_save_metrics: true
101
+ cinematic_recorder:
102
+ enabled: false
103
+ camera_resolution:
104
+ - 1280
105
+ - 720
106
+ fps: 30
107
+ rotate_speed: 0.005
108
+ save_path: /tmp/videos/
109
+
110
+ [2026-04-01 00:39:26,448][root][INFO] - Using env device cuda:0.
111
+ [2026-04-01 00:39:26,450][root][INFO] - Evaluating seed 0.
112
+ [2026-04-01 00:39:26,450][root][INFO] - Using method PERACT_BC with type leader_follower
113
+ Weight: [60000]
114
+ [04/01/26 00:39:48] INFO INFO:root:eval_env: _independent_env_runner.py:131
115
+ Launching env.
116
+ INFO INFO:root:Agent _independent_env_runner.py:134
117
+ information:
118
+ INFO INFO:root:<yarr.agen _independent_env_runner.py:135
119
+ ts.agent.LeaderFollo
120
+ werAgent object at
121
+ 0x794220da7970>
122
+ [04/01/26 00:39:49] INFO INFO:root:Using dual panda robot environment.py:117
123
+ INFO INFO:root:Setting control arm_action_modes.py:79
124
+ mode for both robots
125
+ WARNING WARNING:root:not sure how task_environment.py:57
126
+ _robot_shapes are used is
127
+ used.
128
+ INFO INFO:root:Evaluating _independent_env_runner.py:165
129
+ weight 60000
130
+ loaded weights from /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/weights/60000/checkpoint_peract_bc_leader_layer_0.pt
131
+ loaded weights from /workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/weights/60000/checkpoint_peract_bc_follower_layer_0.pt
132
+ INFO INFO:root:eval_env: _independent_env_runner.py:192
133
+ Starting episode 0,
134
+ seed 0.
135
+ [04/01/26 00:40:19] INFO INFO:root:total waypoints 4, (right=2, task.py:368
136
+ left=2)
137
+ Evaluating bimanual_dual_push_buttons | Episode 0 | Score: 0.0 | Lang Goal: push the rose and the maroon buttons
138
+ [04/01/26 00:40:44] INFO INFO:root:eval_env: _independent_env_runner.py:192
139
+ Starting episode 1,
140
+ seed 1.
141
+ [04/01/26 00:41:21] INFO INFO:root:total waypoints 4, (right=2, task.py:368
142
+ left=2)
143
+ Evaluating bimanual_dual_push_buttons | Episode 1 | Score: 100.0 | Lang Goal: push the black and the violet buttons
144
+ [04/01/26 00:41:29] INFO INFO:root:eval_env: _independent_env_runner.py:192
145
+ Starting episode 2,
146
+ seed 2.
147
+ [04/01/26 00:42:02] INFO INFO:root:total waypoints 4, (right=2, task.py:368
148
+ left=2)
149
+ Evaluating bimanual_dual_push_buttons | Episode 2 | Score: 100.0 | Lang Goal: push the violet and the blue buttons
150
+ [04/01/26 00:42:10] INFO INFO:root:eval_env: _independent_env_runner.py:192
151
+ Starting episode 3,
152
+ seed 3.
153
+ [04/01/26 00:42:41] INFO INFO:root:total waypoints 4, (right=2, task.py:368
154
+ left=2)
155
+ Evaluating bimanual_dual_push_buttons | Episode 3 | Score: 0.0 | Lang Goal: push the green and the violet buttons
156
+ [04/01/26 00:43:02] INFO INFO:root:eval_env: _independent_env_runner.py:192
157
+ Starting episode 4,
158
+ seed 4.
159
+ [04/01/26 00:43:40] INFO INFO:root:total waypoints 4, (right=2, task.py:368
160
+ left=2)
161
+ Evaluating bimanual_dual_push_buttons | Episode 4 | Score: 0.0 | Lang Goal: push the olive and the violet buttons
162
+ Finished bimanual_dual_push_buttons | Final Score: 40.0
163
+
164
+ moviepy is installed, but can't import moviepy.editor. Some packages could be missing [imageio, requests]
165
+ [04/01/26 00:44:10] INFO INFO:root:Finished _independent_env_runner.py:293
166
+ evaluation.
167
+ [CoppeliaSim:loadinfo] done.
reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/summary.json ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "adapter_mode": "trunk_only",
3
+ "task_name": "perlf_release_dual_push_buttons_smoke1",
4
+ "tasks": [
5
+ "dual_push_buttons"
6
+ ],
7
+ "task_families": [
8
+ "generic"
9
+ ],
10
+ "passthrough_only": true,
11
+ "passthrough_reason": "generic_task_family",
12
+ "episodes_requested": 5,
13
+ "episode_scores": [
14
+ 0.0,
15
+ 100.0,
16
+ 100.0,
17
+ 0.0,
18
+ 0.0
19
+ ],
20
+ "mean_score": 40.0,
21
+ "final_score": 40.0,
22
+ "subprocess_returncode": 0,
23
+ "eval_csv_path": "/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir/perlf_release_dual_push_buttons_smoke1/PERACT_BC/seed0/eval_data.csv",
24
+ "command": [
25
+ "/workspace/envs/rlbench/bin/python",
26
+ "/workspace/third_party/AnyBimanual/eval.py",
27
+ "method=PERACT_BC",
28
+ "framework.logdir=/workspace/workspace/reports/anybimanual_anchor_bridge_live/trunk_only_ep5_retry/anybimanual_logdir",
29
+ "framework.start_seed=0",
30
+ "framework.eval_type=60000",
31
+ "framework.eval_episodes=5",
32
+ "framework.eval_envs=1",
33
+ "framework.gpu=0",
34
+ "rlbench.task_name=perlf_release_dual_push_buttons_smoke1",
35
+ "rlbench.tasks=[dual_push_buttons]",
36
+ "rlbench.demo_path=/workspace/baselines/AnyBimanual_subset3_demo_root",
37
+ "rlbench.headless=True",
38
+ "rlbench.gripper_mode=BimanualDiscrete",
39
+ "rlbench.arm_action_mode=BimanualEndEffectorPoseViaPlanning",
40
+ "rlbench.action_mode=BimanualMoveArmThenGripper"
41
+ ]
42
+ }
reports/dual_push_same_trunk_parity_ep5/base_trunk/rollout_eval.json ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "checkpoint": "/workspace/workspace/VLAarchtests2/outputs/rlbench_dual_push/rlbench_dual_push_backbone_only_clip_chunk8_weighted_seed17/checkpoint_best.pt",
3
+ "plan_requested": false,
4
+ "plan_applied": false,
5
+ "planner_mode": "trainable",
6
+ "support_mode_conditioning": true,
7
+ "task_conditioning": true,
8
+ "geometry_enabled": true,
9
+ "world_model_mode": "checkpoint_default",
10
+ "episodes_per_task": 5,
11
+ "episode_length": 60,
12
+ "resolution": 256,
13
+ "reset_retries": 20,
14
+ "arm_mode": "planning",
15
+ "delta_scale": 1.0,
16
+ "cameras": [
17
+ "front",
18
+ "wrist_left",
19
+ "wrist_right"
20
+ ],
21
+ "tasks": {
22
+ "bimanual_dual_push_buttons": {
23
+ "error": "The call failed on the V-REP side. Return value: -1",
24
+ "mean_success": 0.0,
25
+ "mean_return": 0.0
26
+ }
27
+ },
28
+ "mean_success": 0.0
29
+ }
reports/dual_push_same_trunk_parity_ep5/base_trunk/rollout_eval.md ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # RLBench Rollout Eval
2
+
3
+ - Checkpoint: `/workspace/workspace/VLAarchtests2/outputs/rlbench_dual_push/rlbench_dual_push_backbone_only_clip_chunk8_weighted_seed17/checkpoint_best.pt`
4
+ - Plan requested: `False`
5
+ - Plan applied: `False`
6
+ - Support-mode conditioning: `True`
7
+ - Task conditioning: `True`
8
+ - Geometry enabled: `True`
9
+ - World-model mode: `checkpoint_default`
10
+ - Mean success: `0.000`
11
+
12
+ ## Per-task
13
+
14
+ - `bimanual_dual_push_buttons`: error=The call failed on the V-REP side. Return value: -1
reports/dual_push_same_trunk_parity_ep5/base_trunk/rollout_eval.partial.json ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "checkpoint": "/workspace/workspace/VLAarchtests2/outputs/rlbench_dual_push/rlbench_dual_push_backbone_only_clip_chunk8_weighted_seed17/checkpoint_best.pt",
3
+ "plan_requested": false,
4
+ "plan_applied": false,
5
+ "planner_mode": "trainable",
6
+ "support_mode_conditioning": true,
7
+ "task_conditioning": true,
8
+ "geometry_enabled": true,
9
+ "world_model_mode": "checkpoint_default",
10
+ "episodes_per_task": 5,
11
+ "episode_length": 60,
12
+ "resolution": 256,
13
+ "reset_retries": 20,
14
+ "arm_mode": "planning",
15
+ "delta_scale": 1.0,
16
+ "cameras": [
17
+ "front",
18
+ "wrist_left",
19
+ "wrist_right"
20
+ ],
21
+ "tasks": {
22
+ "bimanual_dual_push_buttons": {
23
+ "error": "The call failed on the V-REP side. Return value: -1",
24
+ "mean_success": 0.0,
25
+ "mean_return": 0.0
26
+ }
27
+ },
28
+ "mean_success": 0.0
29
+ }
reports/maniskill_bag_bridge_eval_less_bonus_seed23/public_benchmark_package_summary.md ADDED
@@ -0,0 +1,15 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ManiSkill Bag Retrieval Smoke Summary
2
+
3
+ - benchmark_task: PutEggplantInBasketRetrievalProxy-v1
4
+ - target_macro_average_delta: 0.000
5
+ - headline_pass: False
6
+ - sign_of_life_pass: False
7
+
8
+ ## bag_track
9
+ - delta_active_vs_trunk: 0.000
10
+ - delta_noop_vs_trunk: -0.400
11
+ - signs_of_life: False
12
+ - delta_active_vs_trunk_ci95: [-0.180, 0.200]
13
+ - trunk_only_ft: mean_success=0.480
14
+ - adapter_noop: mean_success=0.080
15
+ - adapter_active_ft: mean_success=0.480
reports/maniskill_bag_bridge_smoke_v1/adapter_active_ft_seed17.json ADDED
@@ -0,0 +1,509 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "adapter_active_ft",
7
+ "episodes": 50,
8
+ "successes": [
9
+ 1,
10
+ 1,
11
+ 1,
12
+ 1,
13
+ 1,
14
+ 1,
15
+ 0,
16
+ 0,
17
+ 0,
18
+ 1,
19
+ 0,
20
+ 1,
21
+ 1,
22
+ 1,
23
+ 1,
24
+ 1,
25
+ 1,
26
+ 0,
27
+ 0,
28
+ 1,
29
+ 0,
30
+ 0,
31
+ 1,
32
+ 0,
33
+ 1,
34
+ 0,
35
+ 1,
36
+ 0,
37
+ 1,
38
+ 0,
39
+ 1,
40
+ 0,
41
+ 0,
42
+ 0,
43
+ 0,
44
+ 0,
45
+ 1,
46
+ 1,
47
+ 0,
48
+ 0,
49
+ 1,
50
+ 0,
51
+ 0,
52
+ 0,
53
+ 1,
54
+ 0,
55
+ 0,
56
+ 1,
57
+ 0,
58
+ 0
59
+ ],
60
+ "success_rate": 0.48,
61
+ "intervention_rate": 1.0,
62
+ "non_base_selection_rate": 1.0,
63
+ "steps_to_first_reveal_or_access": 1.0,
64
+ "steps_to_retrieve": 4.0,
65
+ "disturbance_proxy": 0.0,
66
+ "episode_records": [
67
+ {
68
+ "episode_seed": 172000,
69
+ "success": true,
70
+ "steps": 2,
71
+ "first_reveal_step": 1,
72
+ "first_retrieve_step": null,
73
+ "episode_disturbance": 0.0
74
+ },
75
+ {
76
+ "episode_seed": 172001,
77
+ "success": true,
78
+ "steps": 1,
79
+ "first_reveal_step": 1,
80
+ "first_retrieve_step": null,
81
+ "episode_disturbance": 0.0
82
+ },
83
+ {
84
+ "episode_seed": 172002,
85
+ "success": true,
86
+ "steps": 1,
87
+ "first_reveal_step": 1,
88
+ "first_retrieve_step": null,
89
+ "episode_disturbance": 0.0
90
+ },
91
+ {
92
+ "episode_seed": 172003,
93
+ "success": true,
94
+ "steps": 1,
95
+ "first_reveal_step": 1,
96
+ "first_retrieve_step": null,
97
+ "episode_disturbance": 0.0
98
+ },
99
+ {
100
+ "episode_seed": 172004,
101
+ "success": true,
102
+ "steps": 1,
103
+ "first_reveal_step": 1,
104
+ "first_retrieve_step": null,
105
+ "episode_disturbance": 0.0
106
+ },
107
+ {
108
+ "episode_seed": 172005,
109
+ "success": true,
110
+ "steps": 1,
111
+ "first_reveal_step": 1,
112
+ "first_retrieve_step": null,
113
+ "episode_disturbance": 0.0
114
+ },
115
+ {
116
+ "episode_seed": 172006,
117
+ "success": false,
118
+ "steps": 4,
119
+ "first_reveal_step": 1,
120
+ "first_retrieve_step": null,
121
+ "episode_disturbance": 0.0
122
+ },
123
+ {
124
+ "episode_seed": 172007,
125
+ "success": false,
126
+ "steps": 4,
127
+ "first_reveal_step": 1,
128
+ "first_retrieve_step": null,
129
+ "episode_disturbance": 0.0
130
+ },
131
+ {
132
+ "episode_seed": 172008,
133
+ "success": false,
134
+ "steps": 4,
135
+ "first_reveal_step": 1,
136
+ "first_retrieve_step": null,
137
+ "episode_disturbance": 0.0
138
+ },
139
+ {
140
+ "episode_seed": 172009,
141
+ "success": true,
142
+ "steps": 1,
143
+ "first_reveal_step": 1,
144
+ "first_retrieve_step": null,
145
+ "episode_disturbance": 0.0
146
+ },
147
+ {
148
+ "episode_seed": 172010,
149
+ "success": false,
150
+ "steps": 4,
151
+ "first_reveal_step": 1,
152
+ "first_retrieve_step": null,
153
+ "episode_disturbance": 0.0
154
+ },
155
+ {
156
+ "episode_seed": 172011,
157
+ "success": true,
158
+ "steps": 1,
159
+ "first_reveal_step": 1,
160
+ "first_retrieve_step": null,
161
+ "episode_disturbance": 0.0
162
+ },
163
+ {
164
+ "episode_seed": 172012,
165
+ "success": true,
166
+ "steps": 3,
167
+ "first_reveal_step": 1,
168
+ "first_retrieve_step": null,
169
+ "episode_disturbance": 0.0
170
+ },
171
+ {
172
+ "episode_seed": 172013,
173
+ "success": true,
174
+ "steps": 2,
175
+ "first_reveal_step": 1,
176
+ "first_retrieve_step": null,
177
+ "episode_disturbance": 0.0
178
+ },
179
+ {
180
+ "episode_seed": 172014,
181
+ "success": true,
182
+ "steps": 2,
183
+ "first_reveal_step": 1,
184
+ "first_retrieve_step": null,
185
+ "episode_disturbance": 0.0
186
+ },
187
+ {
188
+ "episode_seed": 172015,
189
+ "success": true,
190
+ "steps": 1,
191
+ "first_reveal_step": 1,
192
+ "first_retrieve_step": null,
193
+ "episode_disturbance": 0.0
194
+ },
195
+ {
196
+ "episode_seed": 172016,
197
+ "success": true,
198
+ "steps": 1,
199
+ "first_reveal_step": 1,
200
+ "first_retrieve_step": null,
201
+ "episode_disturbance": 0.0
202
+ },
203
+ {
204
+ "episode_seed": 172017,
205
+ "success": false,
206
+ "steps": 4,
207
+ "first_reveal_step": 1,
208
+ "first_retrieve_step": null,
209
+ "episode_disturbance": 0.0
210
+ },
211
+ {
212
+ "episode_seed": 172018,
213
+ "success": false,
214
+ "steps": 4,
215
+ "first_reveal_step": 1,
216
+ "first_retrieve_step": null,
217
+ "episode_disturbance": 0.0
218
+ },
219
+ {
220
+ "episode_seed": 172019,
221
+ "success": true,
222
+ "steps": 2,
223
+ "first_reveal_step": 1,
224
+ "first_retrieve_step": null,
225
+ "episode_disturbance": 0.0
226
+ },
227
+ {
228
+ "episode_seed": 172020,
229
+ "success": false,
230
+ "steps": 4,
231
+ "first_reveal_step": 1,
232
+ "first_retrieve_step": null,
233
+ "episode_disturbance": 0.0
234
+ },
235
+ {
236
+ "episode_seed": 172021,
237
+ "success": false,
238
+ "steps": 4,
239
+ "first_reveal_step": 1,
240
+ "first_retrieve_step": null,
241
+ "episode_disturbance": 0.0
242
+ },
243
+ {
244
+ "episode_seed": 172022,
245
+ "success": true,
246
+ "steps": 1,
247
+ "first_reveal_step": 1,
248
+ "first_retrieve_step": null,
249
+ "episode_disturbance": 0.0
250
+ },
251
+ {
252
+ "episode_seed": 172023,
253
+ "success": false,
254
+ "steps": 4,
255
+ "first_reveal_step": 1,
256
+ "first_retrieve_step": null,
257
+ "episode_disturbance": 0.0
258
+ },
259
+ {
260
+ "episode_seed": 172024,
261
+ "success": true,
262
+ "steps": 1,
263
+ "first_reveal_step": 1,
264
+ "first_retrieve_step": null,
265
+ "episode_disturbance": 0.0
266
+ },
267
+ {
268
+ "episode_seed": 172025,
269
+ "success": false,
270
+ "steps": 4,
271
+ "first_reveal_step": 1,
272
+ "first_retrieve_step": null,
273
+ "episode_disturbance": 0.0
274
+ },
275
+ {
276
+ "episode_seed": 172026,
277
+ "success": true,
278
+ "steps": 1,
279
+ "first_reveal_step": 1,
280
+ "first_retrieve_step": null,
281
+ "episode_disturbance": 0.0
282
+ },
283
+ {
284
+ "episode_seed": 172027,
285
+ "success": false,
286
+ "steps": 4,
287
+ "first_reveal_step": 1,
288
+ "first_retrieve_step": null,
289
+ "episode_disturbance": 0.0
290
+ },
291
+ {
292
+ "episode_seed": 172028,
293
+ "success": true,
294
+ "steps": 1,
295
+ "first_reveal_step": 1,
296
+ "first_retrieve_step": null,
297
+ "episode_disturbance": 0.0
298
+ },
299
+ {
300
+ "episode_seed": 172029,
301
+ "success": false,
302
+ "steps": 4,
303
+ "first_reveal_step": 1,
304
+ "first_retrieve_step": null,
305
+ "episode_disturbance": 0.0
306
+ },
307
+ {
308
+ "episode_seed": 172030,
309
+ "success": true,
310
+ "steps": 1,
311
+ "first_reveal_step": 1,
312
+ "first_retrieve_step": null,
313
+ "episode_disturbance": 0.0
314
+ },
315
+ {
316
+ "episode_seed": 172031,
317
+ "success": false,
318
+ "steps": 4,
319
+ "first_reveal_step": 1,
320
+ "first_retrieve_step": null,
321
+ "episode_disturbance": 0.0
322
+ },
323
+ {
324
+ "episode_seed": 172032,
325
+ "success": false,
326
+ "steps": 4,
327
+ "first_reveal_step": 1,
328
+ "first_retrieve_step": null,
329
+ "episode_disturbance": 0.0
330
+ },
331
+ {
332
+ "episode_seed": 172033,
333
+ "success": false,
334
+ "steps": 4,
335
+ "first_reveal_step": 1,
336
+ "first_retrieve_step": null,
337
+ "episode_disturbance": 0.0
338
+ },
339
+ {
340
+ "episode_seed": 172034,
341
+ "success": false,
342
+ "steps": 4,
343
+ "first_reveal_step": 1,
344
+ "first_retrieve_step": null,
345
+ "episode_disturbance": 0.0
346
+ },
347
+ {
348
+ "episode_seed": 172035,
349
+ "success": false,
350
+ "steps": 4,
351
+ "first_reveal_step": 1,
352
+ "first_retrieve_step": null,
353
+ "episode_disturbance": 0.0
354
+ },
355
+ {
356
+ "episode_seed": 172036,
357
+ "success": true,
358
+ "steps": 2,
359
+ "first_reveal_step": 1,
360
+ "first_retrieve_step": null,
361
+ "episode_disturbance": 0.0
362
+ },
363
+ {
364
+ "episode_seed": 172037,
365
+ "success": true,
366
+ "steps": 1,
367
+ "first_reveal_step": 1,
368
+ "first_retrieve_step": null,
369
+ "episode_disturbance": 0.0
370
+ },
371
+ {
372
+ "episode_seed": 172038,
373
+ "success": false,
374
+ "steps": 4,
375
+ "first_reveal_step": 1,
376
+ "first_retrieve_step": null,
377
+ "episode_disturbance": 0.0
378
+ },
379
+ {
380
+ "episode_seed": 172039,
381
+ "success": false,
382
+ "steps": 4,
383
+ "first_reveal_step": 1,
384
+ "first_retrieve_step": null,
385
+ "episode_disturbance": 0.0
386
+ },
387
+ {
388
+ "episode_seed": 172040,
389
+ "success": true,
390
+ "steps": 1,
391
+ "first_reveal_step": 1,
392
+ "first_retrieve_step": null,
393
+ "episode_disturbance": 0.0
394
+ },
395
+ {
396
+ "episode_seed": 172041,
397
+ "success": false,
398
+ "steps": 4,
399
+ "first_reveal_step": 1,
400
+ "first_retrieve_step": null,
401
+ "episode_disturbance": 0.0
402
+ },
403
+ {
404
+ "episode_seed": 172042,
405
+ "success": false,
406
+ "steps": 4,
407
+ "first_reveal_step": 1,
408
+ "first_retrieve_step": null,
409
+ "episode_disturbance": 0.0
410
+ },
411
+ {
412
+ "episode_seed": 172043,
413
+ "success": false,
414
+ "steps": 4,
415
+ "first_reveal_step": 1,
416
+ "first_retrieve_step": null,
417
+ "episode_disturbance": 0.0
418
+ },
419
+ {
420
+ "episode_seed": 172044,
421
+ "success": true,
422
+ "steps": 1,
423
+ "first_reveal_step": 1,
424
+ "first_retrieve_step": null,
425
+ "episode_disturbance": 0.0
426
+ },
427
+ {
428
+ "episode_seed": 172045,
429
+ "success": false,
430
+ "steps": 4,
431
+ "first_reveal_step": 1,
432
+ "first_retrieve_step": null,
433
+ "episode_disturbance": 0.0
434
+ },
435
+ {
436
+ "episode_seed": 172046,
437
+ "success": false,
438
+ "steps": 4,
439
+ "first_reveal_step": 1,
440
+ "first_retrieve_step": null,
441
+ "episode_disturbance": 0.0
442
+ },
443
+ {
444
+ "episode_seed": 172047,
445
+ "success": true,
446
+ "steps": 1,
447
+ "first_reveal_step": 1,
448
+ "first_retrieve_step": null,
449
+ "episode_disturbance": 0.0
450
+ },
451
+ {
452
+ "episode_seed": 172048,
453
+ "success": false,
454
+ "steps": 4,
455
+ "first_reveal_step": 1,
456
+ "first_retrieve_step": null,
457
+ "episode_disturbance": 0.0
458
+ },
459
+ {
460
+ "episode_seed": 172049,
461
+ "success": false,
462
+ "steps": 4,
463
+ "first_reveal_step": 1,
464
+ "first_retrieve_step": null,
465
+ "episode_disturbance": 0.0
466
+ }
467
+ ],
468
+ "eval_protocol": {
469
+ "track_id": "bag_track",
470
+ "suite": "maniskill3",
471
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
472
+ "role": "target",
473
+ "eval_mode": "adapter_active_ft",
474
+ "seed": 17,
475
+ "episodes": 50,
476
+ "resolution": 224,
477
+ "cameras": [
478
+ "front",
479
+ "left",
480
+ "right"
481
+ ],
482
+ "observation_stack": "rgb_triplicate_zero_depth",
483
+ "action_horizon": 8,
484
+ "action_space": "widowx_delta_pose",
485
+ "same_test_episodes": true
486
+ },
487
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
488
+ "train_spec": {
489
+ "track_id": "bag_track",
490
+ "suite": "maniskill3",
491
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
492
+ "model_variant": "adapter_active_ft",
493
+ "seed": 17,
494
+ "train_demos": 32,
495
+ "val_demos": 8,
496
+ "init_checkpoint_group": "/workspace/workspace/VLAarchtests2/VLAarchtests/artifacts/outputs/r3d_handoff_phase/proxy_interaction_r3d_stage3_clip_rgbd_handoff_compact_phase_seed17/checkpoint_best.pt",
497
+ "optimizer": "adamw",
498
+ "learning_rate": 0.0001,
499
+ "lr_schedule": "constant",
500
+ "batch_size": 4,
501
+ "augmentations": "none",
502
+ "early_stopping_metric": "val_total",
503
+ "max_gradient_steps": 114,
504
+ "unfreeze_scope": "fusion_memory_decoder",
505
+ "dataset_split_id": "bag_bridge_smoke_v1_seed17",
506
+ "same_data_policy": true,
507
+ "same_init_policy": true
508
+ }
509
+ }
reports/maniskill_bag_bridge_smoke_v1/adapter_active_ft_seed23.json ADDED
@@ -0,0 +1,509 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "adapter_active_ft",
7
+ "episodes": 50,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 0,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 0,
17
+ 0,
18
+ 0,
19
+ 0,
20
+ 0,
21
+ 0,
22
+ 0,
23
+ 0,
24
+ 0,
25
+ 0,
26
+ 0,
27
+ 0,
28
+ 0,
29
+ 0,
30
+ 0,
31
+ 0,
32
+ 0,
33
+ 0,
34
+ 0,
35
+ 0,
36
+ 0,
37
+ 0,
38
+ 0,
39
+ 0,
40
+ 0,
41
+ 0,
42
+ 0,
43
+ 0,
44
+ 0,
45
+ 0,
46
+ 0,
47
+ 0,
48
+ 0,
49
+ 0,
50
+ 0,
51
+ 0,
52
+ 0,
53
+ 0,
54
+ 0,
55
+ 0,
56
+ 0,
57
+ 0,
58
+ 0
59
+ ],
60
+ "success_rate": 0.0,
61
+ "intervention_rate": 1.0,
62
+ "non_base_selection_rate": 1.0,
63
+ "steps_to_first_reveal_or_access": 1.0,
64
+ "steps_to_retrieve": 4.0,
65
+ "disturbance_proxy": 0.0,
66
+ "episode_records": [
67
+ {
68
+ "episode_seed": 172000,
69
+ "success": false,
70
+ "steps": 4,
71
+ "first_reveal_step": 1,
72
+ "first_retrieve_step": null,
73
+ "episode_disturbance": 0.0
74
+ },
75
+ {
76
+ "episode_seed": 172001,
77
+ "success": false,
78
+ "steps": 4,
79
+ "first_reveal_step": 1,
80
+ "first_retrieve_step": null,
81
+ "episode_disturbance": 0.0
82
+ },
83
+ {
84
+ "episode_seed": 172002,
85
+ "success": false,
86
+ "steps": 4,
87
+ "first_reveal_step": 1,
88
+ "first_retrieve_step": null,
89
+ "episode_disturbance": 0.0
90
+ },
91
+ {
92
+ "episode_seed": 172003,
93
+ "success": false,
94
+ "steps": 4,
95
+ "first_reveal_step": 1,
96
+ "first_retrieve_step": null,
97
+ "episode_disturbance": 0.0
98
+ },
99
+ {
100
+ "episode_seed": 172004,
101
+ "success": false,
102
+ "steps": 4,
103
+ "first_reveal_step": 1,
104
+ "first_retrieve_step": null,
105
+ "episode_disturbance": 0.0
106
+ },
107
+ {
108
+ "episode_seed": 172005,
109
+ "success": false,
110
+ "steps": 4,
111
+ "first_reveal_step": 1,
112
+ "first_retrieve_step": null,
113
+ "episode_disturbance": 0.0
114
+ },
115
+ {
116
+ "episode_seed": 172006,
117
+ "success": false,
118
+ "steps": 4,
119
+ "first_reveal_step": 1,
120
+ "first_retrieve_step": null,
121
+ "episode_disturbance": 0.0
122
+ },
123
+ {
124
+ "episode_seed": 172007,
125
+ "success": false,
126
+ "steps": 4,
127
+ "first_reveal_step": 1,
128
+ "first_retrieve_step": null,
129
+ "episode_disturbance": 0.0
130
+ },
131
+ {
132
+ "episode_seed": 172008,
133
+ "success": false,
134
+ "steps": 4,
135
+ "first_reveal_step": 1,
136
+ "first_retrieve_step": null,
137
+ "episode_disturbance": 0.0
138
+ },
139
+ {
140
+ "episode_seed": 172009,
141
+ "success": false,
142
+ "steps": 4,
143
+ "first_reveal_step": 1,
144
+ "first_retrieve_step": null,
145
+ "episode_disturbance": 0.0
146
+ },
147
+ {
148
+ "episode_seed": 172010,
149
+ "success": false,
150
+ "steps": 4,
151
+ "first_reveal_step": 1,
152
+ "first_retrieve_step": null,
153
+ "episode_disturbance": 0.0
154
+ },
155
+ {
156
+ "episode_seed": 172011,
157
+ "success": false,
158
+ "steps": 4,
159
+ "first_reveal_step": 1,
160
+ "first_retrieve_step": null,
161
+ "episode_disturbance": 0.0
162
+ },
163
+ {
164
+ "episode_seed": 172012,
165
+ "success": false,
166
+ "steps": 4,
167
+ "first_reveal_step": 1,
168
+ "first_retrieve_step": null,
169
+ "episode_disturbance": 0.0
170
+ },
171
+ {
172
+ "episode_seed": 172013,
173
+ "success": false,
174
+ "steps": 4,
175
+ "first_reveal_step": 1,
176
+ "first_retrieve_step": null,
177
+ "episode_disturbance": 0.0
178
+ },
179
+ {
180
+ "episode_seed": 172014,
181
+ "success": false,
182
+ "steps": 4,
183
+ "first_reveal_step": 1,
184
+ "first_retrieve_step": null,
185
+ "episode_disturbance": 0.0
186
+ },
187
+ {
188
+ "episode_seed": 172015,
189
+ "success": false,
190
+ "steps": 4,
191
+ "first_reveal_step": 1,
192
+ "first_retrieve_step": null,
193
+ "episode_disturbance": 0.0
194
+ },
195
+ {
196
+ "episode_seed": 172016,
197
+ "success": false,
198
+ "steps": 4,
199
+ "first_reveal_step": 1,
200
+ "first_retrieve_step": null,
201
+ "episode_disturbance": 0.0
202
+ },
203
+ {
204
+ "episode_seed": 172017,
205
+ "success": false,
206
+ "steps": 4,
207
+ "first_reveal_step": 1,
208
+ "first_retrieve_step": null,
209
+ "episode_disturbance": 0.0
210
+ },
211
+ {
212
+ "episode_seed": 172018,
213
+ "success": false,
214
+ "steps": 4,
215
+ "first_reveal_step": 1,
216
+ "first_retrieve_step": null,
217
+ "episode_disturbance": 0.0
218
+ },
219
+ {
220
+ "episode_seed": 172019,
221
+ "success": false,
222
+ "steps": 4,
223
+ "first_reveal_step": 1,
224
+ "first_retrieve_step": null,
225
+ "episode_disturbance": 0.0
226
+ },
227
+ {
228
+ "episode_seed": 172020,
229
+ "success": false,
230
+ "steps": 4,
231
+ "first_reveal_step": 1,
232
+ "first_retrieve_step": null,
233
+ "episode_disturbance": 0.0
234
+ },
235
+ {
236
+ "episode_seed": 172021,
237
+ "success": false,
238
+ "steps": 4,
239
+ "first_reveal_step": 1,
240
+ "first_retrieve_step": null,
241
+ "episode_disturbance": 0.0
242
+ },
243
+ {
244
+ "episode_seed": 172022,
245
+ "success": false,
246
+ "steps": 4,
247
+ "first_reveal_step": 1,
248
+ "first_retrieve_step": null,
249
+ "episode_disturbance": 0.0
250
+ },
251
+ {
252
+ "episode_seed": 172023,
253
+ "success": false,
254
+ "steps": 4,
255
+ "first_reveal_step": 1,
256
+ "first_retrieve_step": null,
257
+ "episode_disturbance": 0.0
258
+ },
259
+ {
260
+ "episode_seed": 172024,
261
+ "success": false,
262
+ "steps": 4,
263
+ "first_reveal_step": 1,
264
+ "first_retrieve_step": null,
265
+ "episode_disturbance": 0.0
266
+ },
267
+ {
268
+ "episode_seed": 172025,
269
+ "success": false,
270
+ "steps": 4,
271
+ "first_reveal_step": 1,
272
+ "first_retrieve_step": null,
273
+ "episode_disturbance": 0.0
274
+ },
275
+ {
276
+ "episode_seed": 172026,
277
+ "success": false,
278
+ "steps": 4,
279
+ "first_reveal_step": 1,
280
+ "first_retrieve_step": null,
281
+ "episode_disturbance": 0.0
282
+ },
283
+ {
284
+ "episode_seed": 172027,
285
+ "success": false,
286
+ "steps": 4,
287
+ "first_reveal_step": 1,
288
+ "first_retrieve_step": null,
289
+ "episode_disturbance": 0.0
290
+ },
291
+ {
292
+ "episode_seed": 172028,
293
+ "success": false,
294
+ "steps": 4,
295
+ "first_reveal_step": 1,
296
+ "first_retrieve_step": null,
297
+ "episode_disturbance": 0.0
298
+ },
299
+ {
300
+ "episode_seed": 172029,
301
+ "success": false,
302
+ "steps": 4,
303
+ "first_reveal_step": 1,
304
+ "first_retrieve_step": null,
305
+ "episode_disturbance": 0.0
306
+ },
307
+ {
308
+ "episode_seed": 172030,
309
+ "success": false,
310
+ "steps": 4,
311
+ "first_reveal_step": 1,
312
+ "first_retrieve_step": null,
313
+ "episode_disturbance": 0.0
314
+ },
315
+ {
316
+ "episode_seed": 172031,
317
+ "success": false,
318
+ "steps": 4,
319
+ "first_reveal_step": 1,
320
+ "first_retrieve_step": null,
321
+ "episode_disturbance": 0.0
322
+ },
323
+ {
324
+ "episode_seed": 172032,
325
+ "success": false,
326
+ "steps": 4,
327
+ "first_reveal_step": 1,
328
+ "first_retrieve_step": null,
329
+ "episode_disturbance": 0.0
330
+ },
331
+ {
332
+ "episode_seed": 172033,
333
+ "success": false,
334
+ "steps": 4,
335
+ "first_reveal_step": 1,
336
+ "first_retrieve_step": null,
337
+ "episode_disturbance": 0.0
338
+ },
339
+ {
340
+ "episode_seed": 172034,
341
+ "success": false,
342
+ "steps": 4,
343
+ "first_reveal_step": 1,
344
+ "first_retrieve_step": null,
345
+ "episode_disturbance": 0.0
346
+ },
347
+ {
348
+ "episode_seed": 172035,
349
+ "success": false,
350
+ "steps": 4,
351
+ "first_reveal_step": 1,
352
+ "first_retrieve_step": null,
353
+ "episode_disturbance": 0.0
354
+ },
355
+ {
356
+ "episode_seed": 172036,
357
+ "success": false,
358
+ "steps": 4,
359
+ "first_reveal_step": 1,
360
+ "first_retrieve_step": null,
361
+ "episode_disturbance": 0.0
362
+ },
363
+ {
364
+ "episode_seed": 172037,
365
+ "success": false,
366
+ "steps": 4,
367
+ "first_reveal_step": 1,
368
+ "first_retrieve_step": null,
369
+ "episode_disturbance": 0.0
370
+ },
371
+ {
372
+ "episode_seed": 172038,
373
+ "success": false,
374
+ "steps": 4,
375
+ "first_reveal_step": 1,
376
+ "first_retrieve_step": null,
377
+ "episode_disturbance": 0.0
378
+ },
379
+ {
380
+ "episode_seed": 172039,
381
+ "success": false,
382
+ "steps": 4,
383
+ "first_reveal_step": 1,
384
+ "first_retrieve_step": null,
385
+ "episode_disturbance": 0.0
386
+ },
387
+ {
388
+ "episode_seed": 172040,
389
+ "success": false,
390
+ "steps": 4,
391
+ "first_reveal_step": 1,
392
+ "first_retrieve_step": null,
393
+ "episode_disturbance": 0.0
394
+ },
395
+ {
396
+ "episode_seed": 172041,
397
+ "success": false,
398
+ "steps": 4,
399
+ "first_reveal_step": 1,
400
+ "first_retrieve_step": null,
401
+ "episode_disturbance": 0.0
402
+ },
403
+ {
404
+ "episode_seed": 172042,
405
+ "success": false,
406
+ "steps": 4,
407
+ "first_reveal_step": 1,
408
+ "first_retrieve_step": null,
409
+ "episode_disturbance": 0.0
410
+ },
411
+ {
412
+ "episode_seed": 172043,
413
+ "success": false,
414
+ "steps": 4,
415
+ "first_reveal_step": 1,
416
+ "first_retrieve_step": null,
417
+ "episode_disturbance": 0.0
418
+ },
419
+ {
420
+ "episode_seed": 172044,
421
+ "success": false,
422
+ "steps": 4,
423
+ "first_reveal_step": 1,
424
+ "first_retrieve_step": null,
425
+ "episode_disturbance": 0.0
426
+ },
427
+ {
428
+ "episode_seed": 172045,
429
+ "success": false,
430
+ "steps": 4,
431
+ "first_reveal_step": 1,
432
+ "first_retrieve_step": null,
433
+ "episode_disturbance": 0.0
434
+ },
435
+ {
436
+ "episode_seed": 172046,
437
+ "success": false,
438
+ "steps": 4,
439
+ "first_reveal_step": 1,
440
+ "first_retrieve_step": null,
441
+ "episode_disturbance": 0.0
442
+ },
443
+ {
444
+ "episode_seed": 172047,
445
+ "success": false,
446
+ "steps": 4,
447
+ "first_reveal_step": 1,
448
+ "first_retrieve_step": null,
449
+ "episode_disturbance": 0.0
450
+ },
451
+ {
452
+ "episode_seed": 172048,
453
+ "success": false,
454
+ "steps": 4,
455
+ "first_reveal_step": 1,
456
+ "first_retrieve_step": null,
457
+ "episode_disturbance": 0.0
458
+ },
459
+ {
460
+ "episode_seed": 172049,
461
+ "success": false,
462
+ "steps": 4,
463
+ "first_reveal_step": 1,
464
+ "first_retrieve_step": null,
465
+ "episode_disturbance": 0.0
466
+ }
467
+ ],
468
+ "eval_protocol": {
469
+ "track_id": "bag_track",
470
+ "suite": "maniskill3",
471
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
472
+ "role": "target",
473
+ "eval_mode": "adapter_active_ft",
474
+ "seed": 17,
475
+ "episodes": 50,
476
+ "resolution": 224,
477
+ "cameras": [
478
+ "front",
479
+ "left",
480
+ "right"
481
+ ],
482
+ "observation_stack": "rgb_triplicate_zero_depth",
483
+ "action_horizon": 8,
484
+ "action_space": "widowx_delta_pose",
485
+ "same_test_episodes": true
486
+ },
487
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
488
+ "train_spec": {
489
+ "track_id": "bag_track",
490
+ "suite": "maniskill3",
491
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
492
+ "model_variant": "adapter_active_ft",
493
+ "seed": 23,
494
+ "train_demos": 32,
495
+ "val_demos": 8,
496
+ "init_checkpoint_group": "/workspace/workspace/VLAarchtests2/VLAarchtests/artifacts/outputs/r3d_handoff_phase/proxy_interaction_r3d_stage3_clip_rgbd_handoff_compact_phase_seed17/checkpoint_best.pt",
497
+ "optimizer": "adamw",
498
+ "learning_rate": 0.0001,
499
+ "lr_schedule": "constant",
500
+ "batch_size": 4,
501
+ "augmentations": "none",
502
+ "early_stopping_metric": "val_total",
503
+ "max_gradient_steps": 114,
504
+ "unfreeze_scope": "fusion_memory_decoder",
505
+ "dataset_split_id": "bag_bridge_smoke_v1_dataset_seed17",
506
+ "same_data_policy": true,
507
+ "same_init_policy": true
508
+ }
509
+ }
reports/maniskill_bag_bridge_smoke_v1/adapter_noop_seed17.json ADDED
@@ -0,0 +1,488 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "adapter_noop",
7
+ "episodes": 50,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 0,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 0,
17
+ 0,
18
+ 0,
19
+ 0,
20
+ 0,
21
+ 0,
22
+ 0,
23
+ 0,
24
+ 0,
25
+ 0,
26
+ 0,
27
+ 0,
28
+ 0,
29
+ 0,
30
+ 0,
31
+ 0,
32
+ 0,
33
+ 0,
34
+ 0,
35
+ 0,
36
+ 0,
37
+ 0,
38
+ 0,
39
+ 0,
40
+ 0,
41
+ 0,
42
+ 0,
43
+ 0,
44
+ 0,
45
+ 0,
46
+ 0,
47
+ 0,
48
+ 0,
49
+ 0,
50
+ 0,
51
+ 0,
52
+ 0,
53
+ 0,
54
+ 0,
55
+ 0,
56
+ 0,
57
+ 0,
58
+ 0
59
+ ],
60
+ "success_rate": 0.0,
61
+ "intervention_rate": 0.0,
62
+ "non_base_selection_rate": 0.0,
63
+ "steps_to_first_reveal_or_access": 4.0,
64
+ "steps_to_retrieve": 4.0,
65
+ "disturbance_proxy": 0.0,
66
+ "episode_records": [
67
+ {
68
+ "episode_seed": 172000,
69
+ "success": false,
70
+ "steps": 4,
71
+ "first_reveal_step": null,
72
+ "first_retrieve_step": null,
73
+ "episode_disturbance": 0.0
74
+ },
75
+ {
76
+ "episode_seed": 172001,
77
+ "success": false,
78
+ "steps": 4,
79
+ "first_reveal_step": null,
80
+ "first_retrieve_step": null,
81
+ "episode_disturbance": 0.0
82
+ },
83
+ {
84
+ "episode_seed": 172002,
85
+ "success": false,
86
+ "steps": 4,
87
+ "first_reveal_step": null,
88
+ "first_retrieve_step": null,
89
+ "episode_disturbance": 0.0
90
+ },
91
+ {
92
+ "episode_seed": 172003,
93
+ "success": false,
94
+ "steps": 4,
95
+ "first_reveal_step": null,
96
+ "first_retrieve_step": null,
97
+ "episode_disturbance": 0.0
98
+ },
99
+ {
100
+ "episode_seed": 172004,
101
+ "success": false,
102
+ "steps": 4,
103
+ "first_reveal_step": null,
104
+ "first_retrieve_step": null,
105
+ "episode_disturbance": 0.0
106
+ },
107
+ {
108
+ "episode_seed": 172005,
109
+ "success": false,
110
+ "steps": 4,
111
+ "first_reveal_step": null,
112
+ "first_retrieve_step": null,
113
+ "episode_disturbance": 0.0
114
+ },
115
+ {
116
+ "episode_seed": 172006,
117
+ "success": false,
118
+ "steps": 4,
119
+ "first_reveal_step": null,
120
+ "first_retrieve_step": null,
121
+ "episode_disturbance": 0.0
122
+ },
123
+ {
124
+ "episode_seed": 172007,
125
+ "success": false,
126
+ "steps": 4,
127
+ "first_reveal_step": null,
128
+ "first_retrieve_step": null,
129
+ "episode_disturbance": 0.0
130
+ },
131
+ {
132
+ "episode_seed": 172008,
133
+ "success": false,
134
+ "steps": 4,
135
+ "first_reveal_step": null,
136
+ "first_retrieve_step": null,
137
+ "episode_disturbance": 0.0
138
+ },
139
+ {
140
+ "episode_seed": 172009,
141
+ "success": false,
142
+ "steps": 4,
143
+ "first_reveal_step": null,
144
+ "first_retrieve_step": null,
145
+ "episode_disturbance": 0.0
146
+ },
147
+ {
148
+ "episode_seed": 172010,
149
+ "success": false,
150
+ "steps": 4,
151
+ "first_reveal_step": null,
152
+ "first_retrieve_step": null,
153
+ "episode_disturbance": 0.0
154
+ },
155
+ {
156
+ "episode_seed": 172011,
157
+ "success": false,
158
+ "steps": 4,
159
+ "first_reveal_step": null,
160
+ "first_retrieve_step": null,
161
+ "episode_disturbance": 0.0
162
+ },
163
+ {
164
+ "episode_seed": 172012,
165
+ "success": false,
166
+ "steps": 4,
167
+ "first_reveal_step": null,
168
+ "first_retrieve_step": null,
169
+ "episode_disturbance": 0.0
170
+ },
171
+ {
172
+ "episode_seed": 172013,
173
+ "success": false,
174
+ "steps": 4,
175
+ "first_reveal_step": null,
176
+ "first_retrieve_step": null,
177
+ "episode_disturbance": 0.0
178
+ },
179
+ {
180
+ "episode_seed": 172014,
181
+ "success": false,
182
+ "steps": 4,
183
+ "first_reveal_step": null,
184
+ "first_retrieve_step": null,
185
+ "episode_disturbance": 0.0
186
+ },
187
+ {
188
+ "episode_seed": 172015,
189
+ "success": false,
190
+ "steps": 4,
191
+ "first_reveal_step": null,
192
+ "first_retrieve_step": null,
193
+ "episode_disturbance": 0.0
194
+ },
195
+ {
196
+ "episode_seed": 172016,
197
+ "success": false,
198
+ "steps": 4,
199
+ "first_reveal_step": null,
200
+ "first_retrieve_step": null,
201
+ "episode_disturbance": 0.0
202
+ },
203
+ {
204
+ "episode_seed": 172017,
205
+ "success": false,
206
+ "steps": 4,
207
+ "first_reveal_step": null,
208
+ "first_retrieve_step": null,
209
+ "episode_disturbance": 0.0
210
+ },
211
+ {
212
+ "episode_seed": 172018,
213
+ "success": false,
214
+ "steps": 4,
215
+ "first_reveal_step": null,
216
+ "first_retrieve_step": null,
217
+ "episode_disturbance": 0.0
218
+ },
219
+ {
220
+ "episode_seed": 172019,
221
+ "success": false,
222
+ "steps": 4,
223
+ "first_reveal_step": null,
224
+ "first_retrieve_step": null,
225
+ "episode_disturbance": 0.0
226
+ },
227
+ {
228
+ "episode_seed": 172020,
229
+ "success": false,
230
+ "steps": 4,
231
+ "first_reveal_step": null,
232
+ "first_retrieve_step": null,
233
+ "episode_disturbance": 0.0
234
+ },
235
+ {
236
+ "episode_seed": 172021,
237
+ "success": false,
238
+ "steps": 4,
239
+ "first_reveal_step": null,
240
+ "first_retrieve_step": null,
241
+ "episode_disturbance": 0.0
242
+ },
243
+ {
244
+ "episode_seed": 172022,
245
+ "success": false,
246
+ "steps": 4,
247
+ "first_reveal_step": null,
248
+ "first_retrieve_step": null,
249
+ "episode_disturbance": 0.0
250
+ },
251
+ {
252
+ "episode_seed": 172023,
253
+ "success": false,
254
+ "steps": 4,
255
+ "first_reveal_step": null,
256
+ "first_retrieve_step": null,
257
+ "episode_disturbance": 0.0
258
+ },
259
+ {
260
+ "episode_seed": 172024,
261
+ "success": false,
262
+ "steps": 4,
263
+ "first_reveal_step": null,
264
+ "first_retrieve_step": null,
265
+ "episode_disturbance": 0.0
266
+ },
267
+ {
268
+ "episode_seed": 172025,
269
+ "success": false,
270
+ "steps": 4,
271
+ "first_reveal_step": null,
272
+ "first_retrieve_step": null,
273
+ "episode_disturbance": 0.0
274
+ },
275
+ {
276
+ "episode_seed": 172026,
277
+ "success": false,
278
+ "steps": 4,
279
+ "first_reveal_step": null,
280
+ "first_retrieve_step": null,
281
+ "episode_disturbance": 0.0
282
+ },
283
+ {
284
+ "episode_seed": 172027,
285
+ "success": false,
286
+ "steps": 4,
287
+ "first_reveal_step": null,
288
+ "first_retrieve_step": null,
289
+ "episode_disturbance": 0.0
290
+ },
291
+ {
292
+ "episode_seed": 172028,
293
+ "success": false,
294
+ "steps": 4,
295
+ "first_reveal_step": null,
296
+ "first_retrieve_step": null,
297
+ "episode_disturbance": 0.0
298
+ },
299
+ {
300
+ "episode_seed": 172029,
301
+ "success": false,
302
+ "steps": 4,
303
+ "first_reveal_step": null,
304
+ "first_retrieve_step": null,
305
+ "episode_disturbance": 0.0
306
+ },
307
+ {
308
+ "episode_seed": 172030,
309
+ "success": false,
310
+ "steps": 4,
311
+ "first_reveal_step": null,
312
+ "first_retrieve_step": null,
313
+ "episode_disturbance": 0.0
314
+ },
315
+ {
316
+ "episode_seed": 172031,
317
+ "success": false,
318
+ "steps": 4,
319
+ "first_reveal_step": null,
320
+ "first_retrieve_step": null,
321
+ "episode_disturbance": 0.0
322
+ },
323
+ {
324
+ "episode_seed": 172032,
325
+ "success": false,
326
+ "steps": 4,
327
+ "first_reveal_step": null,
328
+ "first_retrieve_step": null,
329
+ "episode_disturbance": 0.0
330
+ },
331
+ {
332
+ "episode_seed": 172033,
333
+ "success": false,
334
+ "steps": 4,
335
+ "first_reveal_step": null,
336
+ "first_retrieve_step": null,
337
+ "episode_disturbance": 0.0
338
+ },
339
+ {
340
+ "episode_seed": 172034,
341
+ "success": false,
342
+ "steps": 4,
343
+ "first_reveal_step": null,
344
+ "first_retrieve_step": null,
345
+ "episode_disturbance": 0.0
346
+ },
347
+ {
348
+ "episode_seed": 172035,
349
+ "success": false,
350
+ "steps": 4,
351
+ "first_reveal_step": null,
352
+ "first_retrieve_step": null,
353
+ "episode_disturbance": 0.0
354
+ },
355
+ {
356
+ "episode_seed": 172036,
357
+ "success": false,
358
+ "steps": 4,
359
+ "first_reveal_step": null,
360
+ "first_retrieve_step": null,
361
+ "episode_disturbance": 0.0
362
+ },
363
+ {
364
+ "episode_seed": 172037,
365
+ "success": false,
366
+ "steps": 4,
367
+ "first_reveal_step": null,
368
+ "first_retrieve_step": null,
369
+ "episode_disturbance": 0.0
370
+ },
371
+ {
372
+ "episode_seed": 172038,
373
+ "success": false,
374
+ "steps": 4,
375
+ "first_reveal_step": null,
376
+ "first_retrieve_step": null,
377
+ "episode_disturbance": 0.0
378
+ },
379
+ {
380
+ "episode_seed": 172039,
381
+ "success": false,
382
+ "steps": 4,
383
+ "first_reveal_step": null,
384
+ "first_retrieve_step": null,
385
+ "episode_disturbance": 0.0
386
+ },
387
+ {
388
+ "episode_seed": 172040,
389
+ "success": false,
390
+ "steps": 4,
391
+ "first_reveal_step": null,
392
+ "first_retrieve_step": null,
393
+ "episode_disturbance": 0.0
394
+ },
395
+ {
396
+ "episode_seed": 172041,
397
+ "success": false,
398
+ "steps": 4,
399
+ "first_reveal_step": null,
400
+ "first_retrieve_step": null,
401
+ "episode_disturbance": 0.0
402
+ },
403
+ {
404
+ "episode_seed": 172042,
405
+ "success": false,
406
+ "steps": 4,
407
+ "first_reveal_step": null,
408
+ "first_retrieve_step": null,
409
+ "episode_disturbance": 0.0
410
+ },
411
+ {
412
+ "episode_seed": 172043,
413
+ "success": false,
414
+ "steps": 4,
415
+ "first_reveal_step": null,
416
+ "first_retrieve_step": null,
417
+ "episode_disturbance": 0.0
418
+ },
419
+ {
420
+ "episode_seed": 172044,
421
+ "success": false,
422
+ "steps": 4,
423
+ "first_reveal_step": null,
424
+ "first_retrieve_step": null,
425
+ "episode_disturbance": 0.0
426
+ },
427
+ {
428
+ "episode_seed": 172045,
429
+ "success": false,
430
+ "steps": 4,
431
+ "first_reveal_step": null,
432
+ "first_retrieve_step": null,
433
+ "episode_disturbance": 0.0
434
+ },
435
+ {
436
+ "episode_seed": 172046,
437
+ "success": false,
438
+ "steps": 4,
439
+ "first_reveal_step": null,
440
+ "first_retrieve_step": null,
441
+ "episode_disturbance": 0.0
442
+ },
443
+ {
444
+ "episode_seed": 172047,
445
+ "success": false,
446
+ "steps": 4,
447
+ "first_reveal_step": null,
448
+ "first_retrieve_step": null,
449
+ "episode_disturbance": 0.0
450
+ },
451
+ {
452
+ "episode_seed": 172048,
453
+ "success": false,
454
+ "steps": 4,
455
+ "first_reveal_step": null,
456
+ "first_retrieve_step": null,
457
+ "episode_disturbance": 0.0
458
+ },
459
+ {
460
+ "episode_seed": 172049,
461
+ "success": false,
462
+ "steps": 4,
463
+ "first_reveal_step": null,
464
+ "first_retrieve_step": null,
465
+ "episode_disturbance": 0.0
466
+ }
467
+ ],
468
+ "eval_protocol": {
469
+ "track_id": "bag_track",
470
+ "suite": "maniskill3",
471
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
472
+ "role": "target",
473
+ "eval_mode": "adapter_noop",
474
+ "seed": 17,
475
+ "episodes": 50,
476
+ "resolution": 224,
477
+ "cameras": [
478
+ "front",
479
+ "left",
480
+ "right"
481
+ ],
482
+ "observation_stack": "rgb_triplicate_zero_depth",
483
+ "action_horizon": 8,
484
+ "action_space": "widowx_delta_pose",
485
+ "same_test_episodes": true
486
+ },
487
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone."
488
+ }
reports/maniskill_bag_bridge_smoke_v1/adapter_noop_seed23.json ADDED
@@ -0,0 +1,488 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "adapter_noop",
7
+ "episodes": 50,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 0,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 0,
17
+ 0,
18
+ 0,
19
+ 0,
20
+ 0,
21
+ 0,
22
+ 0,
23
+ 0,
24
+ 1,
25
+ 0,
26
+ 1,
27
+ 0,
28
+ 0,
29
+ 0,
30
+ 0,
31
+ 0,
32
+ 1,
33
+ 0,
34
+ 1,
35
+ 0,
36
+ 0,
37
+ 0,
38
+ 0,
39
+ 0,
40
+ 0,
41
+ 0,
42
+ 0,
43
+ 0,
44
+ 0,
45
+ 0,
46
+ 0,
47
+ 0,
48
+ 0,
49
+ 0,
50
+ 0,
51
+ 0,
52
+ 0,
53
+ 0,
54
+ 0,
55
+ 0,
56
+ 0,
57
+ 0,
58
+ 0
59
+ ],
60
+ "success_rate": 0.08,
61
+ "intervention_rate": 0.0,
62
+ "non_base_selection_rate": 0.0,
63
+ "steps_to_first_reveal_or_access": 1.0,
64
+ "steps_to_retrieve": 4.0,
65
+ "disturbance_proxy": 0.0,
66
+ "episode_records": [
67
+ {
68
+ "episode_seed": 172000,
69
+ "success": false,
70
+ "steps": 4,
71
+ "first_reveal_step": 1,
72
+ "first_retrieve_step": null,
73
+ "episode_disturbance": 0.0
74
+ },
75
+ {
76
+ "episode_seed": 172001,
77
+ "success": false,
78
+ "steps": 4,
79
+ "first_reveal_step": 1,
80
+ "first_retrieve_step": null,
81
+ "episode_disturbance": 0.0
82
+ },
83
+ {
84
+ "episode_seed": 172002,
85
+ "success": false,
86
+ "steps": 4,
87
+ "first_reveal_step": 1,
88
+ "first_retrieve_step": null,
89
+ "episode_disturbance": 0.0
90
+ },
91
+ {
92
+ "episode_seed": 172003,
93
+ "success": false,
94
+ "steps": 4,
95
+ "first_reveal_step": 1,
96
+ "first_retrieve_step": null,
97
+ "episode_disturbance": 0.0
98
+ },
99
+ {
100
+ "episode_seed": 172004,
101
+ "success": false,
102
+ "steps": 4,
103
+ "first_reveal_step": 1,
104
+ "first_retrieve_step": null,
105
+ "episode_disturbance": 0.0
106
+ },
107
+ {
108
+ "episode_seed": 172005,
109
+ "success": false,
110
+ "steps": 4,
111
+ "first_reveal_step": 1,
112
+ "first_retrieve_step": null,
113
+ "episode_disturbance": 0.0
114
+ },
115
+ {
116
+ "episode_seed": 172006,
117
+ "success": false,
118
+ "steps": 4,
119
+ "first_reveal_step": 1,
120
+ "first_retrieve_step": null,
121
+ "episode_disturbance": 0.0
122
+ },
123
+ {
124
+ "episode_seed": 172007,
125
+ "success": false,
126
+ "steps": 4,
127
+ "first_reveal_step": 1,
128
+ "first_retrieve_step": null,
129
+ "episode_disturbance": 0.0
130
+ },
131
+ {
132
+ "episode_seed": 172008,
133
+ "success": false,
134
+ "steps": 4,
135
+ "first_reveal_step": 1,
136
+ "first_retrieve_step": null,
137
+ "episode_disturbance": 0.0
138
+ },
139
+ {
140
+ "episode_seed": 172009,
141
+ "success": false,
142
+ "steps": 4,
143
+ "first_reveal_step": 1,
144
+ "first_retrieve_step": null,
145
+ "episode_disturbance": 0.0
146
+ },
147
+ {
148
+ "episode_seed": 172010,
149
+ "success": false,
150
+ "steps": 4,
151
+ "first_reveal_step": 1,
152
+ "first_retrieve_step": null,
153
+ "episode_disturbance": 0.0
154
+ },
155
+ {
156
+ "episode_seed": 172011,
157
+ "success": false,
158
+ "steps": 4,
159
+ "first_reveal_step": 1,
160
+ "first_retrieve_step": null,
161
+ "episode_disturbance": 0.0
162
+ },
163
+ {
164
+ "episode_seed": 172012,
165
+ "success": false,
166
+ "steps": 4,
167
+ "first_reveal_step": 1,
168
+ "first_retrieve_step": null,
169
+ "episode_disturbance": 0.0
170
+ },
171
+ {
172
+ "episode_seed": 172013,
173
+ "success": false,
174
+ "steps": 4,
175
+ "first_reveal_step": 1,
176
+ "first_retrieve_step": null,
177
+ "episode_disturbance": 0.0
178
+ },
179
+ {
180
+ "episode_seed": 172014,
181
+ "success": false,
182
+ "steps": 4,
183
+ "first_reveal_step": 1,
184
+ "first_retrieve_step": null,
185
+ "episode_disturbance": 0.0
186
+ },
187
+ {
188
+ "episode_seed": 172015,
189
+ "success": true,
190
+ "steps": 1,
191
+ "first_reveal_step": 1,
192
+ "first_retrieve_step": null,
193
+ "episode_disturbance": 0.0
194
+ },
195
+ {
196
+ "episode_seed": 172016,
197
+ "success": false,
198
+ "steps": 4,
199
+ "first_reveal_step": 1,
200
+ "first_retrieve_step": null,
201
+ "episode_disturbance": 0.0
202
+ },
203
+ {
204
+ "episode_seed": 172017,
205
+ "success": true,
206
+ "steps": 1,
207
+ "first_reveal_step": 1,
208
+ "first_retrieve_step": null,
209
+ "episode_disturbance": 0.0
210
+ },
211
+ {
212
+ "episode_seed": 172018,
213
+ "success": false,
214
+ "steps": 4,
215
+ "first_reveal_step": 1,
216
+ "first_retrieve_step": null,
217
+ "episode_disturbance": 0.0
218
+ },
219
+ {
220
+ "episode_seed": 172019,
221
+ "success": false,
222
+ "steps": 4,
223
+ "first_reveal_step": 1,
224
+ "first_retrieve_step": null,
225
+ "episode_disturbance": 0.0
226
+ },
227
+ {
228
+ "episode_seed": 172020,
229
+ "success": false,
230
+ "steps": 4,
231
+ "first_reveal_step": 1,
232
+ "first_retrieve_step": null,
233
+ "episode_disturbance": 0.0
234
+ },
235
+ {
236
+ "episode_seed": 172021,
237
+ "success": false,
238
+ "steps": 4,
239
+ "first_reveal_step": 1,
240
+ "first_retrieve_step": null,
241
+ "episode_disturbance": 0.0
242
+ },
243
+ {
244
+ "episode_seed": 172022,
245
+ "success": false,
246
+ "steps": 4,
247
+ "first_reveal_step": 1,
248
+ "first_retrieve_step": null,
249
+ "episode_disturbance": 0.0
250
+ },
251
+ {
252
+ "episode_seed": 172023,
253
+ "success": true,
254
+ "steps": 1,
255
+ "first_reveal_step": 1,
256
+ "first_retrieve_step": null,
257
+ "episode_disturbance": 0.0
258
+ },
259
+ {
260
+ "episode_seed": 172024,
261
+ "success": false,
262
+ "steps": 4,
263
+ "first_reveal_step": 1,
264
+ "first_retrieve_step": null,
265
+ "episode_disturbance": 0.0
266
+ },
267
+ {
268
+ "episode_seed": 172025,
269
+ "success": true,
270
+ "steps": 1,
271
+ "first_reveal_step": 1,
272
+ "first_retrieve_step": null,
273
+ "episode_disturbance": 0.0
274
+ },
275
+ {
276
+ "episode_seed": 172026,
277
+ "success": false,
278
+ "steps": 4,
279
+ "first_reveal_step": 1,
280
+ "first_retrieve_step": null,
281
+ "episode_disturbance": 0.0
282
+ },
283
+ {
284
+ "episode_seed": 172027,
285
+ "success": false,
286
+ "steps": 4,
287
+ "first_reveal_step": 1,
288
+ "first_retrieve_step": null,
289
+ "episode_disturbance": 0.0
290
+ },
291
+ {
292
+ "episode_seed": 172028,
293
+ "success": false,
294
+ "steps": 4,
295
+ "first_reveal_step": 1,
296
+ "first_retrieve_step": null,
297
+ "episode_disturbance": 0.0
298
+ },
299
+ {
300
+ "episode_seed": 172029,
301
+ "success": false,
302
+ "steps": 4,
303
+ "first_reveal_step": 1,
304
+ "first_retrieve_step": null,
305
+ "episode_disturbance": 0.0
306
+ },
307
+ {
308
+ "episode_seed": 172030,
309
+ "success": false,
310
+ "steps": 4,
311
+ "first_reveal_step": 1,
312
+ "first_retrieve_step": null,
313
+ "episode_disturbance": 0.0
314
+ },
315
+ {
316
+ "episode_seed": 172031,
317
+ "success": false,
318
+ "steps": 4,
319
+ "first_reveal_step": 1,
320
+ "first_retrieve_step": null,
321
+ "episode_disturbance": 0.0
322
+ },
323
+ {
324
+ "episode_seed": 172032,
325
+ "success": false,
326
+ "steps": 4,
327
+ "first_reveal_step": 1,
328
+ "first_retrieve_step": null,
329
+ "episode_disturbance": 0.0
330
+ },
331
+ {
332
+ "episode_seed": 172033,
333
+ "success": false,
334
+ "steps": 4,
335
+ "first_reveal_step": 1,
336
+ "first_retrieve_step": null,
337
+ "episode_disturbance": 0.0
338
+ },
339
+ {
340
+ "episode_seed": 172034,
341
+ "success": false,
342
+ "steps": 4,
343
+ "first_reveal_step": 1,
344
+ "first_retrieve_step": null,
345
+ "episode_disturbance": 0.0
346
+ },
347
+ {
348
+ "episode_seed": 172035,
349
+ "success": false,
350
+ "steps": 4,
351
+ "first_reveal_step": 1,
352
+ "first_retrieve_step": null,
353
+ "episode_disturbance": 0.0
354
+ },
355
+ {
356
+ "episode_seed": 172036,
357
+ "success": false,
358
+ "steps": 4,
359
+ "first_reveal_step": 1,
360
+ "first_retrieve_step": null,
361
+ "episode_disturbance": 0.0
362
+ },
363
+ {
364
+ "episode_seed": 172037,
365
+ "success": false,
366
+ "steps": 4,
367
+ "first_reveal_step": 1,
368
+ "first_retrieve_step": null,
369
+ "episode_disturbance": 0.0
370
+ },
371
+ {
372
+ "episode_seed": 172038,
373
+ "success": false,
374
+ "steps": 4,
375
+ "first_reveal_step": 1,
376
+ "first_retrieve_step": null,
377
+ "episode_disturbance": 0.0
378
+ },
379
+ {
380
+ "episode_seed": 172039,
381
+ "success": false,
382
+ "steps": 4,
383
+ "first_reveal_step": 1,
384
+ "first_retrieve_step": null,
385
+ "episode_disturbance": 0.0
386
+ },
387
+ {
388
+ "episode_seed": 172040,
389
+ "success": false,
390
+ "steps": 4,
391
+ "first_reveal_step": 1,
392
+ "first_retrieve_step": null,
393
+ "episode_disturbance": 0.0
394
+ },
395
+ {
396
+ "episode_seed": 172041,
397
+ "success": false,
398
+ "steps": 4,
399
+ "first_reveal_step": 1,
400
+ "first_retrieve_step": null,
401
+ "episode_disturbance": 0.0
402
+ },
403
+ {
404
+ "episode_seed": 172042,
405
+ "success": false,
406
+ "steps": 4,
407
+ "first_reveal_step": 1,
408
+ "first_retrieve_step": null,
409
+ "episode_disturbance": 0.0
410
+ },
411
+ {
412
+ "episode_seed": 172043,
413
+ "success": false,
414
+ "steps": 4,
415
+ "first_reveal_step": 1,
416
+ "first_retrieve_step": null,
417
+ "episode_disturbance": 0.0
418
+ },
419
+ {
420
+ "episode_seed": 172044,
421
+ "success": false,
422
+ "steps": 4,
423
+ "first_reveal_step": 1,
424
+ "first_retrieve_step": null,
425
+ "episode_disturbance": 0.0
426
+ },
427
+ {
428
+ "episode_seed": 172045,
429
+ "success": false,
430
+ "steps": 4,
431
+ "first_reveal_step": 1,
432
+ "first_retrieve_step": null,
433
+ "episode_disturbance": 0.0
434
+ },
435
+ {
436
+ "episode_seed": 172046,
437
+ "success": false,
438
+ "steps": 4,
439
+ "first_reveal_step": 1,
440
+ "first_retrieve_step": null,
441
+ "episode_disturbance": 0.0
442
+ },
443
+ {
444
+ "episode_seed": 172047,
445
+ "success": false,
446
+ "steps": 4,
447
+ "first_reveal_step": 1,
448
+ "first_retrieve_step": null,
449
+ "episode_disturbance": 0.0
450
+ },
451
+ {
452
+ "episode_seed": 172048,
453
+ "success": false,
454
+ "steps": 4,
455
+ "first_reveal_step": 1,
456
+ "first_retrieve_step": null,
457
+ "episode_disturbance": 0.0
458
+ },
459
+ {
460
+ "episode_seed": 172049,
461
+ "success": false,
462
+ "steps": 4,
463
+ "first_reveal_step": 1,
464
+ "first_retrieve_step": null,
465
+ "episode_disturbance": 0.0
466
+ }
467
+ ],
468
+ "eval_protocol": {
469
+ "track_id": "bag_track",
470
+ "suite": "maniskill3",
471
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
472
+ "role": "target",
473
+ "eval_mode": "adapter_noop",
474
+ "seed": 17,
475
+ "episodes": 50,
476
+ "resolution": 224,
477
+ "cameras": [
478
+ "front",
479
+ "left",
480
+ "right"
481
+ ],
482
+ "observation_stack": "rgb_triplicate_zero_depth",
483
+ "action_horizon": 8,
484
+ "action_space": "widowx_delta_pose",
485
+ "same_test_episodes": true
486
+ },
487
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone."
488
+ }
reports/maniskill_bag_bridge_smoke_v1/public_benchmark_package_summary.json ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "anchor_pass": true,
3
+ "available_tracks": [
4
+ "bag_track"
5
+ ],
6
+ "ci_above_zero_tracks": [],
7
+ "headline_pass": false,
8
+ "package_name": "public_reveal_retrieve_package_v1",
9
+ "sign_of_life_pass": false,
10
+ "sign_of_life_track_count": 0,
11
+ "sign_of_life_tracks": [],
12
+ "target_macro_average_delta": -0.47999998927116394,
13
+ "tracks": {
14
+ "bag_track": {
15
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
16
+ "delta_active_vs_trunk": -0.47999998927116394,
17
+ "delta_active_vs_trunk_ci95": [
18
+ -0.6200000047683716,
19
+ -0.3400000035762787
20
+ ],
21
+ "delta_noop_vs_trunk": -0.3999999910593033,
22
+ "modes": {
23
+ "adapter_active_ft": {
24
+ "disturbance_proxy": 0.0,
25
+ "intervention_rate": 1.0,
26
+ "mean_success": 0.0,
27
+ "non_base_selection_rate": 1.0,
28
+ "num_runs": 1,
29
+ "steps_to_first_reveal_or_access": 1.0,
30
+ "steps_to_retrieve": 4.0
31
+ },
32
+ "adapter_noop": {
33
+ "disturbance_proxy": 0.0,
34
+ "intervention_rate": 0.0,
35
+ "mean_success": 0.07999999821186066,
36
+ "non_base_selection_rate": 0.0,
37
+ "num_runs": 1,
38
+ "steps_to_first_reveal_or_access": 1.0,
39
+ "steps_to_retrieve": 4.0
40
+ },
41
+ "trunk_only_ft": {
42
+ "disturbance_proxy": 0.0,
43
+ "intervention_rate": 0.0,
44
+ "mean_success": 0.47999998927116394,
45
+ "non_base_selection_rate": 0.0,
46
+ "num_runs": 1,
47
+ "steps_to_first_reveal_or_access": 1.0,
48
+ "steps_to_retrieve": 4.0
49
+ }
50
+ },
51
+ "notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
52
+ "public_source": "ManiSkill public scene proxy: PutEggplantInBasketScene-v1",
53
+ "role": "target",
54
+ "signs_of_life": false,
55
+ "suite": "maniskill3",
56
+ "target_behavior": "retrieve the target object from inside the basket and stage it outside the basket",
57
+ "task_family": "bag_retrieval_proxy"
58
+ }
59
+ }
60
+ }
reports/maniskill_bag_bridge_smoke_v1/public_benchmark_package_summary.md ADDED
@@ -0,0 +1,15 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ManiSkill Bag Retrieval Smoke Summary
2
+
3
+ - benchmark_task: PutEggplantInBasketRetrievalProxy-v1
4
+ - target_macro_average_delta: -0.480
5
+ - headline_pass: False
6
+ - sign_of_life_pass: False
7
+
8
+ ## bag_track
9
+ - delta_active_vs_trunk: -0.480
10
+ - delta_noop_vs_trunk: -0.400
11
+ - signs_of_life: False
12
+ - delta_active_vs_trunk_ci95: [-0.620, -0.340]
13
+ - trunk_only_ft: mean_success=0.480
14
+ - adapter_noop: mean_success=0.080
15
+ - adapter_active_ft: mean_success=0.000
reports/maniskill_bag_bridge_smoke_v1/trunk_only_ft_seed17.json ADDED
@@ -0,0 +1,509 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "trunk_only_ft",
7
+ "episodes": 50,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 1,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 0,
17
+ 0,
18
+ 1,
19
+ 1,
20
+ 0,
21
+ 0,
22
+ 1,
23
+ 0,
24
+ 0,
25
+ 0,
26
+ 0,
27
+ 0,
28
+ 0,
29
+ 0,
30
+ 0,
31
+ 0,
32
+ 0,
33
+ 1,
34
+ 1,
35
+ 0,
36
+ 0,
37
+ 0,
38
+ 0,
39
+ 0,
40
+ 1,
41
+ 1,
42
+ 1,
43
+ 0,
44
+ 0,
45
+ 0,
46
+ 1,
47
+ 0,
48
+ 0,
49
+ 0,
50
+ 1,
51
+ 0,
52
+ 0,
53
+ 1,
54
+ 1,
55
+ 1,
56
+ 1,
57
+ 1,
58
+ 0
59
+ ],
60
+ "success_rate": 0.32,
61
+ "intervention_rate": 0.0,
62
+ "non_base_selection_rate": 0.0,
63
+ "steps_to_first_reveal_or_access": 1.0,
64
+ "steps_to_retrieve": 4.0,
65
+ "disturbance_proxy": 0.0,
66
+ "episode_records": [
67
+ {
68
+ "episode_seed": 172000,
69
+ "success": false,
70
+ "steps": 4,
71
+ "first_reveal_step": 1,
72
+ "first_retrieve_step": null,
73
+ "episode_disturbance": 0.0
74
+ },
75
+ {
76
+ "episode_seed": 172001,
77
+ "success": false,
78
+ "steps": 4,
79
+ "first_reveal_step": 1,
80
+ "first_retrieve_step": null,
81
+ "episode_disturbance": 0.0
82
+ },
83
+ {
84
+ "episode_seed": 172002,
85
+ "success": false,
86
+ "steps": 4,
87
+ "first_reveal_step": 1,
88
+ "first_retrieve_step": null,
89
+ "episode_disturbance": 0.0
90
+ },
91
+ {
92
+ "episode_seed": 172003,
93
+ "success": true,
94
+ "steps": 1,
95
+ "first_reveal_step": 1,
96
+ "first_retrieve_step": null,
97
+ "episode_disturbance": 0.0
98
+ },
99
+ {
100
+ "episode_seed": 172004,
101
+ "success": false,
102
+ "steps": 4,
103
+ "first_reveal_step": 1,
104
+ "first_retrieve_step": null,
105
+ "episode_disturbance": 0.0
106
+ },
107
+ {
108
+ "episode_seed": 172005,
109
+ "success": false,
110
+ "steps": 4,
111
+ "first_reveal_step": 1,
112
+ "first_retrieve_step": null,
113
+ "episode_disturbance": 0.0
114
+ },
115
+ {
116
+ "episode_seed": 172006,
117
+ "success": false,
118
+ "steps": 4,
119
+ "first_reveal_step": 1,
120
+ "first_retrieve_step": null,
121
+ "episode_disturbance": 0.0
122
+ },
123
+ {
124
+ "episode_seed": 172007,
125
+ "success": false,
126
+ "steps": 4,
127
+ "first_reveal_step": 1,
128
+ "first_retrieve_step": null,
129
+ "episode_disturbance": 0.0
130
+ },
131
+ {
132
+ "episode_seed": 172008,
133
+ "success": false,
134
+ "steps": 4,
135
+ "first_reveal_step": 1,
136
+ "first_retrieve_step": null,
137
+ "episode_disturbance": 0.0
138
+ },
139
+ {
140
+ "episode_seed": 172009,
141
+ "success": true,
142
+ "steps": 1,
143
+ "first_reveal_step": 1,
144
+ "first_retrieve_step": null,
145
+ "episode_disturbance": 0.0
146
+ },
147
+ {
148
+ "episode_seed": 172010,
149
+ "success": true,
150
+ "steps": 1,
151
+ "first_reveal_step": 1,
152
+ "first_retrieve_step": null,
153
+ "episode_disturbance": 0.0
154
+ },
155
+ {
156
+ "episode_seed": 172011,
157
+ "success": false,
158
+ "steps": 4,
159
+ "first_reveal_step": 1,
160
+ "first_retrieve_step": null,
161
+ "episode_disturbance": 0.0
162
+ },
163
+ {
164
+ "episode_seed": 172012,
165
+ "success": false,
166
+ "steps": 4,
167
+ "first_reveal_step": 1,
168
+ "first_retrieve_step": null,
169
+ "episode_disturbance": 0.0
170
+ },
171
+ {
172
+ "episode_seed": 172013,
173
+ "success": true,
174
+ "steps": 1,
175
+ "first_reveal_step": 1,
176
+ "first_retrieve_step": null,
177
+ "episode_disturbance": 0.0
178
+ },
179
+ {
180
+ "episode_seed": 172014,
181
+ "success": false,
182
+ "steps": 4,
183
+ "first_reveal_step": 1,
184
+ "first_retrieve_step": null,
185
+ "episode_disturbance": 0.0
186
+ },
187
+ {
188
+ "episode_seed": 172015,
189
+ "success": false,
190
+ "steps": 4,
191
+ "first_reveal_step": 1,
192
+ "first_retrieve_step": null,
193
+ "episode_disturbance": 0.0
194
+ },
195
+ {
196
+ "episode_seed": 172016,
197
+ "success": false,
198
+ "steps": 4,
199
+ "first_reveal_step": 1,
200
+ "first_retrieve_step": null,
201
+ "episode_disturbance": 0.0
202
+ },
203
+ {
204
+ "episode_seed": 172017,
205
+ "success": false,
206
+ "steps": 4,
207
+ "first_reveal_step": 1,
208
+ "first_retrieve_step": null,
209
+ "episode_disturbance": 0.0
210
+ },
211
+ {
212
+ "episode_seed": 172018,
213
+ "success": false,
214
+ "steps": 4,
215
+ "first_reveal_step": 1,
216
+ "first_retrieve_step": null,
217
+ "episode_disturbance": 0.0
218
+ },
219
+ {
220
+ "episode_seed": 172019,
221
+ "success": false,
222
+ "steps": 4,
223
+ "first_reveal_step": 1,
224
+ "first_retrieve_step": null,
225
+ "episode_disturbance": 0.0
226
+ },
227
+ {
228
+ "episode_seed": 172020,
229
+ "success": false,
230
+ "steps": 4,
231
+ "first_reveal_step": 1,
232
+ "first_retrieve_step": null,
233
+ "episode_disturbance": 0.0
234
+ },
235
+ {
236
+ "episode_seed": 172021,
237
+ "success": false,
238
+ "steps": 4,
239
+ "first_reveal_step": 1,
240
+ "first_retrieve_step": null,
241
+ "episode_disturbance": 0.0
242
+ },
243
+ {
244
+ "episode_seed": 172022,
245
+ "success": false,
246
+ "steps": 4,
247
+ "first_reveal_step": 1,
248
+ "first_retrieve_step": null,
249
+ "episode_disturbance": 0.0
250
+ },
251
+ {
252
+ "episode_seed": 172023,
253
+ "success": false,
254
+ "steps": 4,
255
+ "first_reveal_step": 1,
256
+ "first_retrieve_step": null,
257
+ "episode_disturbance": 0.0
258
+ },
259
+ {
260
+ "episode_seed": 172024,
261
+ "success": true,
262
+ "steps": 1,
263
+ "first_reveal_step": 1,
264
+ "first_retrieve_step": null,
265
+ "episode_disturbance": 0.0
266
+ },
267
+ {
268
+ "episode_seed": 172025,
269
+ "success": true,
270
+ "steps": 1,
271
+ "first_reveal_step": 1,
272
+ "first_retrieve_step": null,
273
+ "episode_disturbance": 0.0
274
+ },
275
+ {
276
+ "episode_seed": 172026,
277
+ "success": false,
278
+ "steps": 4,
279
+ "first_reveal_step": 1,
280
+ "first_retrieve_step": null,
281
+ "episode_disturbance": 0.0
282
+ },
283
+ {
284
+ "episode_seed": 172027,
285
+ "success": false,
286
+ "steps": 4,
287
+ "first_reveal_step": 1,
288
+ "first_retrieve_step": null,
289
+ "episode_disturbance": 0.0
290
+ },
291
+ {
292
+ "episode_seed": 172028,
293
+ "success": false,
294
+ "steps": 4,
295
+ "first_reveal_step": 1,
296
+ "first_retrieve_step": null,
297
+ "episode_disturbance": 0.0
298
+ },
299
+ {
300
+ "episode_seed": 172029,
301
+ "success": false,
302
+ "steps": 4,
303
+ "first_reveal_step": 1,
304
+ "first_retrieve_step": null,
305
+ "episode_disturbance": 0.0
306
+ },
307
+ {
308
+ "episode_seed": 172030,
309
+ "success": false,
310
+ "steps": 4,
311
+ "first_reveal_step": 1,
312
+ "first_retrieve_step": null,
313
+ "episode_disturbance": 0.0
314
+ },
315
+ {
316
+ "episode_seed": 172031,
317
+ "success": true,
318
+ "steps": 1,
319
+ "first_reveal_step": 1,
320
+ "first_retrieve_step": null,
321
+ "episode_disturbance": 0.0
322
+ },
323
+ {
324
+ "episode_seed": 172032,
325
+ "success": true,
326
+ "steps": 2,
327
+ "first_reveal_step": 1,
328
+ "first_retrieve_step": null,
329
+ "episode_disturbance": 0.0
330
+ },
331
+ {
332
+ "episode_seed": 172033,
333
+ "success": true,
334
+ "steps": 1,
335
+ "first_reveal_step": 1,
336
+ "first_retrieve_step": null,
337
+ "episode_disturbance": 0.0
338
+ },
339
+ {
340
+ "episode_seed": 172034,
341
+ "success": false,
342
+ "steps": 4,
343
+ "first_reveal_step": 1,
344
+ "first_retrieve_step": null,
345
+ "episode_disturbance": 0.0
346
+ },
347
+ {
348
+ "episode_seed": 172035,
349
+ "success": false,
350
+ "steps": 4,
351
+ "first_reveal_step": 1,
352
+ "first_retrieve_step": null,
353
+ "episode_disturbance": 0.0
354
+ },
355
+ {
356
+ "episode_seed": 172036,
357
+ "success": false,
358
+ "steps": 4,
359
+ "first_reveal_step": 1,
360
+ "first_retrieve_step": null,
361
+ "episode_disturbance": 0.0
362
+ },
363
+ {
364
+ "episode_seed": 172037,
365
+ "success": true,
366
+ "steps": 1,
367
+ "first_reveal_step": 1,
368
+ "first_retrieve_step": null,
369
+ "episode_disturbance": 0.0
370
+ },
371
+ {
372
+ "episode_seed": 172038,
373
+ "success": false,
374
+ "steps": 4,
375
+ "first_reveal_step": 1,
376
+ "first_retrieve_step": null,
377
+ "episode_disturbance": 0.0
378
+ },
379
+ {
380
+ "episode_seed": 172039,
381
+ "success": false,
382
+ "steps": 4,
383
+ "first_reveal_step": 1,
384
+ "first_retrieve_step": null,
385
+ "episode_disturbance": 0.0
386
+ },
387
+ {
388
+ "episode_seed": 172040,
389
+ "success": false,
390
+ "steps": 4,
391
+ "first_reveal_step": 1,
392
+ "first_retrieve_step": null,
393
+ "episode_disturbance": 0.0
394
+ },
395
+ {
396
+ "episode_seed": 172041,
397
+ "success": true,
398
+ "steps": 1,
399
+ "first_reveal_step": 1,
400
+ "first_retrieve_step": null,
401
+ "episode_disturbance": 0.0
402
+ },
403
+ {
404
+ "episode_seed": 172042,
405
+ "success": false,
406
+ "steps": 4,
407
+ "first_reveal_step": 1,
408
+ "first_retrieve_step": null,
409
+ "episode_disturbance": 0.0
410
+ },
411
+ {
412
+ "episode_seed": 172043,
413
+ "success": false,
414
+ "steps": 4,
415
+ "first_reveal_step": 1,
416
+ "first_retrieve_step": null,
417
+ "episode_disturbance": 0.0
418
+ },
419
+ {
420
+ "episode_seed": 172044,
421
+ "success": true,
422
+ "steps": 1,
423
+ "first_reveal_step": 1,
424
+ "first_retrieve_step": null,
425
+ "episode_disturbance": 0.0
426
+ },
427
+ {
428
+ "episode_seed": 172045,
429
+ "success": true,
430
+ "steps": 1,
431
+ "first_reveal_step": 1,
432
+ "first_retrieve_step": null,
433
+ "episode_disturbance": 0.0
434
+ },
435
+ {
436
+ "episode_seed": 172046,
437
+ "success": true,
438
+ "steps": 1,
439
+ "first_reveal_step": 1,
440
+ "first_retrieve_step": null,
441
+ "episode_disturbance": 0.0
442
+ },
443
+ {
444
+ "episode_seed": 172047,
445
+ "success": true,
446
+ "steps": 1,
447
+ "first_reveal_step": 1,
448
+ "first_retrieve_step": null,
449
+ "episode_disturbance": 0.0
450
+ },
451
+ {
452
+ "episode_seed": 172048,
453
+ "success": true,
454
+ "steps": 1,
455
+ "first_reveal_step": 1,
456
+ "first_retrieve_step": null,
457
+ "episode_disturbance": 0.0
458
+ },
459
+ {
460
+ "episode_seed": 172049,
461
+ "success": false,
462
+ "steps": 4,
463
+ "first_reveal_step": 1,
464
+ "first_retrieve_step": null,
465
+ "episode_disturbance": 0.0
466
+ }
467
+ ],
468
+ "eval_protocol": {
469
+ "track_id": "bag_track",
470
+ "suite": "maniskill3",
471
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
472
+ "role": "target",
473
+ "eval_mode": "trunk_only_ft",
474
+ "seed": 17,
475
+ "episodes": 50,
476
+ "resolution": 224,
477
+ "cameras": [
478
+ "front",
479
+ "left",
480
+ "right"
481
+ ],
482
+ "observation_stack": "rgb_triplicate_zero_depth",
483
+ "action_horizon": 8,
484
+ "action_space": "widowx_delta_pose",
485
+ "same_test_episodes": true
486
+ },
487
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
488
+ "train_spec": {
489
+ "track_id": "bag_track",
490
+ "suite": "maniskill3",
491
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
492
+ "model_variant": "trunk_only_ft",
493
+ "seed": 17,
494
+ "train_demos": 32,
495
+ "val_demos": 8,
496
+ "init_checkpoint_group": "/workspace/workspace/VLAarchtests2/VLAarchtests/artifacts/outputs/r3d_handoff_phase/proxy_interaction_r3d_stage3_clip_rgbd_handoff_compact_phase_seed17/checkpoint_best.pt",
497
+ "optimizer": "adamw",
498
+ "learning_rate": 0.0001,
499
+ "lr_schedule": "constant",
500
+ "batch_size": 4,
501
+ "augmentations": "none",
502
+ "early_stopping_metric": "val_total",
503
+ "max_gradient_steps": 114,
504
+ "unfreeze_scope": "fusion_memory_decoder",
505
+ "dataset_split_id": "bag_bridge_smoke_v1_seed17",
506
+ "same_data_policy": true,
507
+ "same_init_policy": true
508
+ }
509
+ }
reports/maniskill_bag_bridge_smoke_v1/trunk_only_ft_seed23.json ADDED
@@ -0,0 +1,509 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "trunk_only_ft",
7
+ "episodes": 50,
8
+ "successes": [
9
+ 1,
10
+ 1,
11
+ 1,
12
+ 1,
13
+ 1,
14
+ 1,
15
+ 0,
16
+ 0,
17
+ 0,
18
+ 1,
19
+ 0,
20
+ 1,
21
+ 1,
22
+ 1,
23
+ 1,
24
+ 1,
25
+ 1,
26
+ 0,
27
+ 0,
28
+ 1,
29
+ 0,
30
+ 0,
31
+ 1,
32
+ 0,
33
+ 1,
34
+ 0,
35
+ 1,
36
+ 0,
37
+ 1,
38
+ 0,
39
+ 1,
40
+ 0,
41
+ 0,
42
+ 0,
43
+ 0,
44
+ 0,
45
+ 1,
46
+ 1,
47
+ 0,
48
+ 0,
49
+ 1,
50
+ 0,
51
+ 0,
52
+ 0,
53
+ 1,
54
+ 0,
55
+ 0,
56
+ 1,
57
+ 0,
58
+ 0
59
+ ],
60
+ "success_rate": 0.48,
61
+ "intervention_rate": 0.0,
62
+ "non_base_selection_rate": 0.0,
63
+ "steps_to_first_reveal_or_access": 1.0,
64
+ "steps_to_retrieve": 4.0,
65
+ "disturbance_proxy": 0.0,
66
+ "episode_records": [
67
+ {
68
+ "episode_seed": 172000,
69
+ "success": true,
70
+ "steps": 2,
71
+ "first_reveal_step": 1,
72
+ "first_retrieve_step": null,
73
+ "episode_disturbance": 0.0
74
+ },
75
+ {
76
+ "episode_seed": 172001,
77
+ "success": true,
78
+ "steps": 1,
79
+ "first_reveal_step": 1,
80
+ "first_retrieve_step": null,
81
+ "episode_disturbance": 0.0
82
+ },
83
+ {
84
+ "episode_seed": 172002,
85
+ "success": true,
86
+ "steps": 1,
87
+ "first_reveal_step": 1,
88
+ "first_retrieve_step": null,
89
+ "episode_disturbance": 0.0
90
+ },
91
+ {
92
+ "episode_seed": 172003,
93
+ "success": true,
94
+ "steps": 1,
95
+ "first_reveal_step": 1,
96
+ "first_retrieve_step": null,
97
+ "episode_disturbance": 0.0
98
+ },
99
+ {
100
+ "episode_seed": 172004,
101
+ "success": true,
102
+ "steps": 1,
103
+ "first_reveal_step": 1,
104
+ "first_retrieve_step": null,
105
+ "episode_disturbance": 0.0
106
+ },
107
+ {
108
+ "episode_seed": 172005,
109
+ "success": true,
110
+ "steps": 1,
111
+ "first_reveal_step": 1,
112
+ "first_retrieve_step": null,
113
+ "episode_disturbance": 0.0
114
+ },
115
+ {
116
+ "episode_seed": 172006,
117
+ "success": false,
118
+ "steps": 4,
119
+ "first_reveal_step": 1,
120
+ "first_retrieve_step": null,
121
+ "episode_disturbance": 0.0
122
+ },
123
+ {
124
+ "episode_seed": 172007,
125
+ "success": false,
126
+ "steps": 4,
127
+ "first_reveal_step": 1,
128
+ "first_retrieve_step": null,
129
+ "episode_disturbance": 0.0
130
+ },
131
+ {
132
+ "episode_seed": 172008,
133
+ "success": false,
134
+ "steps": 4,
135
+ "first_reveal_step": 1,
136
+ "first_retrieve_step": null,
137
+ "episode_disturbance": 0.0
138
+ },
139
+ {
140
+ "episode_seed": 172009,
141
+ "success": true,
142
+ "steps": 1,
143
+ "first_reveal_step": 1,
144
+ "first_retrieve_step": null,
145
+ "episode_disturbance": 0.0
146
+ },
147
+ {
148
+ "episode_seed": 172010,
149
+ "success": false,
150
+ "steps": 4,
151
+ "first_reveal_step": 1,
152
+ "first_retrieve_step": null,
153
+ "episode_disturbance": 0.0
154
+ },
155
+ {
156
+ "episode_seed": 172011,
157
+ "success": true,
158
+ "steps": 1,
159
+ "first_reveal_step": 1,
160
+ "first_retrieve_step": null,
161
+ "episode_disturbance": 0.0
162
+ },
163
+ {
164
+ "episode_seed": 172012,
165
+ "success": true,
166
+ "steps": 3,
167
+ "first_reveal_step": 1,
168
+ "first_retrieve_step": null,
169
+ "episode_disturbance": 0.0
170
+ },
171
+ {
172
+ "episode_seed": 172013,
173
+ "success": true,
174
+ "steps": 2,
175
+ "first_reveal_step": 1,
176
+ "first_retrieve_step": null,
177
+ "episode_disturbance": 0.0
178
+ },
179
+ {
180
+ "episode_seed": 172014,
181
+ "success": true,
182
+ "steps": 2,
183
+ "first_reveal_step": 1,
184
+ "first_retrieve_step": null,
185
+ "episode_disturbance": 0.0
186
+ },
187
+ {
188
+ "episode_seed": 172015,
189
+ "success": true,
190
+ "steps": 1,
191
+ "first_reveal_step": 1,
192
+ "first_retrieve_step": null,
193
+ "episode_disturbance": 0.0
194
+ },
195
+ {
196
+ "episode_seed": 172016,
197
+ "success": true,
198
+ "steps": 1,
199
+ "first_reveal_step": 1,
200
+ "first_retrieve_step": null,
201
+ "episode_disturbance": 0.0
202
+ },
203
+ {
204
+ "episode_seed": 172017,
205
+ "success": false,
206
+ "steps": 4,
207
+ "first_reveal_step": 1,
208
+ "first_retrieve_step": null,
209
+ "episode_disturbance": 0.0
210
+ },
211
+ {
212
+ "episode_seed": 172018,
213
+ "success": false,
214
+ "steps": 4,
215
+ "first_reveal_step": 1,
216
+ "first_retrieve_step": null,
217
+ "episode_disturbance": 0.0
218
+ },
219
+ {
220
+ "episode_seed": 172019,
221
+ "success": true,
222
+ "steps": 2,
223
+ "first_reveal_step": 1,
224
+ "first_retrieve_step": null,
225
+ "episode_disturbance": 0.0
226
+ },
227
+ {
228
+ "episode_seed": 172020,
229
+ "success": false,
230
+ "steps": 4,
231
+ "first_reveal_step": 1,
232
+ "first_retrieve_step": null,
233
+ "episode_disturbance": 0.0
234
+ },
235
+ {
236
+ "episode_seed": 172021,
237
+ "success": false,
238
+ "steps": 4,
239
+ "first_reveal_step": 1,
240
+ "first_retrieve_step": null,
241
+ "episode_disturbance": 0.0
242
+ },
243
+ {
244
+ "episode_seed": 172022,
245
+ "success": true,
246
+ "steps": 1,
247
+ "first_reveal_step": 1,
248
+ "first_retrieve_step": null,
249
+ "episode_disturbance": 0.0
250
+ },
251
+ {
252
+ "episode_seed": 172023,
253
+ "success": false,
254
+ "steps": 4,
255
+ "first_reveal_step": 1,
256
+ "first_retrieve_step": null,
257
+ "episode_disturbance": 0.0
258
+ },
259
+ {
260
+ "episode_seed": 172024,
261
+ "success": true,
262
+ "steps": 1,
263
+ "first_reveal_step": 1,
264
+ "first_retrieve_step": null,
265
+ "episode_disturbance": 0.0
266
+ },
267
+ {
268
+ "episode_seed": 172025,
269
+ "success": false,
270
+ "steps": 4,
271
+ "first_reveal_step": 1,
272
+ "first_retrieve_step": null,
273
+ "episode_disturbance": 0.0
274
+ },
275
+ {
276
+ "episode_seed": 172026,
277
+ "success": true,
278
+ "steps": 1,
279
+ "first_reveal_step": 1,
280
+ "first_retrieve_step": null,
281
+ "episode_disturbance": 0.0
282
+ },
283
+ {
284
+ "episode_seed": 172027,
285
+ "success": false,
286
+ "steps": 4,
287
+ "first_reveal_step": 1,
288
+ "first_retrieve_step": null,
289
+ "episode_disturbance": 0.0
290
+ },
291
+ {
292
+ "episode_seed": 172028,
293
+ "success": true,
294
+ "steps": 1,
295
+ "first_reveal_step": 1,
296
+ "first_retrieve_step": null,
297
+ "episode_disturbance": 0.0
298
+ },
299
+ {
300
+ "episode_seed": 172029,
301
+ "success": false,
302
+ "steps": 4,
303
+ "first_reveal_step": 1,
304
+ "first_retrieve_step": null,
305
+ "episode_disturbance": 0.0
306
+ },
307
+ {
308
+ "episode_seed": 172030,
309
+ "success": true,
310
+ "steps": 1,
311
+ "first_reveal_step": 1,
312
+ "first_retrieve_step": null,
313
+ "episode_disturbance": 0.0
314
+ },
315
+ {
316
+ "episode_seed": 172031,
317
+ "success": false,
318
+ "steps": 4,
319
+ "first_reveal_step": 1,
320
+ "first_retrieve_step": null,
321
+ "episode_disturbance": 0.0
322
+ },
323
+ {
324
+ "episode_seed": 172032,
325
+ "success": false,
326
+ "steps": 4,
327
+ "first_reveal_step": 1,
328
+ "first_retrieve_step": null,
329
+ "episode_disturbance": 0.0
330
+ },
331
+ {
332
+ "episode_seed": 172033,
333
+ "success": false,
334
+ "steps": 4,
335
+ "first_reveal_step": 1,
336
+ "first_retrieve_step": null,
337
+ "episode_disturbance": 0.0
338
+ },
339
+ {
340
+ "episode_seed": 172034,
341
+ "success": false,
342
+ "steps": 4,
343
+ "first_reveal_step": 1,
344
+ "first_retrieve_step": null,
345
+ "episode_disturbance": 0.0
346
+ },
347
+ {
348
+ "episode_seed": 172035,
349
+ "success": false,
350
+ "steps": 4,
351
+ "first_reveal_step": 1,
352
+ "first_retrieve_step": null,
353
+ "episode_disturbance": 0.0
354
+ },
355
+ {
356
+ "episode_seed": 172036,
357
+ "success": true,
358
+ "steps": 2,
359
+ "first_reveal_step": 1,
360
+ "first_retrieve_step": null,
361
+ "episode_disturbance": 0.0
362
+ },
363
+ {
364
+ "episode_seed": 172037,
365
+ "success": true,
366
+ "steps": 1,
367
+ "first_reveal_step": 1,
368
+ "first_retrieve_step": null,
369
+ "episode_disturbance": 0.0
370
+ },
371
+ {
372
+ "episode_seed": 172038,
373
+ "success": false,
374
+ "steps": 4,
375
+ "first_reveal_step": 1,
376
+ "first_retrieve_step": null,
377
+ "episode_disturbance": 0.0
378
+ },
379
+ {
380
+ "episode_seed": 172039,
381
+ "success": false,
382
+ "steps": 4,
383
+ "first_reveal_step": 1,
384
+ "first_retrieve_step": null,
385
+ "episode_disturbance": 0.0
386
+ },
387
+ {
388
+ "episode_seed": 172040,
389
+ "success": true,
390
+ "steps": 1,
391
+ "first_reveal_step": 1,
392
+ "first_retrieve_step": null,
393
+ "episode_disturbance": 0.0
394
+ },
395
+ {
396
+ "episode_seed": 172041,
397
+ "success": false,
398
+ "steps": 4,
399
+ "first_reveal_step": 1,
400
+ "first_retrieve_step": null,
401
+ "episode_disturbance": 0.0
402
+ },
403
+ {
404
+ "episode_seed": 172042,
405
+ "success": false,
406
+ "steps": 4,
407
+ "first_reveal_step": 1,
408
+ "first_retrieve_step": null,
409
+ "episode_disturbance": 0.0
410
+ },
411
+ {
412
+ "episode_seed": 172043,
413
+ "success": false,
414
+ "steps": 4,
415
+ "first_reveal_step": 1,
416
+ "first_retrieve_step": null,
417
+ "episode_disturbance": 0.0
418
+ },
419
+ {
420
+ "episode_seed": 172044,
421
+ "success": true,
422
+ "steps": 1,
423
+ "first_reveal_step": 1,
424
+ "first_retrieve_step": null,
425
+ "episode_disturbance": 0.0
426
+ },
427
+ {
428
+ "episode_seed": 172045,
429
+ "success": false,
430
+ "steps": 4,
431
+ "first_reveal_step": 1,
432
+ "first_retrieve_step": null,
433
+ "episode_disturbance": 0.0
434
+ },
435
+ {
436
+ "episode_seed": 172046,
437
+ "success": false,
438
+ "steps": 4,
439
+ "first_reveal_step": 1,
440
+ "first_retrieve_step": null,
441
+ "episode_disturbance": 0.0
442
+ },
443
+ {
444
+ "episode_seed": 172047,
445
+ "success": true,
446
+ "steps": 1,
447
+ "first_reveal_step": 1,
448
+ "first_retrieve_step": null,
449
+ "episode_disturbance": 0.0
450
+ },
451
+ {
452
+ "episode_seed": 172048,
453
+ "success": false,
454
+ "steps": 4,
455
+ "first_reveal_step": 1,
456
+ "first_retrieve_step": null,
457
+ "episode_disturbance": 0.0
458
+ },
459
+ {
460
+ "episode_seed": 172049,
461
+ "success": false,
462
+ "steps": 4,
463
+ "first_reveal_step": 1,
464
+ "first_retrieve_step": null,
465
+ "episode_disturbance": 0.0
466
+ }
467
+ ],
468
+ "eval_protocol": {
469
+ "track_id": "bag_track",
470
+ "suite": "maniskill3",
471
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
472
+ "role": "target",
473
+ "eval_mode": "trunk_only_ft",
474
+ "seed": 17,
475
+ "episodes": 50,
476
+ "resolution": 224,
477
+ "cameras": [
478
+ "front",
479
+ "left",
480
+ "right"
481
+ ],
482
+ "observation_stack": "rgb_triplicate_zero_depth",
483
+ "action_horizon": 8,
484
+ "action_space": "widowx_delta_pose",
485
+ "same_test_episodes": true
486
+ },
487
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
488
+ "train_spec": {
489
+ "track_id": "bag_track",
490
+ "suite": "maniskill3",
491
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
492
+ "model_variant": "trunk_only_ft",
493
+ "seed": 23,
494
+ "train_demos": 32,
495
+ "val_demos": 8,
496
+ "init_checkpoint_group": "/workspace/workspace/VLAarchtests2/VLAarchtests/artifacts/outputs/r3d_handoff_phase/proxy_interaction_r3d_stage3_clip_rgbd_handoff_compact_phase_seed17/checkpoint_best.pt",
497
+ "optimizer": "adamw",
498
+ "learning_rate": 0.0001,
499
+ "lr_schedule": "constant",
500
+ "batch_size": 4,
501
+ "augmentations": "none",
502
+ "early_stopping_metric": "val_total",
503
+ "max_gradient_steps": 114,
504
+ "unfreeze_scope": "fusion_memory_decoder",
505
+ "dataset_split_id": "bag_bridge_smoke_v1_dataset_seed17",
506
+ "same_data_policy": true,
507
+ "same_init_policy": true
508
+ }
509
+ }
reports/maniskill_bag_bridge_val_sweep_seed23/conservative/public_benchmark_package_summary.json ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "anchor_pass": true,
3
+ "available_tracks": [
4
+ "bag_track"
5
+ ],
6
+ "ci_above_zero_tracks": [],
7
+ "headline_pass": false,
8
+ "package_name": "public_reveal_retrieve_package_v1",
9
+ "sign_of_life_pass": false,
10
+ "sign_of_life_track_count": 0,
11
+ "sign_of_life_tracks": [],
12
+ "target_macro_average_delta": 0.0,
13
+ "tracks": {
14
+ "bag_track": {
15
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
16
+ "delta_active_vs_trunk": 0.0,
17
+ "delta_active_vs_trunk_ci95": [
18
+ -0.375,
19
+ 0.375
20
+ ],
21
+ "delta_noop_vs_trunk": 0.0,
22
+ "modes": {
23
+ "adapter_active_ft": {
24
+ "disturbance_proxy": 0.0,
25
+ "intervention_rate": 0.0,
26
+ "mean_success": 0.125,
27
+ "non_base_selection_rate": 0.0,
28
+ "num_runs": 1,
29
+ "steps_to_first_reveal_or_access": 1.0,
30
+ "steps_to_retrieve": 4.0
31
+ },
32
+ "adapter_noop": {
33
+ "disturbance_proxy": 0.0,
34
+ "intervention_rate": 0.0,
35
+ "mean_success": 0.125,
36
+ "non_base_selection_rate": 0.0,
37
+ "num_runs": 1,
38
+ "steps_to_first_reveal_or_access": 1.0,
39
+ "steps_to_retrieve": 4.0
40
+ },
41
+ "trunk_only_ft": {
42
+ "disturbance_proxy": 0.0,
43
+ "intervention_rate": 0.0,
44
+ "mean_success": 0.125,
45
+ "non_base_selection_rate": 0.0,
46
+ "num_runs": 1,
47
+ "steps_to_first_reveal_or_access": 1.0,
48
+ "steps_to_retrieve": 4.0
49
+ }
50
+ },
51
+ "notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
52
+ "public_source": "ManiSkill public scene proxy: PutEggplantInBasketScene-v1",
53
+ "role": "target",
54
+ "signs_of_life": false,
55
+ "suite": "maniskill3",
56
+ "target_behavior": "retrieve the target object from inside the basket and stage it outside the basket",
57
+ "task_family": "bag_retrieval_proxy"
58
+ }
59
+ }
60
+ }
reports/maniskill_bag_bridge_val_sweep_seed23/conservative/public_benchmark_package_summary.md ADDED
@@ -0,0 +1,15 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # ManiSkill Bag Retrieval Smoke Summary
2
+
3
+ - benchmark_task: PutEggplantInBasketRetrievalProxy-v1
4
+ - target_macro_average_delta: 0.000
5
+ - headline_pass: False
6
+ - sign_of_life_pass: False
7
+
8
+ ## bag_track
9
+ - delta_active_vs_trunk: 0.000
10
+ - delta_noop_vs_trunk: 0.000
11
+ - signs_of_life: False
12
+ - delta_active_vs_trunk_ci95: [-0.375, 0.375]
13
+ - trunk_only_ft: mean_success=0.125
14
+ - adapter_noop: mean_success=0.125
15
+ - adapter_active_ft: mean_success=0.125
reports/maniskill_bag_bridge_val_sweep_seed23/conservative/trunk_only_ft_seed23.json ADDED
@@ -0,0 +1,131 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "trunk_only_ft",
7
+ "episodes": 8,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 0,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 1
17
+ ],
18
+ "success_rate": 0.125,
19
+ "intervention_rate": 0.0,
20
+ "non_base_selection_rate": 0.0,
21
+ "steps_to_first_reveal_or_access": 1.0,
22
+ "steps_to_retrieve": 4.0,
23
+ "disturbance_proxy": 0.0,
24
+ "episode_records": [
25
+ {
26
+ "episode_seed": 171000,
27
+ "success": false,
28
+ "steps": 4,
29
+ "first_reveal_step": 1,
30
+ "first_retrieve_step": null,
31
+ "episode_disturbance": 0.0
32
+ },
33
+ {
34
+ "episode_seed": 171001,
35
+ "success": false,
36
+ "steps": 4,
37
+ "first_reveal_step": 1,
38
+ "first_retrieve_step": null,
39
+ "episode_disturbance": 0.0
40
+ },
41
+ {
42
+ "episode_seed": 171002,
43
+ "success": false,
44
+ "steps": 4,
45
+ "first_reveal_step": 1,
46
+ "first_retrieve_step": null,
47
+ "episode_disturbance": 0.0
48
+ },
49
+ {
50
+ "episode_seed": 171003,
51
+ "success": false,
52
+ "steps": 4,
53
+ "first_reveal_step": 1,
54
+ "first_retrieve_step": null,
55
+ "episode_disturbance": 0.0
56
+ },
57
+ {
58
+ "episode_seed": 171004,
59
+ "success": false,
60
+ "steps": 4,
61
+ "first_reveal_step": 1,
62
+ "first_retrieve_step": null,
63
+ "episode_disturbance": 0.0
64
+ },
65
+ {
66
+ "episode_seed": 171005,
67
+ "success": false,
68
+ "steps": 4,
69
+ "first_reveal_step": 1,
70
+ "first_retrieve_step": null,
71
+ "episode_disturbance": 0.0
72
+ },
73
+ {
74
+ "episode_seed": 171006,
75
+ "success": false,
76
+ "steps": 4,
77
+ "first_reveal_step": 1,
78
+ "first_retrieve_step": null,
79
+ "episode_disturbance": 0.0
80
+ },
81
+ {
82
+ "episode_seed": 171007,
83
+ "success": true,
84
+ "steps": 1,
85
+ "first_reveal_step": 1,
86
+ "first_retrieve_step": null,
87
+ "episode_disturbance": 0.0
88
+ }
89
+ ],
90
+ "eval_protocol": {
91
+ "track_id": "bag_track",
92
+ "suite": "maniskill3",
93
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
94
+ "role": "target",
95
+ "eval_mode": "trunk_only_ft",
96
+ "seed": 17,
97
+ "episodes": 8,
98
+ "resolution": 224,
99
+ "cameras": [
100
+ "front",
101
+ "left",
102
+ "right"
103
+ ],
104
+ "observation_stack": "rgb_triplicate_zero_depth",
105
+ "action_horizon": 8,
106
+ "action_space": "widowx_delta_pose",
107
+ "same_test_episodes": true
108
+ },
109
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
110
+ "train_spec": {
111
+ "track_id": "bag_track",
112
+ "suite": "maniskill3",
113
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
114
+ "model_variant": "trunk_only_ft",
115
+ "seed": 23,
116
+ "train_demos": 32,
117
+ "val_demos": 8,
118
+ "init_checkpoint_group": "/workspace/workspace/VLAarchtests2/VLAarchtests/artifacts/outputs/r3d_handoff_phase/proxy_interaction_r3d_stage3_clip_rgbd_handoff_compact_phase_seed17/checkpoint_best.pt",
119
+ "optimizer": "adamw",
120
+ "learning_rate": 0.0001,
121
+ "lr_schedule": "constant",
122
+ "batch_size": 4,
123
+ "augmentations": "none",
124
+ "early_stopping_metric": "val_total",
125
+ "max_gradient_steps": 114,
126
+ "unfreeze_scope": "fusion_memory_decoder",
127
+ "dataset_split_id": "bag_bridge_smoke_v1_dataset_seed17",
128
+ "same_data_policy": true,
129
+ "same_init_policy": true
130
+ }
131
+ }
reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/adapter_active_ft_seed23.json ADDED
@@ -0,0 +1,131 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "adapter_active_ft",
7
+ "episodes": 8,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 0,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 1
17
+ ],
18
+ "success_rate": 0.125,
19
+ "intervention_rate": 1.0,
20
+ "non_base_selection_rate": 1.0,
21
+ "steps_to_first_reveal_or_access": 1.0,
22
+ "steps_to_retrieve": 4.0,
23
+ "disturbance_proxy": 0.0,
24
+ "episode_records": [
25
+ {
26
+ "episode_seed": 171000,
27
+ "success": false,
28
+ "steps": 4,
29
+ "first_reveal_step": 1,
30
+ "first_retrieve_step": null,
31
+ "episode_disturbance": 0.0
32
+ },
33
+ {
34
+ "episode_seed": 171001,
35
+ "success": false,
36
+ "steps": 4,
37
+ "first_reveal_step": 1,
38
+ "first_retrieve_step": null,
39
+ "episode_disturbance": 0.0
40
+ },
41
+ {
42
+ "episode_seed": 171002,
43
+ "success": false,
44
+ "steps": 4,
45
+ "first_reveal_step": 1,
46
+ "first_retrieve_step": null,
47
+ "episode_disturbance": 0.0
48
+ },
49
+ {
50
+ "episode_seed": 171003,
51
+ "success": false,
52
+ "steps": 4,
53
+ "first_reveal_step": 1,
54
+ "first_retrieve_step": null,
55
+ "episode_disturbance": 0.0
56
+ },
57
+ {
58
+ "episode_seed": 171004,
59
+ "success": false,
60
+ "steps": 4,
61
+ "first_reveal_step": 1,
62
+ "first_retrieve_step": null,
63
+ "episode_disturbance": 0.0
64
+ },
65
+ {
66
+ "episode_seed": 171005,
67
+ "success": false,
68
+ "steps": 4,
69
+ "first_reveal_step": 1,
70
+ "first_retrieve_step": null,
71
+ "episode_disturbance": 0.0
72
+ },
73
+ {
74
+ "episode_seed": 171006,
75
+ "success": false,
76
+ "steps": 4,
77
+ "first_reveal_step": 1,
78
+ "first_retrieve_step": null,
79
+ "episode_disturbance": 0.0
80
+ },
81
+ {
82
+ "episode_seed": 171007,
83
+ "success": true,
84
+ "steps": 1,
85
+ "first_reveal_step": 1,
86
+ "first_retrieve_step": null,
87
+ "episode_disturbance": 0.0
88
+ }
89
+ ],
90
+ "eval_protocol": {
91
+ "track_id": "bag_track",
92
+ "suite": "maniskill3",
93
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
94
+ "role": "target",
95
+ "eval_mode": "adapter_active_ft",
96
+ "seed": 17,
97
+ "episodes": 8,
98
+ "resolution": 224,
99
+ "cameras": [
100
+ "front",
101
+ "left",
102
+ "right"
103
+ ],
104
+ "observation_stack": "rgb_triplicate_zero_depth",
105
+ "action_horizon": 8,
106
+ "action_space": "widowx_delta_pose",
107
+ "same_test_episodes": true
108
+ },
109
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
110
+ "train_spec": {
111
+ "track_id": "bag_track",
112
+ "suite": "maniskill3",
113
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
114
+ "model_variant": "adapter_active_ft",
115
+ "seed": 23,
116
+ "train_demos": 32,
117
+ "val_demos": 8,
118
+ "init_checkpoint_group": "/workspace/workspace/VLAarchtests2/VLAarchtests/artifacts/outputs/r3d_handoff_phase/proxy_interaction_r3d_stage3_clip_rgbd_handoff_compact_phase_seed17/checkpoint_best.pt",
119
+ "optimizer": "adamw",
120
+ "learning_rate": 0.0001,
121
+ "lr_schedule": "constant",
122
+ "batch_size": 4,
123
+ "augmentations": "none",
124
+ "early_stopping_metric": "val_total",
125
+ "max_gradient_steps": 114,
126
+ "unfreeze_scope": "fusion_memory_decoder",
127
+ "dataset_split_id": "bag_bridge_smoke_v1_dataset_seed17",
128
+ "same_data_policy": true,
129
+ "same_init_policy": true
130
+ }
131
+ }
reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/adapter_noop_seed23.json ADDED
@@ -0,0 +1,110 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "adapter_noop",
7
+ "episodes": 8,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 1,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 0
17
+ ],
18
+ "success_rate": 0.125,
19
+ "intervention_rate": 0.0,
20
+ "non_base_selection_rate": 0.0,
21
+ "steps_to_first_reveal_or_access": 1.0,
22
+ "steps_to_retrieve": 4.0,
23
+ "disturbance_proxy": 0.0,
24
+ "episode_records": [
25
+ {
26
+ "episode_seed": 171000,
27
+ "success": false,
28
+ "steps": 4,
29
+ "first_reveal_step": 1,
30
+ "first_retrieve_step": null,
31
+ "episode_disturbance": 0.0
32
+ },
33
+ {
34
+ "episode_seed": 171001,
35
+ "success": false,
36
+ "steps": 4,
37
+ "first_reveal_step": 1,
38
+ "first_retrieve_step": null,
39
+ "episode_disturbance": 0.0
40
+ },
41
+ {
42
+ "episode_seed": 171002,
43
+ "success": false,
44
+ "steps": 4,
45
+ "first_reveal_step": 1,
46
+ "first_retrieve_step": null,
47
+ "episode_disturbance": 0.0
48
+ },
49
+ {
50
+ "episode_seed": 171003,
51
+ "success": true,
52
+ "steps": 1,
53
+ "first_reveal_step": 1,
54
+ "first_retrieve_step": null,
55
+ "episode_disturbance": 0.0
56
+ },
57
+ {
58
+ "episode_seed": 171004,
59
+ "success": false,
60
+ "steps": 4,
61
+ "first_reveal_step": 1,
62
+ "first_retrieve_step": null,
63
+ "episode_disturbance": 0.0
64
+ },
65
+ {
66
+ "episode_seed": 171005,
67
+ "success": false,
68
+ "steps": 4,
69
+ "first_reveal_step": 1,
70
+ "first_retrieve_step": null,
71
+ "episode_disturbance": 0.0
72
+ },
73
+ {
74
+ "episode_seed": 171006,
75
+ "success": false,
76
+ "steps": 4,
77
+ "first_reveal_step": 1,
78
+ "first_retrieve_step": null,
79
+ "episode_disturbance": 0.0
80
+ },
81
+ {
82
+ "episode_seed": 171007,
83
+ "success": false,
84
+ "steps": 4,
85
+ "first_reveal_step": 1,
86
+ "first_retrieve_step": null,
87
+ "episode_disturbance": 0.0
88
+ }
89
+ ],
90
+ "eval_protocol": {
91
+ "track_id": "bag_track",
92
+ "suite": "maniskill3",
93
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
94
+ "role": "target",
95
+ "eval_mode": "adapter_noop",
96
+ "seed": 17,
97
+ "episodes": 8,
98
+ "resolution": 224,
99
+ "cameras": [
100
+ "front",
101
+ "left",
102
+ "right"
103
+ ],
104
+ "observation_stack": "rgb_triplicate_zero_depth",
105
+ "action_horizon": 8,
106
+ "action_space": "widowx_delta_pose",
107
+ "same_test_episodes": true
108
+ },
109
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone."
110
+ }
reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/public_benchmark_package_summary.json ADDED
@@ -0,0 +1,60 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "anchor_pass": true,
3
+ "available_tracks": [
4
+ "bag_track"
5
+ ],
6
+ "ci_above_zero_tracks": [],
7
+ "headline_pass": false,
8
+ "package_name": "public_reveal_retrieve_package_v1",
9
+ "sign_of_life_pass": false,
10
+ "sign_of_life_track_count": 0,
11
+ "sign_of_life_tracks": [],
12
+ "target_macro_average_delta": 0.0,
13
+ "tracks": {
14
+ "bag_track": {
15
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
16
+ "delta_active_vs_trunk": 0.0,
17
+ "delta_active_vs_trunk_ci95": [
18
+ -0.375,
19
+ 0.375
20
+ ],
21
+ "delta_noop_vs_trunk": 0.0,
22
+ "modes": {
23
+ "adapter_active_ft": {
24
+ "disturbance_proxy": 0.0,
25
+ "intervention_rate": 1.0,
26
+ "mean_success": 0.125,
27
+ "non_base_selection_rate": 1.0,
28
+ "num_runs": 1,
29
+ "steps_to_first_reveal_or_access": 1.0,
30
+ "steps_to_retrieve": 4.0
31
+ },
32
+ "adapter_noop": {
33
+ "disturbance_proxy": 0.0,
34
+ "intervention_rate": 0.0,
35
+ "mean_success": 0.125,
36
+ "non_base_selection_rate": 0.0,
37
+ "num_runs": 1,
38
+ "steps_to_first_reveal_or_access": 1.0,
39
+ "steps_to_retrieve": 4.0
40
+ },
41
+ "trunk_only_ft": {
42
+ "disturbance_proxy": 0.0,
43
+ "intervention_rate": 0.0,
44
+ "mean_success": 0.125,
45
+ "non_base_selection_rate": 0.0,
46
+ "num_runs": 1,
47
+ "steps_to_first_reveal_or_access": 1.0,
48
+ "steps_to_retrieve": 4.0
49
+ }
50
+ },
51
+ "notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
52
+ "public_source": "ManiSkill public scene proxy: PutEggplantInBasketScene-v1",
53
+ "role": "target",
54
+ "signs_of_life": false,
55
+ "suite": "maniskill3",
56
+ "target_behavior": "retrieve the target object from inside the basket and stage it outside the basket",
57
+ "task_family": "bag_retrieval_proxy"
58
+ }
59
+ }
60
+ }
reports/maniskill_bag_bridge_val_sweep_seed23/less_bonus/run.log ADDED
@@ -0,0 +1,76 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ CLIPModel LOAD REPORT from: openai/clip-vit-base-patch32
3
+ Key | Status | |
4
+ -------------------------------------+------------+--+-
5
+ vision_model.embeddings.position_ids | UNEXPECTED | |
6
+ text_model.embeddings.position_ids | UNEXPECTED | |
7
+
8
+ Notes:
9
+ - UNEXPECTED: can be ignored when loading from different task/architecture; not ok if you expect identical arch.
10
+ CLIPTextTransformer does not expose input embeddings. Gradients cannot flow back to the token embeddings when using adapters or gradient checkpointing. Override `get_input_embeddings` to fully support those features, or set `_input_embed_layer` to the attribute name that holds the embeddings.
11
+ /usr/local/lib/python3.11/dist-packages/torch/nn/modules/transformer.py:307: UserWarning: enable_nested_tensor is True, but self.use_nested_tensor is False because encoder_layer.norm_first was True
12
+ warnings.warn(f"enable_nested_tensor is True, but self.use_nested_tensor is False because {why_not_sparsity_fast_path}")
13
+ 2026-04-01 11:26:03,154 - mani_skill - WARNING - Mimic targets dictionary is missing for controller config for widowx250s_bridgedataset_sink. Assuming the first joint is the control joint and the second joint is the mimic joint
14
+ 2026-04-01 11:26:03,205 - mani_skill - WARNING - No initial pose set for actor builder of eggplant, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
15
+ 2026-04-01 11:26:03,217 - mani_skill - WARNING - No initial pose set for actor builder of dummy_sink_target_plane, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
16
+ 2026-04-01 11:26:03,863 - mani_skill - WARNING - Mimic targets dictionary is missing for controller config for widowx250s_bridgedataset_sink. Assuming the first joint is the control joint and the second joint is the mimic joint
17
+ 2026-04-01 11:26:03,866 - mani_skill - WARNING - No initial pose set for actor builder of eggplant, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
18
+ 2026-04-01 11:26:03,866 - mani_skill - WARNING - No initial pose set for actor builder of dummy_sink_target_plane, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
19
+ /usr/local/lib/python3.11/dist-packages/gymnasium/core.py:311: UserWarning: WARN: env.get_obs to get variables from other wrappers is deprecated and will be removed in v1.0, to get this variable you can do `env.unwrapped.get_obs` for environment variables or `env.get_wrapper_attr('get_obs')` that will search the reminding wrappers.
20
+ logger.warn(
21
+ /usr/local/lib/python3.11/dist-packages/gymnasium/core.py:311: UserWarning: WARN: env.get_info to get variables from other wrappers is deprecated and will be removed in v1.0, to get this variable you can do `env.unwrapped.get_info` for environment variables or `env.get_wrapper_attr('get_info')` that will search the reminding wrappers.
22
+ logger.warn(
23
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171000, "success": false, "steps": 4}
24
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171001, "success": false, "steps": 4}
25
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171002, "success": false, "steps": 4}
26
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171003, "success": false, "steps": 4}
27
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171004, "success": false, "steps": 4}
28
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171005, "success": false, "steps": 4}
29
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171006, "success": false, "steps": 4}
30
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "trunk_only_ft", "episode_seed": 171007, "success": true, "steps": 1}
31
+
32
+ CLIPModel LOAD REPORT from: openai/clip-vit-base-patch32
33
+ Key | Status | |
34
+ -------------------------------------+------------+--+-
35
+ vision_model.embeddings.position_ids | UNEXPECTED | |
36
+ text_model.embeddings.position_ids | UNEXPECTED | |
37
+
38
+ Notes:
39
+ - UNEXPECTED: can be ignored when loading from different task/architecture; not ok if you expect identical arch.
40
+ 2026-04-01 11:26:36,209 - mani_skill - WARNING - Mimic targets dictionary is missing for controller config for widowx250s_bridgedataset_sink. Assuming the first joint is the control joint and the second joint is the mimic joint
41
+ 2026-04-01 11:26:36,211 - mani_skill - WARNING - No initial pose set for actor builder of eggplant, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
42
+ 2026-04-01 11:26:36,211 - mani_skill - WARNING - No initial pose set for actor builder of dummy_sink_target_plane, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
43
+ 2026-04-01 11:26:36,429 - mani_skill - WARNING - Mimic targets dictionary is missing for controller config for widowx250s_bridgedataset_sink. Assuming the first joint is the control joint and the second joint is the mimic joint
44
+ 2026-04-01 11:26:36,431 - mani_skill - WARNING - No initial pose set for actor builder of eggplant, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
45
+ 2026-04-01 11:26:36,432 - mani_skill - WARNING - No initial pose set for actor builder of dummy_sink_target_plane, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
46
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171000, "success": false, "steps": 4}
47
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171001, "success": false, "steps": 4}
48
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171002, "success": false, "steps": 4}
49
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171003, "success": true, "steps": 1}
50
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171004, "success": false, "steps": 4}
51
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171005, "success": false, "steps": 4}
52
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171006, "success": false, "steps": 4}
53
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_noop", "episode_seed": 171007, "success": false, "steps": 4}
54
+
55
+ CLIPModel LOAD REPORT from: openai/clip-vit-base-patch32
56
+ Key | Status | |
57
+ -------------------------------------+------------+--+-
58
+ vision_model.embeddings.position_ids | UNEXPECTED | |
59
+ text_model.embeddings.position_ids | UNEXPECTED | |
60
+
61
+ Notes:
62
+ - UNEXPECTED: can be ignored when loading from different task/architecture; not ok if you expect identical arch.
63
+ 2026-04-01 11:27:03,215 - mani_skill - WARNING - Mimic targets dictionary is missing for controller config for widowx250s_bridgedataset_sink. Assuming the first joint is the control joint and the second joint is the mimic joint
64
+ 2026-04-01 11:27:03,217 - mani_skill - WARNING - No initial pose set for actor builder of eggplant, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
65
+ 2026-04-01 11:27:03,218 - mani_skill - WARNING - No initial pose set for actor builder of dummy_sink_target_plane, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
66
+ 2026-04-01 11:27:03,434 - mani_skill - WARNING - Mimic targets dictionary is missing for controller config for widowx250s_bridgedataset_sink. Assuming the first joint is the control joint and the second joint is the mimic joint
67
+ 2026-04-01 11:27:03,437 - mani_skill - WARNING - No initial pose set for actor builder of eggplant, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
68
+ 2026-04-01 11:27:03,437 - mani_skill - WARNING - No initial pose set for actor builder of dummy_sink_target_plane, setting to default pose q=[1,0,0,0], p=[0,0,0]. Not setting reasonable initial poses may slow down simulation, see https://github.com/haosulab/ManiSkill/issues/421.
69
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171000, "success": false, "steps": 4}
70
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171001, "success": false, "steps": 4}
71
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171002, "success": false, "steps": 4}
72
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171003, "success": false, "steps": 4}
73
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171004, "success": false, "steps": 4}
74
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171005, "success": false, "steps": 4}
75
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171006, "success": false, "steps": 4}
76
+ {"phase": "eval_episode_complete", "task": "bag", "adapter_mode": "adapter_active_ft", "episode_seed": 171007, "success": true, "steps": 1}
reports/maniskill_bag_bridge_val_sweep_seed23/low_bonus_high_thresh/adapter_active_ft_seed23.json ADDED
@@ -0,0 +1,131 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "track_id": "bag_track",
3
+ "suite": "maniskill3",
4
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
5
+ "role": "target",
6
+ "adapter_mode": "adapter_active_ft",
7
+ "episodes": 8,
8
+ "successes": [
9
+ 0,
10
+ 0,
11
+ 0,
12
+ 1,
13
+ 0,
14
+ 0,
15
+ 0,
16
+ 0
17
+ ],
18
+ "success_rate": 0.125,
19
+ "intervention_rate": 0.0,
20
+ "non_base_selection_rate": 0.0,
21
+ "steps_to_first_reveal_or_access": 1.0,
22
+ "steps_to_retrieve": 4.0,
23
+ "disturbance_proxy": 0.0,
24
+ "episode_records": [
25
+ {
26
+ "episode_seed": 171000,
27
+ "success": false,
28
+ "steps": 4,
29
+ "first_reveal_step": 1,
30
+ "first_retrieve_step": null,
31
+ "episode_disturbance": 0.0
32
+ },
33
+ {
34
+ "episode_seed": 171001,
35
+ "success": false,
36
+ "steps": 4,
37
+ "first_reveal_step": 1,
38
+ "first_retrieve_step": null,
39
+ "episode_disturbance": 0.0
40
+ },
41
+ {
42
+ "episode_seed": 171002,
43
+ "success": false,
44
+ "steps": 4,
45
+ "first_reveal_step": 1,
46
+ "first_retrieve_step": null,
47
+ "episode_disturbance": 0.0
48
+ },
49
+ {
50
+ "episode_seed": 171003,
51
+ "success": true,
52
+ "steps": 1,
53
+ "first_reveal_step": 1,
54
+ "first_retrieve_step": null,
55
+ "episode_disturbance": 0.0
56
+ },
57
+ {
58
+ "episode_seed": 171004,
59
+ "success": false,
60
+ "steps": 4,
61
+ "first_reveal_step": 1,
62
+ "first_retrieve_step": null,
63
+ "episode_disturbance": 0.0
64
+ },
65
+ {
66
+ "episode_seed": 171005,
67
+ "success": false,
68
+ "steps": 4,
69
+ "first_reveal_step": 1,
70
+ "first_retrieve_step": null,
71
+ "episode_disturbance": 0.0
72
+ },
73
+ {
74
+ "episode_seed": 171006,
75
+ "success": false,
76
+ "steps": 4,
77
+ "first_reveal_step": 1,
78
+ "first_retrieve_step": null,
79
+ "episode_disturbance": 0.0
80
+ },
81
+ {
82
+ "episode_seed": 171007,
83
+ "success": false,
84
+ "steps": 4,
85
+ "first_reveal_step": 1,
86
+ "first_retrieve_step": null,
87
+ "episode_disturbance": 0.0
88
+ }
89
+ ],
90
+ "eval_protocol": {
91
+ "track_id": "bag_track",
92
+ "suite": "maniskill3",
93
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
94
+ "role": "target",
95
+ "eval_mode": "adapter_active_ft",
96
+ "seed": 17,
97
+ "episodes": 8,
98
+ "resolution": 224,
99
+ "cameras": [
100
+ "front",
101
+ "left",
102
+ "right"
103
+ ],
104
+ "observation_stack": "rgb_triplicate_zero_depth",
105
+ "action_horizon": 8,
106
+ "action_space": "widowx_delta_pose",
107
+ "same_test_episodes": true
108
+ },
109
+ "proxy_notes": "Public ManiSkill bridge scene with custom retrieval initialization. The eggplant is placed inside the basket region and must be pulled back out to a staging zone.",
110
+ "train_spec": {
111
+ "track_id": "bag_track",
112
+ "suite": "maniskill3",
113
+ "benchmark_task": "PutEggplantInBasketRetrievalProxy-v1",
114
+ "model_variant": "adapter_active_ft",
115
+ "seed": 23,
116
+ "train_demos": 32,
117
+ "val_demos": 8,
118
+ "init_checkpoint_group": "/workspace/workspace/VLAarchtests2/VLAarchtests/artifacts/outputs/r3d_handoff_phase/proxy_interaction_r3d_stage3_clip_rgbd_handoff_compact_phase_seed17/checkpoint_best.pt",
119
+ "optimizer": "adamw",
120
+ "learning_rate": 0.0001,
121
+ "lr_schedule": "constant",
122
+ "batch_size": 4,
123
+ "augmentations": "none",
124
+ "early_stopping_metric": "val_total",
125
+ "max_gradient_steps": 114,
126
+ "unfreeze_scope": "fusion_memory_decoder",
127
+ "dataset_split_id": "bag_bridge_smoke_v1_dataset_seed17",
128
+ "same_data_policy": true,
129
+ "same_init_policy": true
130
+ }
131
+ }
reports/maniskill_bag_bridge_val_sweep_seed23/summary.json ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "default": {
3
+ "active_success": 0.0,
4
+ "trunk_success": 0.125,
5
+ "noop_success": 0.125,
6
+ "active_intervention_rate": 1.0,
7
+ "active_non_base_rate": 1.0
8
+ },
9
+ "less_bonus": {
10
+ "active_success": 0.125,
11
+ "trunk_success": 0.125,
12
+ "noop_success": 0.125,
13
+ "active_intervention_rate": 1.0,
14
+ "active_non_base_rate": 1.0
15
+ },
16
+ "conservative": {
17
+ "active_success": 0.125,
18
+ "trunk_success": 0.125,
19
+ "noop_success": 0.125,
20
+ "active_intervention_rate": 0.0,
21
+ "active_non_base_rate": 0.0
22
+ },
23
+ "low_bonus_high_thresh": {
24
+ "active_success": 0.125,
25
+ "trunk_success": 0.125,
26
+ "noop_success": 0.125,
27
+ "active_intervention_rate": 0.0,
28
+ "active_non_base_rate": 0.0
29
+ }
30
+ }
reports/proxy_rank_aligned_fast_eval/active/reveal_benchmark.json ADDED
The diff for this file is too large to render. See raw diff
 
reports/proxy_rank_aligned_fast_eval/active/reveal_benchmark.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Reveal Proxy Benchmark
2
+
3
+ ## adapter
4
+ - controller: model
5
+ - checkpoint: /workspace/workspace/outputs/adapter_proxy/proxy_adapter_wrapped_clip_rank_only_fast_seed17/checkpoint_best.pt
6
+ - episodes: 24.000
7
+ - mean_success: 0.000
8
+ - visibility_integral: 0.159
9
+ - corridor_availability: 0.002
10
+ - reocclusion_rate: 0.002
11
+ - disturbance_cost: 0.678
12
+ - premature_retrieve_rate: 0.827
13
+ - reocclusion_after_reveal_rate: 0.000
14
+ - planner_regret: 0.001
15
+ - foliage_success: 0.000
16
+ - bag_success: 0.000
17
+ - cloth_success: 0.000
reports/proxy_rank_aligned_fast_eval/noop/reveal_benchmark.json ADDED
The diff for this file is too large to render. See raw diff
 
reports/proxy_rank_aligned_fast_eval/noop/reveal_benchmark.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Reveal Proxy Benchmark
2
+
3
+ ## adapter
4
+ - controller: model
5
+ - checkpoint: /workspace/workspace/outputs/adapter_proxy/proxy_adapter_wrapped_clip_rank_only_fast_seed17/checkpoint_best.pt
6
+ - episodes: 24.000
7
+ - mean_success: 0.000
8
+ - visibility_integral: 0.160
9
+ - corridor_availability: 0.002
10
+ - reocclusion_rate: 0.002
11
+ - disturbance_cost: 0.676
12
+ - premature_retrieve_rate: 0.835
13
+ - reocclusion_after_reveal_rate: 0.000
14
+ - planner_regret: 0.046
15
+ - foliage_success: 0.000
16
+ - bag_success: 0.000
17
+ - cloth_success: 0.000
reports/proxy_semantic_nowm_quick12_rerun/reveal_benchmark.json ADDED
The diff for this file is too large to render. See raw diff
 
reports/proxy_semantic_nowm_quick12_rerun/reveal_benchmark.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Reveal Proxy Benchmark
2
+
3
+ ## patched
4
+ - controller: model
5
+ - checkpoint: /workspace/workspace/outputs/adapter_proxy/proxy_adapter_wrapped_clip_base_reuse128_seed17/checkpoint_best.pt
6
+ - episodes: 36.000
7
+ - mean_success: 0.222
8
+ - visibility_integral: 10.312
9
+ - corridor_availability: 0.303
10
+ - reocclusion_rate: 0.001
11
+ - disturbance_cost: 0.147
12
+ - premature_retrieve_rate: 0.649
13
+ - reocclusion_after_reveal_rate: 0.000
14
+ - planner_regret: 0.224
15
+ - foliage_success: 0.667
16
+ - bag_success: 0.000
17
+ - cloth_success: 0.000
reports/proxy_transition_fast_eval/active/reveal_benchmark.json ADDED
The diff for this file is too large to render. See raw diff
 
reports/proxy_transition_fast_eval/active/reveal_benchmark.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Reveal Proxy Benchmark
2
+
3
+ ## adapter
4
+ - controller: model
5
+ - checkpoint: /workspace/workspace/outputs/adapter_proxy/proxy_adapter_wrapped_clip_transition_fast_seed17/checkpoint_best.pt
6
+ - episodes: 24.000
7
+ - mean_success: 0.000
8
+ - visibility_integral: 0.158
9
+ - corridor_availability: 0.002
10
+ - reocclusion_rate: 0.002
11
+ - disturbance_cost: 0.689
12
+ - premature_retrieve_rate: 0.820
13
+ - reocclusion_after_reveal_rate: 0.000
14
+ - planner_regret: 0.001
15
+ - foliage_success: 0.000
16
+ - bag_success: 0.000
17
+ - cloth_success: 0.000
reports/proxy_transition_fast_eval/noop/reveal_benchmark.json ADDED
The diff for this file is too large to render. See raw diff
 
reports/proxy_transition_fast_eval/noop/reveal_benchmark.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Reveal Proxy Benchmark
2
+
3
+ ## adapter
4
+ - controller: model
5
+ - checkpoint: /workspace/workspace/outputs/adapter_proxy/proxy_adapter_wrapped_clip_transition_fast_seed17/checkpoint_best.pt
6
+ - episodes: 24.000
7
+ - mean_success: 0.000
8
+ - visibility_integral: 0.160
9
+ - corridor_availability: 0.002
10
+ - reocclusion_rate: 0.002
11
+ - disturbance_cost: 0.676
12
+ - premature_retrieve_rate: 0.835
13
+ - reocclusion_after_reveal_rate: 0.000
14
+ - planner_regret: 0.046
15
+ - foliage_success: 0.000
16
+ - bag_success: 0.000
17
+ - cloth_success: 0.000