YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
VLAarchTests Bench
This repository is a bundled benchmark workspace for future VLAarchTests model
iterations, with emphasis on RLBench2 take tray out of oven.
Included
- VLAarchTests benchmark code and generated public benchmark manifest
- Patched AnyBimanual RLBench runtime used to execute the public oven benchmark
- Patched PointFlowMatch +
diffusion_policysource snapshot used to executetake shoes out of box - Official
katefgroup/3d_flowmatch_actorPerAct2 checkpoint and public test data - Official PointFlowMatch
1717447341-indigo-quokkacheckpoint fortake_shoes_out_of_box - Public AnyBimanual LF baseline weights and comparison logs
- Verified benchmark reports:
- oven subset run:
9/10 - oven full official run:
95/100 = 0.95 - shoes GPU search: non-zero success verified before later simulator crash
- hybrid public benchmark smoke outputs
- oven subset run:
- DexGarmentLab benchmark-related validation scripts and validation logs
Key Result
The strongest public out-of-box checkpoint validated here is:
models/3dfa_peract2/3dfa_peract2.pth
Official oven result artifacts:
reports/3dfa_peract2_take_tray_out_of_oven_subset10/eval_after_official_ttm.jsonreports/3dfa_peract2_take_tray_out_of_oven_full100/eval.json
Shoes result artifacts:
reports/pointflowmatch_take_shoes_out_of_box_ep10_k50_gpu/summary.jsonreports/pointflowmatch_take_shoes_out_of_box_ep10_k50_gpu/run.log
Important Code Paths
code/VLAarchtests4/code/VLAarchtests2_code/VLAarchtests/code/reveal_vla_bimanual/eval/public_benchmark_package.pycode/VLAarchtests4/code/VLAarchtests2_code/VLAarchtests/code/reveal_vla_bimanual/eval/run_rlbench_hybrid_smoke.pythird_party/AnyBimanual/third_party/RLBench/rlbench/bimanual_tasks/bimanual_take_tray_out_of_oven.pythird_party/AnyBimanual/third_party/RLBench/rlbench/task_ttms/bimanual_take_tray_out_of_oven.ttmthird_party/PointFlowMatch/pfp/envs/rlbench_env.pythird_party/PointFlowMatch/pfp/policy/fm_policy.pyscripts/run_pointflowmatch_take_shoes_out_of_box.sh
External Dependencies Not Mirrored Here
- CoppeliaSim v4.1.0 binary runtime
- Local Python environments under
/workspace/envs - Full IsaacSim installation
- Full DexGarmentLab simulator assets beyond the benchmark-related scripts and logs
See docs/ENVIRONMENT_NOTES.md for the runtime notes used in this workspace.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support