wsagi/SmolVLA-PickOrange

@@ -16,7 +16,7 @@ language:
 base_model: lerobot/smolvla_base
 ---
-# SmolVLA2-PickOrange
 A [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy fine-tuned on the [LeIsaac SO-101 PickOrange](https://github.com/LightwheelAI/leisaac) task: 30k steps of self-run, full-parameter (LoRA-free) training starting from `lerobot/smolvla_base`.
@@ -25,7 +25,7 @@ _A fine-tuned [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy on t
 - [vitorcen/isaaclab-experience](https://github.com/vitorcen/isaaclab-experience): Isaac Lab + LeIsaac multi-policy comparison (parent project), including a 7-baseline benchmark
 - [vitorcen/LeIsaac-Training](https://github.com/vitorcen/LeIsaac-Training): LeIsaac fork (training scripts + design docs)
-> **Naming note**: the repo is named `SmolVLA2-PickOrange`, but `config.type=smolvla` (v1: SmolVLM2-500M-Video-Instruct backbone + Action Expert). At training time LeRobot had not merged `smolvla2` (with LoRA enabled) into main, so this is still v1. The name is a misnomer carried over from the directory naming.
 > _Despite the repo name, `config.type=smolvla` (v1). LeRobot's smolvla2 (with LoRA enabled) hadn't merged to main at training time; the "2" is carried over from the local output directory naming._
 ## TL;DR
@@ -39,7 +39,7 @@ _A fine-tuned [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy on t
   - See the benchmark section of `LeIsaac/README.md` in [`vitorcen/isaaclab-experience`](https://github.com/vitorcen/isaaclab-experience) for details
 - **⚠️ Inference configuration**:
   - `policy_action_horizon=50` (= chunk_size, receding window over the full chunk)
-  - On the LeRobot async server side: `--policy_checkpoint_path=wsagi/SmolVLA2-PickOrange`
   - `step_hz=30`, matching the dataset
 ## Model highlights
@@ -82,7 +82,7 @@ DISPLAY=:0 python -u LeIsaac/scripts/evaluation/policy_inference.py \
     --policy_type=lerobot-smolvla \
     --policy_host=127.0.0.1 --policy_port=8080 \
     --policy_action_horizon=50 \
-    --policy_checkpoint_path=wsagi/SmolVLA2-PickOrange \
     --policy_language_instruction='Pick up the orange and place it on the plate' \
     --device=cuda --enable_cameras
 ```
@@ -91,7 +91,7 @@ DISPLAY=:0 python -u LeIsaac/scripts/evaluation/policy_inference.py \
 ```python
 from lerobot.policies.smolvla.modeling_smolvla import SmolVLAPolicy
-policy = SmolVLAPolicy.from_pretrained("wsagi/SmolVLA2-PickOrange")
 # See the LeRobot documentation
 ```

 base_model: lerobot/smolvla_base
 ---
+# SmolVLA-PickOrange
 A [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy fine-tuned on the [LeIsaac SO-101 PickOrange](https://github.com/LightwheelAI/leisaac) task: 30k steps of self-run, full-parameter (LoRA-free) training starting from `lerobot/smolvla_base`.
 - [vitorcen/isaaclab-experience](https://github.com/vitorcen/isaaclab-experience): Isaac Lab + LeIsaac multi-policy comparison (parent project), including a 7-baseline benchmark
 - [vitorcen/LeIsaac-Training](https://github.com/vitorcen/LeIsaac-Training): LeIsaac fork (training scripts + design docs)
+> **Naming note**: the repo is named `SmolVLA-PickOrange`, but `config.type=smolvla` (v1: SmolVLM2-500M-Video-Instruct backbone + Action Expert). At training time LeRobot had not merged `smolvla2` (with LoRA enabled) into main, so this is still v1. The name is a misnomer carried over from the directory naming.
 > _Despite the repo name, `config.type=smolvla` (v1). LeRobot's smolvla2 (with LoRA enabled) hadn't merged to main at training time; the "2" is carried over from the local output directory naming._
 ## TL;DR
   - See the benchmark section of `LeIsaac/README.md` in [`vitorcen/isaaclab-experience`](https://github.com/vitorcen/isaaclab-experience) for details
 - **⚠️ Inference configuration**:
   - `policy_action_horizon=50` (= chunk_size, receding window over the full chunk)
+  - On the LeRobot async server side: `--policy_checkpoint_path=wsagi/SmolVLA-PickOrange`
   - `step_hz=30`, matching the dataset
 ## Model highlights
     --policy_type=lerobot-smolvla \
     --policy_host=127.0.0.1 --policy_port=8080 \
     --policy_action_horizon=50 \
+    --policy_checkpoint_path=wsagi/SmolVLA-PickOrange \
     --policy_language_instruction='Pick up the orange and place it on the plate' \
     --device=cuda --enable_cameras
 ```
 ```python
 from lerobot.policies.smolvla.modeling_smolvla import SmolVLAPolicy
+policy = SmolVLAPolicy.from_pretrained("wsagi/SmolVLA-PickOrange")
 # See the LeRobot documentation
 ```
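
The inference settings in the card (`policy_action_horizon=50`, `step_hz=30`) imply a fixed amount of wall-clock control per action chunk. The sketch below works through that arithmetic. It is plain Python and does not call LeRobot or LeIsaac; the helper names and the 600-step episode length are hypothetical, and it assumes that "receding window over the full chunk" means each 50-action chunk is executed in full before the policy is queried again.

```python
# Illustrative arithmetic only; not LeRobot or LeIsaac code.
POLICY_ACTION_HORIZON = 50  # from the card: equals chunk_size
STEP_HZ = 30                # from the card: matches the dataset rate


def chunk_duration_s(horizon: int, step_hz: int) -> float:
    """Seconds of control covered by one fully executed action chunk."""
    return horizon / step_hz


def num_policy_queries(total_steps: int, horizon: int) -> int:
    """Policy calls needed if every chunk is executed in full (ceiling division)."""
    return -(-total_steps // horizon)


if __name__ == "__main__":
    seconds = chunk_duration_s(POLICY_ACTION_HORIZON, STEP_HZ)
    print(f"one chunk covers about {seconds:.2f} s of control")
    # Hypothetical episode length, chosen only for illustration.
    print(num_policy_queries(600, POLICY_ACTION_HORIZON), "policy queries for 600 steps")
```

Under these assumptions one chunk spans roughly 1.67 s, so a smaller horizon would trade more frequent policy queries for faster reaction to scene changes.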