wsagi commited on
Commit
edb1d87
·
verified ·
1 Parent(s): 186b5a1

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -16,7 +16,7 @@ language:
16
  base_model: lerobot/smolvla_base
17
  ---
18
 
19
- # SmolVLA2-PickOrange
20
 
21
  针对 [LeIsaac SO-101 PickOrange](https://github.com/LightwheelAI/leisaac) 任务 LoRA-free 微调的 [SmolVLA](https://huggingface.co/lerobot/smolvla_base) 策略 — 自训 30k step。
22
  _A fine-tuned [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy on the [LeIsaac SO-101 PickOrange](https://github.com/LightwheelAI/leisaac) task, 30k steps full-parameter from `lerobot/smolvla_base`._
@@ -25,7 +25,7 @@ _A fine-tuned [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy on t
25
  - [vitorcen/isaaclab-experience](https://github.com/vitorcen/isaaclab-experience) — Isaac Lab + LeIsaac 多策略横评(parent project)— 含 7-baseline benchmark
26
  - [vitorcen/LeIsaac-Training](https://github.com/vitorcen/LeIsaac-Training) — LeIsaac fork(训练脚本 + 设计文档 / training scripts + design docs)
27
 
28
- > **命名注意 / Naming note**:仓库名是 `SmolVLA2-PickOrange` 但 `config.type=smolvla`(v1,SmolVLM2-500M-Video-Instruct backbone + Action Expert)。LeRobot 当时没把 `smolvla2`(带 LoRA on)merge 到 main,所以这里仍是 v1。命名是 dir 命名误称延续。
29
  > _Despite the repo name, `config.type=smolvla` (v1). LeRobot's smolvla2 (with LoRA enabled) hadn't merged to main at training time; the "2" is carried over from the local output directory naming._
30
 
31
  ## TL;DR
@@ -39,7 +39,7 @@ _A fine-tuned [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy on t
39
  - 详见 [`vitorcen/isaaclab-experience`](https://github.com/vitorcen/isaaclab-experience) 的 `LeIsaac/README.md` benchmark section
40
  - **⚠️ 推理 inference 配置**:
41
  - `policy_action_horizon=50`(= chunk_size,全 chunk receding window)
42
- - LeRobot async server 端 `--policy_checkpoint_path=wsagi/SmolVLA2-PickOrange`
43
  - `step_hz=30` 匹配 dataset
44
 
45
  ## 模型亮点
@@ -82,7 +82,7 @@ DISPLAY=:0 python -u LeIsaac/scripts/evaluation/policy_inference.py \
82
  --policy_type=lerobot-smolvla \
83
  --policy_host=127.0.0.1 --policy_port=8080 \
84
  --policy_action_horizon=50 \
85
- --policy_checkpoint_path=wsagi/SmolVLA2-PickOrange \
86
  --policy_language_instruction='Pick up the orange and place it on the plate' \
87
  --device=cuda --enable_cameras
88
  ```
@@ -91,7 +91,7 @@ DISPLAY=:0 python -u LeIsaac/scripts/evaluation/policy_inference.py \
91
 
92
  ```python
93
  from lerobot.policies.smolvla.modeling_smolvla import SmolVLAPolicy
94
- policy = SmolVLAPolicy.from_pretrained("wsagi/SmolVLA2-PickOrange")
95
  # 见 LeRobot 文档
96
  ```
97
 
 
16
  base_model: lerobot/smolvla_base
17
  ---
18
 
19
+ # SmolVLA-PickOrange
20
 
21
  针对 [LeIsaac SO-101 PickOrange](https://github.com/LightwheelAI/leisaac) 任务 LoRA-free 微调的 [SmolVLA](https://huggingface.co/lerobot/smolvla_base) 策略 — 自训 30k step。
22
  _A fine-tuned [SmolVLA](https://huggingface.co/lerobot/smolvla_base) policy on the [LeIsaac SO-101 PickOrange](https://github.com/LightwheelAI/leisaac) task, 30k steps full-parameter from `lerobot/smolvla_base`._
 
25
  - [vitorcen/isaaclab-experience](https://github.com/vitorcen/isaaclab-experience) — Isaac Lab + LeIsaac 多策略横评(parent project)— 含 7-baseline benchmark
26
  - [vitorcen/LeIsaac-Training](https://github.com/vitorcen/LeIsaac-Training) — LeIsaac fork(训练脚本 + 设计文档 / training scripts + design docs)
27
 
28
+ > **命名注意 / Naming note**:仓库名是 `SmolVLA-PickOrange` 但 `config.type=smolvla`(v1,SmolVLM2-500M-Video-Instruct backbone + Action Expert)。LeRobot 当时没把 `smolvla2`(带 LoRA on)merge 到 main,所以这里仍是 v1。命名是 dir 命名误称延续。
29
  > _Despite the repo name, `config.type=smolvla` (v1). LeRobot's smolvla2 (with LoRA enabled) hadn't merged to main at training time; the "2" is carried over from the local output directory naming._
30
 
31
  ## TL;DR
 
39
  - 详见 [`vitorcen/isaaclab-experience`](https://github.com/vitorcen/isaaclab-experience) 的 `LeIsaac/README.md` benchmark section
40
  - **⚠️ 推理 inference 配置**:
41
  - `policy_action_horizon=50`(= chunk_size,全 chunk receding window)
42
+ - LeRobot async server 端 `--policy_checkpoint_path=wsagi/SmolVLA-PickOrange`
43
  - `step_hz=30` 匹配 dataset
44
 
45
  ## 模型亮点
 
82
  --policy_type=lerobot-smolvla \
83
  --policy_host=127.0.0.1 --policy_port=8080 \
84
  --policy_action_horizon=50 \
85
+ --policy_checkpoint_path=wsagi/SmolVLA-PickOrange \
86
  --policy_language_instruction='Pick up the orange and place it on the plate' \
87
  --device=cuda --enable_cameras
88
  ```
 
91
 
92
  ```python
93
  from lerobot.policies.smolvla.modeling_smolvla import SmolVLAPolicy
94
+ policy = SmolVLAPolicy.from_pretrained("wsagi/SmolVLA-PickOrange")
95
  # 见 LeRobot 文档
96
  ```
97