act model - 🧪 phosphobot training pipeline
- Dataset: edgarcancinoe/eval_soarm101_pick_cubes
- Wandb run id: None
Error Traceback
We faced an issue while training your model.
Training process failed with exit code 1:
response = fn(cfg, *args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/lerobot/src/lerobot/scripts/lerobot_train.py", line 399, in train
batch = next(dl_iter)
^^^^^^^^^^^^^
File "/workspace/lerobot/src/lerobot/datasets/utils.py", line 912, in cycle
yield next(iterator)
^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/dist-packages/accelerate/data_loader.py", line 589, in __iter__
next_batch = next(dataloader_iter)
^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/dist-packages/torch/utils/data/dataloader.py", line 740, in __next__
data = self._next_data()
^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/dist-packages/torch/utils/data/dataloader.py", line 1529, in _next_data
return self._process_data(data, worker_id)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/dist-packages/torch/utils/data/dataloader.py", line 1567, in _process_data
data.reraise()
File "/usr/local/lib/python3.12/dist-packages/torch/_utils.py", line 774, in reraise
raise RuntimeError(msg) from None
RuntimeError: Caught FileNotFoundError in DataLoader worker process 1.
Original Traceback (most recent call last):
File "/usr/local/lib/python3.12/dist-packages/torch/utils/data/_utils/worker.py", line 358, in _worker_loop
data = fetcher.fetch(index) # type: ignore[possibly-undefined]
^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/dist-packages/torch/utils/data/_utils/fetch.py", line 52, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
~~~~~~~~~~~~^^^^^
File "/workspace/lerobot/src/lerobot/datasets/lerobot_dataset.py", line 1067, in __getitem__
video_frames = self._query_videos(query_timestamps, ep_idx)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/lerobot/src/lerobot/datasets/lerobot_dataset.py", line 1030, in _query_videos
frames = decode_video_frames(video_path, shifted_query_ts, self.tolerance_s, self.video_backend)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/lerobot/src/lerobot/datasets/video_utils.py", line 71, in decode_video_frames
return decode_video_frames_torchvision(video_path, timestamps, tolerance_s, backend)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/workspace/lerobot/src/lerobot/datasets/video_utils.py", line 112, in decode_video_frames_torchvision
reader = torchvision.io.VideoReader(video_path, "video")
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/dist-packages/torchvision/io/video_reader.py", line 169, in __init__
self.container = av.open(src, metadata_errors="ignore")
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "av/container/core.pyx", line 417, in av.container.core.open
File "av/container/core.pyx", line 282, in av.container.core.Container.__cinit__
File "av/container/core.pyx", line 302, in av.container.core.Container.err_check
File "av/error.pyx", line 424, in av.error.err_check
av.error.FileNotFoundError: [Errno 2] No such file or directory: '/workspace/inputs/datasets/edgarcancinoe/eval_soarm101_pick_cubes/videos/observation.images.top/chunk-000/file-000.mp4'
Training parameters
{
"batch_size": 60,
"steps": 8000,
"save_freq": 5000
}
📖 Get Started: docs.phospho.ai
🤖 Get your robot: robots.phospho.ai