| # Instruction for downloading data from the sft-data repository. |
|
|
| First, you would want to log in and access the huggingface data through using |
|
|
| ```py |
| from huggingface_hub import login |
| login() |
| ``` |
|
|
| Then, you could either download the zip file of the all the sft data folders, which would look like |
|
|
| ```py |
| from huggingface_hub import hf_hub_download |
| hf_hub_download(repo_id="LEVI-Project/sft-data", filename="sft-data.zip") |
| ``` |
|
|
| Notice that the `sft-data.zip` file above has the following structure: |
|
|
| ``` |
| sft-data |
| βββ README.md # This README file. |
| βββ alf # Folder for ALFWORLD. |
| β βββ alfworld.json # The JSON file for ALFWORLD. |
| β βββ alf_data_folder # Folder for the ALFWORLD environment. |
| β βββ alf_image_id_0 # Folder 0 for ALFWORLD image data. |
| β βββ alf_image_id_1 # Folder 1 for ALFWORLD image data. |
| β βββ alf_image_id_2 # Folder 2 for ALFWORLD image data. |
| β βββ alf_image_id_3 # Folder 3 for ALFWORLD image data. |
| β βββ alf_image_id_4 # Folder 4 for ALFWORLD image data. |
| βββ blackjack # Folder for blackjack environment in the `gym_cards`. |
| β βββ blackjack_data_folder # Folder for blackjack image data. |
| β βββ blackjack.json # The JSON file for blackjack. |
| βββ ezpoints # Folder for ezpoints environment in the `gym_cards`. |
| β βββ ezpoints_data_folder # Folder for ezpoints image data. |
| β βββ ezpoints.json # The JSON file for ezpoints. |
| βββ points24 # Folder for points24 environment in the `gym_cards`. |
| β βββ points24_data_folder # Folder for points24 image data. |
| β βββ points24.json # The JSON file for points24. |
| βββ numberline # Folder for numberline environment in the `gym_cards`. |
| βββ numberline_data_folder # Folder for numberline image data. |
| βββ numberline.json # The JSON file for numberline. |
| ``` |
|
|
|
|
| Also, you could choose to download the files for any environment out of the five ones. For example, you should be using the following code for downloading data from blackjack. |
|
|
| ```py |
| from huggingface_hub import hf_hub_download |
| hf_hub_download(repo_id="LEVI-Project/sft-data", filename="blackjack.zip") # zip folder for image data folder |
| hf_hub_download(repo_id="LEVI-Project/sft-data", filename="blackjack.json") # JSON file |
| ``` |
|
|
| For ALFWORLD, notice that the zip file for the image data folder is `alf_data_folder.zip`. |