Update sections/03_create_dataset.md
fix more issues with markdown conversion
sections/03_create_dataset.md
CHANGED
@@ -20,26 +20,23 @@ huggingface-cli login
 1. Click Name icon ⇒ [New → Dataset](https://huggingface.co/new)
 1. Fill out dataset name
 2. Navigate to "Files and Versions" → README.md
-3. Fill out the top Dataset Card metadata (you can come back and fill
+3. Fill out the top Dataset Card metadata (you can come back and fill out more details later)
 2. Web-workflow
 1. Edit READ.md directly in the browser
 2. upload/delete other files directly
 3. Add any data processing scripts/workflows for reproducibility
 1. `git clone https://huggingface.co/datasets/<username>/<repo-name>`
 2. create analysis folder structure, such as:
-
+```
 src/ \# scripts for data curation
-
 data/ \# stored raw data for processing/curation
-
 intermediate/ \# store processed/curated data for uploading
-
+```
 3. Add `.gitignore`
-
-data/
-
-
-
+```
+data/*
+intermediate/*
+```
 4. Use standard git workflow for modifying README.md and curation scripts
 
 #### Uploading data to HuggingFace
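For reviewers who want to try the layout this change documents, here is a minimal sketch of the folder structure and `.gitignore` steps. It is not part of the commit. The `repo_dir` variable and the `REPO_DIR` override are hypothetical names introduced for illustration; in practice the directory would be the one created by the `git clone` step, and the sketch falls back to a temporary directory so it can run without network access.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical location of the cloned dataset repo (assumption, not from the commit).
# Normally this is the directory created by:
#   git clone https://huggingface.co/datasets/<username>/<repo-name>
repo_dir="${REPO_DIR:-$(mktemp -d)}"

# Folder layout described in the section:
#   src/          scripts for data curation
#   data/         raw data for processing/curation
#   intermediate/ processed/curated data for uploading
mkdir -p "$repo_dir/src" "$repo_dir/data" "$repo_dir/intermediate"

# .gitignore entries as added in this change: keep the contents of the raw and
# intermediate data folders out of the git history.
printf '%s\n' 'data/*' 'intermediate/*' > "$repo_dir/.gitignore"

# Show the resulting top-level entries, including the dotfile.
ls -A "$repo_dir"
```

Because `data/*` and `intermediate/*` ignore everything inside those folders, only `src/`, `README.md`, and `.gitignore` would normally end up in commits made with the standard git workflow.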