Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,33 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
---
|
| 4 |
+
# Instructions to Run the Model on Sample Dataset
|
| 5 |
+
|
| 6 |
+
## 1. Using Pegasus Model
|
| 7 |
+
- Open `code_pegasus.py`.
|
| 8 |
+
- Run the script. (We have provided the path to the sample data `9.txt`.)
|
| 9 |
+
- To test on other conversation files, provide the path to the desired data files.
|
| 10 |
+
*(You can find our sample dataset in the `Sample Dataset` directory.)*
|
| 11 |
+
|
| 12 |
+
## 2. Using BART-large Model
|
| 13 |
+
- Open `code_bart.py`.
|
| 14 |
+
- Run the script. (We have provided the path to the sample data `9.txt`.)
|
| 15 |
+
- To test on other conversation files, provide the path to the desired data files.
|
| 16 |
+
*(You can find our sample dataset in the `Sample Dataset` directory.)*
|
| 17 |
+
|
| 18 |
+
---
|
| 19 |
+
|
| 20 |
+
### Note:
|
| 21 |
+
The `Sample Dataset` includes:
|
| 22 |
+
- An anonymized our sample dataset named `conv_1`, `conv_2`, and `conv_3`.
|
| 23 |
+
- A Chinese language dataset translated to English named `conv_1001`, `conv_1002`, and `conv_1003` (D4).
|
| 24 |
+
|
| 25 |
+
---
|
| 26 |
+
|
| 27 |
+
### Important:
|
| 28 |
+
- This dataset is provided **only for research purposes**.
|
| 29 |
+
To get full access to the dataset, please fill out the Google form:
|
| 30 |
+
[Request Dataset Access](https://docs.google.com/forms/d/e/1FAIpQLSe_unAObNmRJjKrmluRC-UZZUTyKd9aujVLNUeTLgPtB-9BSw/viewform?usp=sf_link)
|
| 31 |
+
|
| 32 |
+
- For any queries related to dataset and model access, please contact:
|
| 33 |
+
**nilesh21@iiserb.ac.in**
|