| Writing logs to /p/qdata/jm8wx/research/text_attacks/textattack/outputs/training/albert-base-v2-glue:wnli-2020-06-29-11:21/log.txt. |
| Loading [94mnlp[0m dataset [94mglue[0m, subset [94mwnli[0m, split [94mtrain[0m. |
| Loading [94mnlp[0m dataset [94mglue[0m, subset [94mwnli[0m, split [94mvalidation[0m. |
| Loaded dataset. Found: 2 labels: ([0, 1]) |
| Loading transformers AutoModelForSequenceClassification: albert-base-v2 |
| Tokenizing training data. (len: 635) |
| Tokenizing eval data (len: 71) |
| Loaded data and tokenized in 4.413618564605713s |
| Training model across 4 GPUs |
| ***** Running training ***** |
| Num examples = 635 |
| Batch size = 64 |
| Max sequence length = 256 |
| Num steps = 45 |
| Num epochs = 5 |
| Learning rate = 2e-05 |
| Eval accuracy: 59.154929577464785% |
| Best acc found. Saved model to /p/qdata/jm8wx/research/text_attacks/textattack/outputs/training/albert-base-v2-glue:wnli-2020-06-29-11:21/. |
| Eval accuracy: 47.88732394366197% |
| Eval accuracy: 45.07042253521127% |
| Eval accuracy: 47.88732394366197% |
| Eval accuracy: 50.70422535211267% |
| Saved tokenizer <textattack.models.tokenizers.auto_tokenizer.AutoTokenizer object at 0x7f9b70a4ba60> to /p/qdata/jm8wx/research/text_attacks/textattack/outputs/training/albert-base-v2-glue:wnli-2020-06-29-11:21/. |
| Wrote README to /p/qdata/jm8wx/research/text_attacks/textattack/outputs/training/albert-base-v2-glue:wnli-2020-06-29-11:21/README.md. |
| Wrote training args to /p/qdata/jm8wx/research/text_attacks/textattack/outputs/training/albert-base-v2-glue:wnli-2020-06-29-11:21/train_args.json. |
| Writing logs to /p/qdata/jm8wx/research/text_attacks/textattack/outputs/training/albert-base-v2-glue:wnli-2020-06-29-11:21/log.txt. |
| Loading [94mnlp[0m dataset [94mglue[0m, subset [94mwnli[0m, split [94mtrain[0m. |
| Loading [94mnlp[0m dataset [94mglue[0m, subset [94mwnli[0m, split [94mvalidation[0m. |
| Loaded dataset. Found: 2 labels: ([0, 1]) |
| Loading transformers AutoModelForSequenceClassification: albert-base-v2 |
| Tokenizing training data. (len: 635) |
| Tokenizing eval data (len: 71) |
| Loaded data and tokenized in 4.476848840713501s |
| Training model across 4 GPUs |
| ***** Running training ***** |
| Num examples = 635 |
| Batch size = 128 |
| Max sequence length = 256 |
| Num steps = 20 |
| Num epochs = 5 |
| Learning rate = 2e-05 |
|
|