Training in progress, epoch 1

Files changed (4) hide show

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [fav-kky/wav2vec2-base-cs-80k-ClTRUS](https://huggingface.co/fav-kky/wav2vec2-base-cs-80k-ClTRUS) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: inf
-- Wer: 1.0555
 ## Model description
@@ -38,28 +38,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.2
-- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Wer    |
-|:-------------:|:-------:|:----:|:---------------:|:------:|
-| 0.0           | 3.1949  | 500  | inf             | 1.0555 |
-| 0.0           | 6.3898  | 1000 | inf             | 1.0555 |
-| 0.0           | 9.5847  | 1500 | inf             | 1.0555 |
-| 0.0           | 12.7796 | 2000 | inf             | 1.0555 |
-| 0.0           | 15.9744 | 2500 | inf             | 1.0555 |
-| 0.0           | 19.1693 | 3000 | inf             | 1.0555 |
 ### Framework versions

 This model is a fine-tuned version of [fav-kky/wav2vec2-base-cs-80k-ClTRUS](https://huggingface.co/fav-kky/wav2vec2-base-cs-80k-ClTRUS) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: inf
+- Wer: 1.0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - gradient_accumulation_steps: 2
+- total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.3
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer |
+|:-------------:|:-----:|:----:|:---------------:|:---:|
+| 650.6179      | 4.0   | 500  | inf             | 1.0 |
 ### Framework versions

config.json CHANGED Viewed

@@ -42,9 +42,9 @@
     2,
     2
   ],
-  "ctc_loss_reduction": "sum",
   "ctc_zero_infinity": false,
-  "decoder_start_token_id": 50,
   "diversity_loss_weight": 0.1,
   "do_stable_layer_norm": false,
   "eos_token_id": 2,
@@ -77,7 +77,7 @@
   "num_hidden_layers": 6,
   "num_negatives": 100,
   "output_hidden_size": 768,
-  "pad_token_id": 49,
   "proj_codevector_dim": 256,
   "tdnn_dilation": [
     1,
@@ -103,6 +103,6 @@
   "torch_dtype": "float32",
   "transformers_version": "4.45.2",
   "use_weighted_layer_sum": false,
-  "vocab_size": 52,
   "xvector_output_dim": 512
 }

     2,
     2
   ],
+  "ctc_loss_reduction": "mean",
   "ctc_zero_infinity": false,
+  "decoder_start_token_id": 43,
   "diversity_loss_weight": 0.1,
   "do_stable_layer_norm": false,
   "eos_token_id": 2,
   "num_hidden_layers": 6,
   "num_negatives": 100,
   "output_hidden_size": 768,
+  "pad_token_id": 42,
   "proj_codevector_dim": 256,
   "tdnn_dilation": [
     1,
   "torch_dtype": "float32",
   "transformers_version": "4.45.2",
   "use_weighted_layer_sum": false,
+  "vocab_size": 45,
   "xvector_output_dim": 512
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:32c5df6b29386e3581bae9e108f81b3403c666e7490422494e155844b236d9ef
-size 65000248

 version https://git-lfs.github.com/spec/v1
+oid sha256:05674d5b7b603a3e51d220c23c3c645e44b702c50e7e57e90880be0a5c196dc8
+size 64989468

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:30ee42f67045bba106f47f0f6f5f062c3e5114c16ce8a08158710f7262318745
 size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:90038c0fc2656f331005268e5bcd90aad755a09be99dbe75ab03240634580813
 size 5176