smangrul
/

xls-r-mr-model

Automatic Speech Recognition

mozilla-foundation/common_voice_8_0

Generated from Trainer

robust-speech-event

hf-asr-leaderboard

Model card Files Files and versions Community

smangrul commited on Feb 5, 2022

Commit

72c6017

·

1 Parent(s): bd7db4e

Create README.md

Files changed (1) hide show

README.md +113 -0

README.md ADDED Viewed

	@@ -0,0 +1,113 @@

+---
+language:
+- mr
+license: apache-2.0
+tags:
+- automatic-speech-recognition
+- mozilla-foundation/common_voice_8_0
+- generated_from_trainer
+- robust-speech-event
+datasets:
+- common_voice
+model-index:
+- name: wav2vec2-large-xls-r-300m-mr
+  results:
+  - task:
+      type: automatic-speech-recognition
+      name: Speech Recognition
+    dataset:
+      type: mozilla-foundation/common_voice_8_0
+      name: Common Voice 8
+      args: mr
+    metrics:
+      - type: wer    # Required. Example: wer
+        value: 31.57  # Required. Example: 20.90
+        name: Test WER    # Optional. Example: Test WER
+      - name: Test CER
+        type: cer
+        value: 6.93
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+#
+This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - MR dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.494580
+- Wer: 0.395909
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 16
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 32
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 1000
+- num_epochs: 200.0
+- mixed_precision_training: Native AMP
+### Training results
+ [12991/14200 9:46:02 < 54:32, 0.37 it/s, Epoch 182.96/200]
+Step	Training Loss	Validation Loss	Wer
+400	3.794000	3.532227	1.000000
+800	3.362400	3.359044	1.000000
+1200	2.293900	1.011279	0.829924
+1600	1.233000	0.502743	0.593662
+2000	0.962600	0.412519	0.496992
+2400	0.831800	0.402903	0.493783
+2800	0.737000	0.389773	0.469314
+3200	0.677100	0.373987	0.436021
+3600	0.634400	0.383823	0.432010
+4000	0.586000	0.375610	0.419575
+4400	0.561000	0.387891	0.418371
+4800	0.518500	0.386357	0.417569
+5200	0.515300	0.415069	0.430004
+5600	0.478100	0.399211	0.408744
+6000	0.468100	0.424542	0.402327
+6400	0.439400	0.430979	0.410750
+6800	0.429600	0.427700	0.409146
+7200	0.400300	0.451111	0.419976
+7600	0.395100	0.463446	0.405134
+8000	0.381800	0.454752	0.407942
+8400	0.371500	0.461547	0.404733
+8800	0.362500	0.461543	0.411151
+9200	0.338200	0.468299	0.417168
+9600	0.338800	0.480989	0.412355
+10000	0.317600	0.475700	0.410750
+10400	0.315100	0.478920	0.403530
+10800	0.296200	0.480600	0.398315
+11200	0.299000	0.477083	0.393502
+11600	0.290000	0.465646	0.393903
+12000	0.290900	0.490041	0.405937
+12400	0.275600	0.489354	0.399519
+12800	0.272600	0.494580	0.395909
+### Framework versions
+- Transformers 4.17.0.dev0
+- Pytorch 1.10.2+cu113
+- Datasets 1.18.3.dev0
+- Tokenizers 0.11.0