This is model is a finetune of the openai/whisper-small model using approximately 750 hours of general conversational audio from Part 3 of the National Speech Corpus converted to CTranslate2 format for faster inference. These are the final results on the evaluation set (~95 hours of audio):
- Validation Loss: 0.386770
- WER: 14.257934
- Downloads last month
- 1
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for Xycone/faster-whisper-SGspeech-finetune
Base model
openai/whisper-small