metadata
base_model:
- openai/whisper-large-v2
datasets:
- MERaLiON/MNSC
library_name: transformers
license: other
license_name: meralion-public-license
license_link: >-
  https://huggingface.co/MERaLiON/MERaLiON-AudioLLM-Whisper-SEA-LION/blob/main/MERaLiON-Public-Licence-v1.pdf
metrics:
- bleu
- wer
pipeline_tag: automatic-speech-recognition
tags:
- vllm
- LLM-as-a-Judge
- chat
- audio
- safetensors
extra_gated_fields:
  First Name: text
  Last Name: text
  Company: text
  Country: country
  Job Title: text
  Specific date: date_picker
  I want to use this model for:
    type: select
    options:
      - Research
      - Education
      - label: Other
        value: other
  I agree to use this model according to MERaLiON-Public-License-v1: checkbox
Disclaimer
The current MERaLiON-AudioLLM has not been specifically aligned for safety and may generate content that is inappropriate, offensive, or harmful. Developers and users are responsible for performing their own safety fine-tuning and implementing necessary security measures. The authors shall not be held liable for any claims, damages, or other liabilities arising from the use of the released models, weights, or code.
Citation
If you find our work useful, please cite our paper:
@misc{he2024meralionaudiollmtechnicalreport,
  title={MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models},
  author={{MERaLiON Team}},
  year={2024},
  eprint={2412.09818},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2412.09818},
}