|
--- |
|
base_model: |
|
- openai/whisper-large-v2 |
|
datasets: |
|
- MERaLiON/MNSC |
|
library_name: transformers |
|
license: other |
|
license_name: meralion-public-license |
|
license_link: https://huggingface.co/MERaLiON/MERaLiON-AudioLLM-Whisper-SEA-LION/blob/main/MERaLiON-Public-Licence-v1.pdf |
|
metrics: |
|
- bleu |
|
- wer |
|
pipeline_tag: automatic-speech-recognition |
|
tags: |
|
- vllm |
|
- LLM-as-a-Judge |
|
- chat |
|
- audio |
|
- safetensors |
|
|
|
extra_gated_fields: |
|
First Name: text |
|
Last Name: text |
|
Company: text |
|
Country: country |
|
Job Title: text |
|
Specific date: date_picker |
|
|
|
I want to use this model for: |
|
type: select |
|
options: |
|
- Research |
|
- Education |
|
- label: Other |
|
value: other |
|
|
|
I agree to use this model according to MERaLiON-Public-License-v1: checkbox |
|
--- |
|
|
|
|
|
## Disclaimer |
|
|
|
The current MERaLiON-AudioLLM has not been specifically aligned for safety and may generate content that is inappropriate, offensive, or harmful. Developers and users are responsible for performing their own safety fine-tuning and implementing necessary security measures. The authors shall not be held liable for any claims, damages, or other liabilities arising from the use of the released models, weights, or code. |
|
|
|
## Citation |
|
|
|
If you find our work useful, please cite our paper: |
|
|
|
``` |
|
@misc{he2024meralionaudiollmtechnicalreport, |
|
title={MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models}, |
|
author={{MERaLiON Team}}, |
|
year={2024}, |
|
eprint={2412.09818}, |
|
archivePrefix={arXiv}, |
|
primaryClass={cs.CL}, |
|
url={https://arxiv.org/abs/2412.09818}, |
|
} |
|
``` |