README.md · MERaLiON/MERaLiON-AudioLLM-it-dev at main

metadata

base_model:
  - openai/whisper-large-v2
datasets:
  - MERaLiON/MNSC
library_name: transformers
license: other
license_name: meralion-public-license
license_link: >-
  https://huggingface.co/MERaLiON/MERaLiON-AudioLLM-Whisper-SEA-LION/blob/main/MERaLiON-Public-Licence-v1.pdf
metrics:
  - bleu
  - wer
pipeline_tag: automatic-speech-recognition
tags:
  - vllm
  - LLM-as-a-Judge
  - chat
  - audio
  - safetensors
extra_gated_fields:
  First Name: text
  Last Name: text
  Company: text
  Country: country
  Job Title: text
  Specific date: date_picker
  I want to use this model for:
    type: select
    options:
      - Research
      - Education
      - label: Other
        value: other
  I agree to use this model according to MERaLiON-Public-License-v1: checkbox

Disclaimer

The current MERaLiON-AudioLLM has not been specifically aligned for safety and may generate content that is inappropriate, offensive, or harmful. Developers and users are responsible for performing their own safety fine-tuning and implementing necessary security measures. The authors shall not be held liable for any claims, damages, or other liabilities arising from the use of the released models, weights, or code.

Citation

If you find our work useful, please cite our paper:

@misc{he2024meralionaudiollmtechnicalreport,
      title={MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models}, 
      author={{MERaLiON Team}},
      year={2024},
      eprint={2412.09818},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2412.09818}, 
}