Orpheus 3B Chilean Spanish finetune

Orpheus TTS is a Llama-based Speech-LLM designed for high-quality, empathetic text-to-speech generation. This model was finetuned from canopylabs/3b-es_it-ft-research_release on ylacombe/google-argentinian-spanish to deliver human-level speech synthesis with a Chilean accent, aiming for clarity, expressiveness, and real-time streaming performance.

Model Details

Model Capabilities

  • Human-Like Speech: Natural intonation, emotion, and rhythm that are superior to SOTA closed-source models
  • Zero-Shot Voice Cloning: Clone voices without prior fine-tuning
  • Guided Emotion and Intonation: Control speech and emotion characteristics with simple tags
  • Low Latency: ~200 ms streaming latency for real-time applications, reducible to ~100 ms with input streaming
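The latency figures above refer to how quickly the first audio arrives in a streaming setup. A minimal sketch of how time-to-first-chunk could be measured for any chunk generator is shown below. `fake_tts_stream` is a hypothetical stand-in that yields silent PCM after a short sleep; it is not part of this model or of any Orpheus package, and the numbers it produces say nothing about the real model.

```python
import time
from typing import Iterable, Iterator, Optional, Tuple


def time_to_first_chunk(
    chunks: Iterable[bytes],
) -> Tuple[Optional[float], Iterator[bytes]]:
    """Return (seconds until the first chunk arrived, iterator over all chunks).

    The latency is None when the stream is empty.
    """
    start = time.perf_counter()
    it = iter(chunks)
    try:
        first = next(it)  # blocks until the producer yields its first chunk
    except StopIteration:
        return None, iter(())
    latency = time.perf_counter() - start

    def replay() -> Iterator[bytes]:
        # Hand back the first chunk too, so no audio is lost by measuring.
        yield first
        yield from it

    return latency, replay()


def fake_tts_stream(n_chunks: int = 3, delay_s: float = 0.05) -> Iterator[bytes]:
    """Hypothetical stand-in for a streaming TTS generator (silent 16-bit PCM)."""
    for _ in range(n_chunks):
        time.sleep(delay_s)
        yield b"\x00\x00" * 1024


latency, stream = time_to_first_chunk(fake_tts_stream())
audio = b"".join(stream)
print(f"time to first chunk: {latency * 1000:.0f} ms, {len(audio)} bytes total")
```

Swapping `fake_tts_stream()` for a real streaming generator would give a rough end-to-end figure that includes model load state, hardware, and network effects, so it is not directly comparable to the quoted values.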

Model Sources

Usage

See the Colab notebook (link to Colab) or the GitHub repository (link to GitHub) for how to run inference with our finetuned models.
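As a rough illustration of the last step of streaming inference, the sketch below writes a sequence of raw audio chunks to a WAV file using only the Python standard library. It assumes the chunks are 16-bit mono PCM at 24 kHz, which matches the upstream Orpheus example code but is not confirmed by this card. The helper name `write_pcm_stream_to_wav` is hypothetical, and the silent chunks stand in for real model output.

```python
import wave
from typing import Iterable


def write_pcm_stream_to_wav(
    chunks: Iterable[bytes], path: str, sample_rate: int = 24000
) -> float:
    """Write 16-bit mono PCM chunks to a WAV file; return duration in seconds."""
    sample_width = 2  # bytes per sample (16-bit); an assumption about the output
    channels = 1      # mono; an assumption about the output
    total_frames = 0
    with wave.open(path, "wb") as wf:
        wf.setnchannels(channels)
        wf.setsampwidth(sample_width)
        wf.setframerate(sample_rate)
        for chunk in chunks:
            # Chunks are written as they arrive, so this also works with a
            # generator that is still producing audio.
            wf.writeframes(chunk)
            total_frames += len(chunk) // (sample_width * channels)
    return total_frames / sample_rate


# Stand-in for model output: 4 chunks of 6000 silent samples each.
silent_chunks = [b"\x00\x00" * 6000 for _ in range(4)]
duration = write_pcm_stream_to_wav(silent_chunks, "output.wav")
print(f"wrote output.wav, {duration:.2f} s of audio")
```

In a real run, `silent_chunks` would be replaced by whatever chunk generator the inference code from the Colab or GitHub instructions provides.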

Model Misuse

Do not use our models for impersonation without consent, misinformation or deception (including fake news or fraudulent calls), or any illegal or harmful activity. By using this model, you agree to follow all applicable laws and ethical guidelines. We disclaim responsibility for any use.

Safetensors: model size 3.3B params, tensor type F32
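The listed size and tensor type allow a back-of-the-envelope estimate of the memory needed just to hold the weights. The sketch below multiplies the rounded parameter count (3.3B) by the bytes per parameter. The half-precision row is a hypothetical comparison, not a format this repository is stated to ship, and the estimate ignores activations, the KV cache, and framework overhead.

```python
def weight_memory_gb(n_params: float, bytes_per_param: int) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


N_PARAMS = 3.3e9  # rounded figure from the model listing

for dtype_name, width in [("F32", 4), ("F16/BF16 (hypothetical)", 2)]:
    print(f"{dtype_name}: ~{weight_memory_gb(N_PARAMS, width):.1f} GB")
```

Under these assumptions the F32 checkpoint needs roughly 13.2 GB for weights alone, and casting to a 16-bit type would halve that.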

Model tree for marianbasti/Llama-3.2-3B-Orpheus-Chilean-1795

Dataset used to train marianbasti/Llama-3.2-3B-Orpheus-Chilean-1795