Orpheus-3b-FT-AWQ

This is a quantised version of canopylabs/orpheus-3b-0.1-ft.

Orpheus is a high-performance Text-to-Speech model fine-tuned for natural, emotional speech synthesis. This repository hosts the 8-bit quantised version of the 3B parameter model, optimised for efficiency while maintaining high-quality output.

Model Description

Orpheus-3b-FT-AWQ is a 3-billion-parameter Text-to-Speech model that converts text input into natural-sounding speech, with support for multiple voices and emotional expressions. The model has been quantised to 8-bit (Q8_0) format for efficient inference, making it usable on consumer hardware.
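As a rough illustration of how a voice and inline emotion tags might be combined into a single text input, here is a minimal sketch. Everything specific in it is an assumption made for the example and is not confirmed by this card: the `voice: text` prompt layout, the angle-bracket tag syntax, the voice names and tag names in the two sets, and the helper `build_prompt` itself, which is hypothetical. Check the upstream canopylabs/orpheus-3b-0.1-ft documentation for the actual format.

```python
import re

# ASSUMPTION: placeholder voice names; this card does not list the 8 voices.
ASSUMED_VOICES = {"tara", "leo"}
# ASSUMPTION: placeholder emotion tags; this card only mentions laughter and sighs.
ASSUMED_TAGS = {"laugh", "sigh"}

# Matches inline tags written as <name>, e.g. <laugh>.
TAG_PATTERN = re.compile(r"<([a-z_]+)>")


def build_prompt(voice: str, text: str) -> str:
    """Hypothetical helper: prefix the text with a voice name and validate tags."""
    if voice not in ASSUMED_VOICES:
        raise ValueError(f"unknown voice: {voice!r}")
    unknown = [tag for tag in TAG_PATTERN.findall(text) if tag not in ASSUMED_TAGS]
    if unknown:
        raise ValueError(f"unknown emotion tags: {unknown}")
    # ASSUMPTION: the model expects "<voice>: <text>" as its input string.
    return f"{voice}: {text.strip()}"


prompt = build_prompt("tara", "That was unexpected <laugh> but I will take it.")
```

Validating tags before inference is a design choice for the sketch: an unrecognised tag would otherwise most likely be read aloud or ignored, which is harder to notice than an early error.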

Key features:

  • 8 distinct voice options with different characteristics
  • Support for emotion tags such as laughter and sighs
  • Optimised for CUDA acceleration on RTX GPUs
  • Produces high-quality 24 kHz mono audio
  • Fine-tuned for conversational naturalness
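
Since the output is described as 24 kHz mono audio, the following sketch shows one way such samples could be saved as a WAV file using only the Python standard library. It assumes the decoded audio is available as floats in the range [-1.0, 1.0] and that 16-bit PCM is an acceptable container format; neither detail is stated in this card, and `write_mono_wav` is a hypothetical helper name. A generated sine tone stands in for real model output.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 24_000  # 24 kHz, as stated in the feature list


def write_mono_wav(samples, target) -> None:
    """Write float samples as 16-bit mono PCM to a path or binary file object."""
    # ASSUMPTION: samples are floats in [-1.0, 1.0]; values outside are clipped.
    pcm = b"".join(
        struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767)) for s in samples
    )
    with wave.open(target, "wb") as wav:
        wav.setnchannels(1)       # mono
        wav.setsampwidth(2)       # 2 bytes per sample = 16-bit PCM (assumed)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(pcm)


# Stand-in for model output: one second of a 440 Hz tone.
tone = [0.3 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]
buffer = io.BytesIO()
write_mono_wav(tone, buffer)
```

Passing a file path instead of `buffer` writes the same data to disk.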