skip to content
gigarouter gigarouter
models / text-to-speech · coming soon

MeloTTS Spanish

myshell-ai/MeloTTS-Spanish

published Feb 2024 · updated Mar 2024

MeloTTS Spanish is a text-to-speech model that generates high-quality Spanish speech from text.

status
coming soon
API providers
0
downloads / mo
71.3K
license
mit

specs

TaskText-to-Speech (TTS)
ArchitectureVITS-based (VITS, VITS2, Bert-VITS2)
LicenseMIT

about this model

myshell-ai/MeloTTS-Spanish is a text-to-speech model that produces natural, high-quality Spanish speech. It is part of the MeloTTS family developed by MyShell.ai in collaboration with MIT and Tsinghua University. The model supports CPU real-time inference, making it fast and accessible for production deployments without requiring a GPU.

Supported Languages

While the Spanish model is optimized for Spanish, the MeloTTS family covers multiple languages and accents, all accessible via the same architecture:

LanguageExample Audio
English (American)Listen
English (British)Listen
English (Indian)Listen
English (Australian)Listen
SpanishListen
FrenchListen
Chinese (mixed EN)Listen
JapaneseListen
KoreanListen

The Chinese model additionally supports mixed Chinese-English output. All models are fast enough for CPU real-time inference.

Community Adoption

The MeloTTS-Spanish model has been downloaded over 71,000 times in the past month and is used by 17 Hugging Face Spaces, reflecting active real-world deployment.

License and Attribution

Licensed under MIT, free for commercial and non-commercial use. The official citation is @software{zhao2024melo, author = {Zhao, Wenliang and Yu, Xumin and Qin, Zengyi}, title = {MeloTTS: High-quality Multi-lingual Multi-accent Text-to-Speech}, url = {https://github.com/myshell-ai/MeloTTS}, year = {2023}}.

best for

FAQ

What input does the model accept and what output does it produce?

It accepts Spanish text and produces a WAV audio file of synthesized speech.

Is this model suitable for real-time applications?

Yes, it is fast enough for CPU real-time inference.

What is the license for MeloTTS Spanish?

It is released under the MIT License, allowing both commercial and non-commercial use.

How do I call this model via the gigarouter API?

Use the OpenAI-compatible endpoint with your API key, sending the text and receiving audio in response.

Does this model support languages other than Spanish?

No, this specific model is trained only for Spanish. Other language variants are available separately.

not yet live

We're benchmarking and onboarding MeloTTS Spanish as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related text-to-speech models

compare all →