MPU (i.MX 8 and i.MX 9)
- Multiple speakers (language-dependent)
- Sampling frequency: 16 kHz or 22 kHz
- Languages: English, Mandarin Chinese, Hindi, Spanish, French, Arabic, Portuguese, Russian, German, Italian, Dutch, Catalan, Czech, Welsh, Danish, Greek, Farsi, Finnish, Hungarian, Indonesian, Icelandic, Georgian, Kazakh, Luxembourgish, Latvian, Malayalam, Nepali, Norwegian, Polish, Romanian, Slovak, Slovenian, Serbian, Swedish, Swahili, Turkish, Ukrainian, Vietnamese
- Natural speech quality
- The TTS model is based on the VITS architecture (Variational Inference with adversarial learning for end-to-end Text-to-Speech). Depending on the language, several speakers may be available, male or female, with different accents and tones of voice. The sampling frequency is 16 kHz or 22 kHz, depending on the model.
|