Text To Speech

Features

MPU (i.MX 8 and i.MX 9)

  • Multi-speakers (language-dependent)
  • Sampling rate: 16 kHz or 22 kHz
  • Languages: English, Mandarin Chinese, Hindi, Spanish, French, Arabic, Portuguese, Russian, German, Italian, Dutch, Catalan, Czech, Welsh, Danish, Greek, Farsi, Finnish, Hungarian, Indonesian, Icelandic, Georgian, Kazakh, Luxembourgish, Latvian, Malayalam, Nepali, Norwegian, Polish, Romanian, Slovak, Slovenian, Serbian, Swedish, Swahili, Turkish, Ukrainian, Vietnamese
  • Natural speech quality
  • Model: based on the VITS architecture (Variational Inference with adversarial learning for end-to-end Text-to-Speech). Depending on the language, several speakers may be available, male or female, with different accents and tones of voice. The sampling rate is 16 kHz or 22 kHz, depending on the model
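
Because the output sampling rate differs by model, whatever consumes the synthesized audio has to be told the correct rate; audio produced at 22 kHz but played back as 16 kHz sounds slower and lower in pitch. The sketch below is not the NXP API. It assumes the engine returns mono 16-bit PCM samples (the sample format is not stated on this page) and uses only Python's standard `wave` module. The names `pcm16_to_wav`, `fake_tts_output` and `wav_info` are hypothetical, and 22 000 Hz is the nominal figure from the list above, so the exact rate should be taken from the model in use.

```python
import io
import math
import struct
import wave

# Nominal rates from the feature list; the exact value is model-dependent.
NOMINAL_RATES_HZ = (16_000, 22_000)


def pcm16_to_wav(samples, sample_rate_hz):
    """Wrap mono 16-bit PCM samples (ints in [-32768, 32767]) in a WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)               # assumption: mono output
        wav.setsampwidth(2)               # assumption: 16-bit samples
        wav.setframerate(sample_rate_hz)  # must match the model's rate
        wav.writeframes(struct.pack(f"<{len(samples)}h", *samples))
    return buf.getvalue()


def fake_tts_output(sample_rate_hz, duration_s=0.5, tone_hz=440.0):
    """Stand-in for synthesized speech: a sine tone generated at the given rate."""
    n = int(sample_rate_hz * duration_s)
    return [
        int(12000 * math.sin(2 * math.pi * tone_hz * i / sample_rate_hz))
        for i in range(n)
    ]


def wav_info(wav_bytes):
    """Read back the sample rate and frame count stored in the WAV header."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getframerate(), wav.getnframes()
```

Storing the rate in the container header, rather than passing it around separately, lets a player pick the right playback speed regardless of which model produced the audio.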

MCU (i.MX RT700)

  • Leverages NPU
  • Sampling rate: 16 kHz
  • Language: English
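
On a memory-constrained MCU, the 16 kHz output rate translates directly into audio buffer sizes. As a rough sizing aid, the arithmetic can be sketched as follows. This assumes mono 16-bit PCM output and a double-buffered playback scheme, neither of which is stated on this page, and the function names are illustrative only.

```python
SAMPLE_RATE_HZ = 16_000   # from the MCU feature list
BYTES_PER_SAMPLE = 2      # assumption: 16-bit PCM
CHANNELS = 1              # assumption: mono


def bytes_for_duration(duration_ms):
    """RAM needed to hold duration_ms of audio at the rate above."""
    samples = SAMPLE_RATE_HZ * duration_ms // 1000
    return samples * BYTES_PER_SAMPLE * CHANNELS


def double_buffer_bytes(chunk_ms):
    """Two chunks: one being played while the next is being synthesized."""
    return 2 * bytes_for_duration(chunk_ms)
```

Under these assumptions, one second of speech occupies 32 000 bytes, and a pair of 20 ms chunks needs 1 280 bytes.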

Supported Devices

  • i.MX8M: i.MX 8M Family - Arm® Cortex®-A53, Cortex-M4, Audio, Voice, Video
  • i.MX91: Secure, Energy-Efficient Linux Capabilities for Smart Home and Industrial Automation
  • i.MX93: i.MX 93 Applications Processor Family – Arm® Cortex®-A55, ML Acceleration, Power Efficient MPU
  • i.MX95: i.MX 95 Applications Processor Family: High-Performance, Safety Enabled Platform with eIQ® Neutron NPU
  • i.MX RT700: i.MX RT700 Crossover MCU with Arm® Cortex®-M33, NPU, DSP and GPU Cores
