For best experience this site requires Javascript to be enabled. To enable on your browser, follow our accessibility instructions.

Click image to open expanded view

Our Text-to-Speech (TTS) software solution converts text into synthetic speech to enable AI assistant voice output, automated responses and improved user interaction across digital platforms and services.

Software Details

Diagram

Text To Speech

Get diagram PDF

Features

MPU (i.MX 8 and i.MX 9)

Multi-speakers (language-dependent)
Frequency sampling: 16 kHz or 22 kHz
Languages: English, Mandarin Chinese, Hindi, Spanish, French, Arabic, Portuguese, Russian, German, Italian, Dutch, Catalan, Czech, Welsh, Danish, Greek, Farsi, Finnish, Hungarian, Indonesian, Icelandic, Georgian, Kazakh, Luxembourgian, Latvian, Malayalam, Nepali, Norwegian, Polish, Romanian, Slovak, Slovenian, Serbian, Swedish, Swahili, Turkish, Ukrainian and Vietnamese
Natural speech quality
The TTS model is based on variational inference with adversarial learning for end-to-end text-to-speech paper (VITS) architecture. Depending on the language, there may be several speakers, male or female, different accents and tones of voice. The frequency sampling rate is 16 kHz or 22 kHz depending on the model

MCU (i.MX RT700)

Leverages NPU
Frequency sampling: 16 kHz
Language: English

Supported Devices

i.MX8M: i.MX 8M Family - Arm® Cortex®-A53, Cortex-M4, Audio, Voice and Video

i.MX91: Secure, Energy-Efficient Linux Capabilities for Smart Home and Industrial Automation

i.MX93: i.MX 93 Applications Processor Family – Arm® Cortex®-A55, ML Acceleration, Power Efficient MPU

i.MX95: i.MX 95 Applications Processor Family: High-Performance, Safety Enabled Platform with eIQ® Neutron NPU

i.MX-RT700: i.MX RT700 Crossover MCU with Arm® Cortex®-M33, NPU, DSP and GPU Cores

System Requirements

MCU: TensorFlow Lite
MPU: ONNX

Footprint and Profiling

Model	Model size [parameters]	Platform / Core	Real Time Factor	DNS-MOS	Quantization format	Code size [MB]	Library dependency
MPU	~20M	i.MX 95 / 6x Cortex-A55	0.26	4.35	8-bit int (linear layers)	~23	ONNX
-	-	i.MX 8MP / 4x Cortex-A53	0.52	-	-	-	-
-	-	i.MX 8MM / 4x Cortex-A53	0.59	-	-	-	-
-	-	i.MX 8MN / 4x Cortex-A53	0.62	-	-	-	-
-	-	i.MX 91 / 1x Cortex-A55	0.97	-	-	-	-

MCU	~2.2M	Cortex-M33/NPU	0.24	4.17	8-bit int + 16a_8w int	150 kB + 2.6 MB model size (and 1.7 MB + 2.6 MB SRAM)	TFLite

Design Resources

Hardware

Quick reference to our board types.

2 hardware offerings

Evaluation and Development Boards
i.MX 93 Evaluation Kit
Active

i.MX93EVK
Evaluation and Development Boards
i.MX 91 Evaluation Kit
Active

i.MX91EVK

Text-to-Speech

Software Details

Diagram

Text To Speech

Features

MPU (i.MX 8 and i.MX 9)

MCU (i.MX RT700)

Supported Devices

System Requirements

Footprint and Profiling

Design Resources

Hardware

Filter By

i.MX 93 Evaluation Kit
Active

i.MX 91 Evaluation Kit
Active

Filter By

Support

What do you need help with?

Suggested Links

Text-to-Speech

Software Details

Diagram

Text To Speech

Features

MPU (i.MX 8 and i.MX 9)

MCU (i.MX RT700)

Supported Devices

System Requirements

Footprint and Profiling

Design Resources

Hardware

Filter By

i.MX 93 Evaluation Kit Active

i.MX 91 Evaluation Kit Active

Filter By

Support

What do you need help with?

Suggested Links

i.MX 93 Evaluation Kit
Active

i.MX 91 Evaluation Kit
Active