A text-to-speech (TTS) system converts phonetic and orthographic transcriptions into artificial speech. It is part of natural language generation within natural language processing: a machine is programmed to synthesize speech that resembles natural human voices in pitch, tone, and duration. A speech-to-text system does the opposite, with speech recognition as its main function.

DID YOU KNOW?
The first machine to produce a human sound was created in 1779. Christian Gottlieb Kratzenstein built a model of the human vocal tract called the "vowel organ." It used resonance tubes connected to organ pipes with free reeds to produce five long vowel sounds.