Text to Speech: What is it and how does it work?

In essence, text to speech (TTS) is the generation of synthesized speech from text. It was primarily designed as an assistive technology to help individuals with hearing impairments, visual and learning disabilities, and older adults understand and consume content more easily. Today, the applications of TTS systems have grown considerably and range from content creation to voiceover generation to customer service. With the touch of a button, TTS can take words on a computer or other digital device and convert them into audio files. The technology is now used to create narration for explainer videos and product demos, turn books into audiobooks, and generate voiceovers for e-learning materials, training videos, ads and commercials, YouTube videos, and podcasts, among other things.

Text to speech software leverages AI and deep learning algorithms to process the written input and synthesize a spoken output. The software's text analysis component first breaks the written text down into individual words and phrases, then applies various rules and algorithms to determine the appropriate pronunciation, inflection, and emphasis for each word.
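
To make the text analysis step described above more concrete, here is a minimal Python sketch of what such a front end might do: split the text into word tokens, expand abbreviations and digits, look up a pronunciation for each word, and guess an intonation from the punctuation. Everything in it is an illustrative assumption rather than how any particular TTS product works: the function names, the tiny abbreviation table, the ARPAbet-style lexicon, and the intonation rule are all hypothetical, and production systems typically rely on far larger dictionaries and learned models.

```python
import re

# Hypothetical abbreviation table; a real system would use a much larger one.
ABBREVIATIONS = {"dr.": "doctor", "st.": "street", "etc.": "et cetera"}

# Hypothetical pronunciation lexicon (ARPAbet-style symbols), for illustration only.
LEXICON = {
    "doctor": ["D", "AA1", "K", "T", "ER0"],
    "smith": ["S", "M", "IH1", "TH"],
    "has": ["HH", "AE1", "Z"],
    "two": ["T", "UW1"],
    "cats": ["K", "AE1", "T", "S"],
}

DIGITS = {
    "0": "zero", "1": "one", "2": "two", "3": "three", "4": "four",
    "5": "five", "6": "six", "7": "seven", "8": "eight", "9": "nine",
}


def normalize(text):
    """Lowercase the text, expand abbreviations and digits, and return word tokens."""
    tokens = []
    for raw in text.lower().split():
        if raw in ABBREVIATIONS:
            tokens.append(ABBREVIATIONS[raw])
            continue
        # Strip punctuation, keeping letters, digits, and apostrophes.
        word = re.sub(r"[^a-z0-9']", "", raw)
        if not word:
            continue
        if word.isdigit():
            # Read numbers digit by digit (a deliberate simplification).
            tokens.extend(DIGITS[d] for d in word)
        else:
            tokens.append(word)
    return tokens


def to_phonemes(tokens):
    """Pair each token with its lexicon entry; unknown words are spelled letter by letter."""
    return [(token, LEXICON.get(token, list(token.upper()))) for token in tokens]


def intonation(sentence):
    """Crude rule of thumb: a question mark suggests rising pitch, anything else falling."""
    return "rising" if sentence.strip().endswith("?") else "falling"


if __name__ == "__main__":
    sentence = "Dr. Smith has 2 cats."
    words = normalize(sentence)
    print(words)  # ['doctor', 'smith', 'has', 'two', 'cats']
    for word, phonemes in to_phonemes(words):
        print(word, phonemes)
    print(intonation(sentence))  # falling
```

The output of a stage like this (phonemes plus simple prosody hints) is what a synthesis stage would then turn into audio. That second stage is where deep learning models generally do the heavy lifting, and it is not shown here.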