Tacotron 2, a human-like text-to-speech artificial intelligence system, has put Google among the front-runners in speech synthesis. The system generates computer speech that sounds much like a human voice. It also brings to mind the vision shared by Google CEO Sundar Pichai, who made it clear that Google would build a portfolio of ‘AI first’ products.
Tacotron 2 is said to have achieved a MOS (Mean Opinion Score) of 4.53, close to the 4.58 scored by professionally recorded human speech. It also pronounces a word differently depending on whether it is used as a noun or a verb. In the audio samples Google published, the label ‘gt’ stands for ground truth, the machine learning term for the real deal, which here means the human recording rather than the generated speech.
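
For readers wondering how a figure like 4.53 comes about: a Mean Opinion Score is, roughly, the average of the ratings listeners give on a 1 to 5 scale. The short Python sketch below shows that arithmetic. The function name `mean_opinion_score` and the sample ratings are made up for illustration; they are not Google's evaluation code or data.

```python
from statistics import mean


def mean_opinion_score(ratings):
    """Average a list of listener ratings given on a 1-5 scale."""
    if not ratings:
        raise ValueError("need at least one rating")
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("ratings must be between 1 and 5")
    return mean(ratings)


# Hypothetical ratings for one audio clip, not data from Google's study.
sample_ratings = [5, 4, 5, 4.5, 4]
score = mean_opinion_score(sample_ratings)
print(round(score, 2))  # prints 4.5
```

In a real listening test, many raters score many clips, and the reported MOS usually comes with a confidence interval, so small gaps such as 4.53 versus 4.58 should be read with that in mind.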