ML for Audio Study Group - Text to Speech Deep Dive

This week we will do a deep dive into Text to Speech.

- Ask your questions and join the discussion on Discord (#ml-4-audio-study-group channel).
- Check out the GitHub repository of the project:

Vaibhav (VB) is a consultant turned student researcher at the University of Stuttgart, Germany. His current research is in the fields of Performance Prediction for NLP models and Speech Synthesis. He is also an active volunteer with EuroPython and Python DE.

Vatsal left the world of mathematics in 2017 to dive into Speech Synthesis soon after he came across the WaveNet paper. His research has focused on Normalising Flows, a particular kind of Deep Generative Model. At Amazon, he researched the deep-learning-based vocoding module that is used in production, and disentanglement in deep generative models for zero-shot speech generation (text-to-speech & voi