Build your own Alexa with the ESP32 and TensorFlow Lite

We’ve been building towards this project in the previous set of videos. And we’re now ready to build our very own DIY Alexa! All the code for this project is on GitHub - What are we building - 1:15 Wake Word Detection - 2:27 Command Recognition - 11:47 Digging into the code - 16:30 What’s life all about Marvin? - 21:52 To detect the wake words we stream audio from either an I2S microphone or from the ADC. The wake word detector looks at a 1-second window of audio. The spectrogram of the audio is calculated and fed into a TensorFlow Lite model. Once we detect the wake word we stream the audio up to to recognise the user’s intent. It works surprisingly well for such a small model, there are improvements that could be made with more training data. I’ll leave the access token for live for as long as I can, but at some point, you will need to generate your own application.
Back to Top