Build your own Alexa with the ESP32 and TensorFlow Lite

We’ve been building towards this project in the previous set of videos. And we’re now ready to build our very own DIY Alexa! All the code for this project is on GitHub - What are we building - 1:15 Wake Word Detection - 2:27 Command Recognition - 11:47 Digging into the code - 16:30 What’s life all about Marvin? - 21:52 To detect the wake words we stream audio from either an I2S microphone or from the ADC. The wake word detector looks at a 1-second window of audio. The spectrogram of the audio is calculated and fed into a TensorFlow Lite model. Once we detect the wake word we stream the audio up to to recognise the user’s intent. It works surprisingly well for such a small model, there are improvements that could be made with more training data. I’ll leave the access token for live for as long as I can, but at some point, you will need to generate your own application. Let me know how you get on in the comments! Related Videos: Audio Input Audio Output And TensorFlow Lite for machine learning Components you could use: MAX98357 - TinyPico - INMP441 I2S Microphone: ICS-43434 I2S Microphone: ESP32 Dev board: Analogue Audio Amplifier: Speakers: --- Want to help support the channel? I’m accepting coffee on
Back to Top