A super-simple speech recogniser

We make what is possibly the world’s simplest speech recognition system. It can only recognise two different words, but will help you understand the basic idea of pattern recognition using template matching. The templates are just pre-recorded words, with known labels. The features extracted are just two formant frequencies in the middle of the word, and the distance measure between unknown word and templates is simply the Euclidean distance.

Try it for yourself with these materials (a zip file containing the three waveforms used in this video).

Look out for later videos where we extend the idea of template matching to use sequences of features, when we will have to solve the problem of aligning the two words before comparing them.