Building a voice assistant model

With the rise of voice technology, this leading global provider of audio equipment wanted to develop an automatic speech recognition (ASR) model to test in their products. However, traditional data vendors did not offer the proper tools or a diverse enough crowd to represent their target base. With our global community of 210,000+ people and industry-leading enterprise portal, DefinedCrowd® was ideally equipped to serve their needs.

The client would need high-quality data to train an ASR system on everything from simple audio system commands like “repeat,” to fuller assistant requests like “find me a restaurant”, which could be spoken in a quiet home environment or a moving vehicle with background noise. The system would need to understand variations of the same request – such as “make it louder” or “turn it up”– as well as accents and other factors that influence people’s speech.

To achieve the right result, the quality of data used to train the ASR system would be paramount.