Indian English Dataset
Overview
Title
Indian English Language Dataset
Dataset Type
Wake Word
Description
A collection of wake word (voice command, trigger word, keyphrase) recordings:
- 400 speakers
- 5 unique keyphrases per speaker
- 20 audio files per unique keyphrase
- 100 total recorded utterances per speaker
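The counts listed above can be cross-checked with a little arithmetic. The short Python sketch below uses only the figures stated in the description; the variable names are illustrative and are not part of the dataset.

```python
# Figures taken from the dataset description; names are illustrative only.
SPEAKERS = 400
KEYPHRASES_PER_SPEAKER = 5
AUDIO_FILES_PER_KEYPHRASE = 20

# Each speaker records every keyphrase the same number of times.
utterances_per_speaker = KEYPHRASES_PER_SPEAKER * AUDIO_FILES_PER_KEYPHRASE

# Total recordings across all speakers.
total_utterances = SPEAKERS * utterances_per_speaker

print(f"Utterances per speaker: {utterances_per_speaker}")
print(f"Total utterances: {total_utterances}")
```

The per-speaker result matches the 100 utterances stated in the list.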
Dataset Details
Total Audio Files
40,000
Sample Rate
16 kHz
Audio Channel
1 channel
Recording Platform
Mobile App
Audio Format
.wav
Transcription Format
.json
WER (%)
5
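As a minimal sketch, a delivered file could be checked against the listed sample rate and channel count with Python's standard `wave` module. The function name `check_wav` and the example path are hypothetical, and the dataset's actual folder layout is not described here.

```python
import wave

# Values taken from the details listed above.
EXPECTED_SAMPLE_RATE_HZ = 16_000
EXPECTED_CHANNELS = 1


def check_wav(path):
    """Return a list of mismatches between a .wav file and the listed specs.

    An empty list means the file matches the stated sample rate and
    channel count. The function name is hypothetical, for illustration.
    """
    problems = []
    with wave.open(path, "rb") as wav_file:
        sample_rate = wav_file.getframerate()
        channels = wav_file.getnchannels()
    if sample_rate != EXPECTED_SAMPLE_RATE_HZ:
        problems.append(f"sample rate is {sample_rate} Hz, expected {EXPECTED_SAMPLE_RATE_HZ} Hz")
    if channels != EXPECTED_CHANNELS:
        problems.append(f"{channels} channels, expected {EXPECTED_CHANNELS}")
    return problems


# Example usage (the path is a placeholder, not a real dataset file):
# print(check_wav("speaker_001/keyphrase_01/take_01.wav"))
```

Returning a list of mismatches rather than raising an error makes it easy to run the check over every file and collect a report.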
Dataset Demographics
Country
India
Language
Indian English
Gender
Female 50%, Male 50%, Unknown 10%
Number of Speakers
400
Age
18-50