Indian English Dataset

High-Quality Indian English Wake Word Dataset for AI & Speech Models

Overview

Title

Indian English Language Dataset

Dataset Type

Wake Word

Description

A collection of wake word, voice command, trigger word, and keyphrase recordings.

400 speakers
5 unique keyphrases per speaker
20 audio files per unique keyphrase
100 total recorded utterances per speaker
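The per-speaker figures above imply the total size of the collection. The following is a minimal sketch of that arithmetic in Python; the constant and function names are illustrative and not part of the dataset or any vendor tooling.

```python
# Figures taken from the dataset description above.
SPEAKERS = 400
KEYPHRASES_PER_SPEAKER = 5
FILES_PER_KEYPHRASE = 20


def utterances_per_speaker() -> int:
    """Recorded utterances contributed by a single speaker."""
    return KEYPHRASES_PER_SPEAKER * FILES_PER_KEYPHRASE


def total_utterances() -> int:
    """Recorded utterances across all speakers."""
    return SPEAKERS * utterances_per_speaker()


if __name__ == "__main__":
    print(utterances_per_speaker())  # 5 * 20 = 100
    print(total_utterances())        # 400 * 100 = 40000
```

The result of 40,000 recordings agrees with the audio count listed under Data Set Details.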

Data Set Details

Total Audio Files

40,000 audio files

Sample Rate

16 kHz

Audio Channel

1 channel

Recording Platform

Mobile App

Audio Format

.wav
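As a rough illustration, a received file can be checked against the listed sample rate and channel count with Python's standard wave module. This is only a sketch: the function name and the example path are hypothetical, and it assumes the files are uncompressed PCM .wav, which the listing does not state explicitly.

```python
import wave

# Values from the listing above: 16 kHz sample rate, 1 audio channel.
EXPECTED_SAMPLE_RATE_HZ = 16_000
EXPECTED_CHANNELS = 1


def matches_listed_format(path: str) -> bool:
    """Return True if the WAV file at `path` is 16 kHz and single-channel.

    Assumes uncompressed PCM; wave.open raises wave.Error otherwise.
    """
    with wave.open(path, "rb") as wav_file:
        return (
            wav_file.getframerate() == EXPECTED_SAMPLE_RATE_HZ
            and wav_file.getnchannels() == EXPECTED_CHANNELS
        )


if __name__ == "__main__":
    # "sample.wav" is a placeholder path, not a file shipped with the dataset.
    print(matches_listed_format("sample.wav"))
```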

Transcription Format

.json

WER (%)

Data Set Demographics

Country

India

Language

Indian English

Gender

Female 50%, Male 50%, Unknown 10%

Number of Speakers

400

Age

18-50


