Punjabi Dataset
ਪੰਜਾਬੀ ਡਾਟਾਸੈਟ
Overview
Title
Punjabi Language Dataset
Dataset Type
Call-Center
Description
Unscripted, synthetic telephonic conversation between “agent” and “customer”, Approx. Audio Duration (Range) 5-15 Minutes.
Use Case
ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
Total hours
60
Sample Rate
8 Khz
Audio Channel
Dual
Recording Platform
Desktop
Audio Format
.wav
Transcription Format
.json
WER (%)
5
Data Set Demographics
Country
India
Language
Punjabi
Gender
Male: 330, Female: 364 and Unknown: 0
Number of Speakers
694
Age
18-50
Overview
Title
Punjabi Language Dataset
Dataset Type
General Conversation
Description
Unscripted, synthetic telephonic conversation between “agent” and “customer”, Approx. Audio Duration (Range) 5-15 Minutes.
Use Case
ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
Total hours
100
Sample Rate
8 Khz
Audio Channel
Dual
Recording Platform
Desktop
Audio Format
.wav
Transcription Format
.json
WER (%)
5
Data Set Demographics
Country
India
Language
Punjabi
Gender
Male: 142, Female: 176 and Unknown: 0
Number of Speakers
318
Age
18-50
Overview
Title
Punjabi Language Dataset
Dataset Type
Media Audio
Description
Licensable Public domain audio/video files such as interviews, podcasts etc – 1 to 5 people. Approx. Audio Duration (Range) 15-60 minutes.
Use Case
ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
Total hours
40
Sample Rate
16 Khz
Audio Channel
Mono
Recording Platform
Web Sourcing
Audio Format
.wav
Transcription Format
.json
WER (%)
5
Data Set Demographics
Country
India
Language
Punjabi
Gender
Male: 37, Female: 7 and Unknown: 0
Number of Speakers
44
Age
18-50
Featured Clients
Empowering teams to build world-leading AI products.
Can’t find what you are looking for?
New off-the-shelf datasets are being collected across all data types
Contact us now to let go of your audio/speech training data collection worries