Dari Dataset
Dari Dataset
Overview
Title
Dari Language Dataset
Dataset Type
General Conversation
Description
Unscripted, synthetic telephonic conversation between “speaker 1” and “speaker 2”, Approx Audio Duration (Range) 5-15 Minutes.
Use Case
Politics, current affairs, local news, religion, economics and finance, and tourism
Data Set Details
Total hours
100
Sample Rate
44 kHz
Audio Channel
Mono
Recording Platform
Mobile App
Audio Format
.wav
Transcription Format
.json
WER (%)
5
Data Set Demographics
Country
Afghanistan
Language
Dari
Age
18-50
Overview
Title
Dari Language Dataset
Dataset Type
TTS
Description
Single-utterance recordings, which tend to fall in the 5 to 30 second range.
Use Case
Politics, current affairs, local news, religion, economics and finance, and tourism
Data Set Details
Total hours
600
Sample Rate
16 kHz
Audio Channel
Mono
Recording Platform
Mobile App
Audio Format
.wav
Transcription Format
.json
WER (%)
5
Data Set Demographics
Country
Afghanistan
Language
Dari
Age
18-50
Featured Clients
Empowering teams to build world-leading AI products.
Can’t find what you are looking for?
New off-the-shelf datasets are being collected across all data types
Contact us now to let go of your audio/speech training data collection worries