Irish Dataset
Tacar Sonraí Iris
Overview
Title
Irish Language Dataset
Dataset Type
General Conversation
Description
Unscripted telephonic conversation between two people. Approx. Audio Duration (Range) – 15-60 minutes.
Use Case
ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling
Data Set Details
Total hours
192
Sample Rate
8 kHz
Audio Channel
Dual
Recording Platform
Desktop
Audio Format
.wav
Transcription Format
.json
WER (%)
5
Data Set Demographics
Country
Ireland
Language
Irish
Gender
Female 213, Male 153, Unknown 0
Number of Speakers
366
Age
18-50
Featured Clients
Empowering teams to build world-leading AI products.
Can’t find what you are looking for?
New off-the-shelf datasets are being collected across all data types
Contact us now to let go of your audio/speech training data collection worries