Welsh (English Accent) Dataset

High-Quality Welsh (English Accent) Conversational Speech Dataset for AI & Speech Models

Overview

Title

Welsh (English Accent) Language Dataset

Dataset Type

General Conversation

Description

Unscripted, synthetic telephone conversations between an “agent” and a “customer”. Approximate audio duration per conversation: 5-15 minutes.

Use Case

ASR, Virtual Assistant, Chatbot, Conversational AI, Speech Analytics, TTS, Language Modelling

Dataset Details

Total hours

278

Sample Rate

8 kHz

Audio Channel

Dual

Recording Platform

Desktop

Audio Format

.wav

Transcription Format

.json

WER (%)

5
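
The specifications above (8 kHz sample rate, dual channel, .wav) can be checked on a downloaded file before it is used for training. The following is a minimal sketch, assuming uncompressed PCM WAV files; the function name `split_dual_channel` is hypothetical, and the listing does not say which channel carries the agent and which the customer, so the code makes no such assignment.

```python
import wave

EXPECTED_SAMPLE_RATE = 8000  # "Sample Rate: 8 kHz" in the listing
EXPECTED_CHANNELS = 2        # "Audio Channel: Dual" in the listing


def split_dual_channel(path):
    """Validate a WAV file against the listed specs and split its channels.

    Returns a (first_channel, second_channel) tuple of raw PCM bytes.
    Assumption: the file is uncompressed PCM. The speaker-to-channel
    mapping is not documented in the listing and is not inferred here.
    """
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_SAMPLE_RATE:
            raise ValueError(f"unexpected sample rate: {wav.getframerate()} Hz")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            raise ValueError(f"unexpected channel count: {wav.getnchannels()}")
        width = wav.getsampwidth()  # bytes per sample, per channel
        frames = wav.readframes(wav.getnframes())

    # Samples are interleaved: [ch1][ch2][ch1][ch2]...
    frame_size = width * EXPECTED_CHANNELS
    first = bytearray()
    second = bytearray()
    for i in range(0, len(frames), frame_size):
        first += frames[i:i + width]
        second += frames[i + width:i + frame_size]
    return bytes(first), bytes(second)
```

Splitting the channels this way lets each side of the conversation be fed to a recognizer separately, which is usually the reason dual-channel call recordings are preferred over mixed mono.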

Dataset Demographics

Country

Welsh (English Accent)

Language

Welsh (English Accent)

Gender

Female 270, Male 324, Unknown 0

Number of Speakers

594

Age

18-50
