US English Singing Audio Dataset
Overview
Title
US English Singing Audio Datase
Dataset Type
Singing Audio
Description
- Singing audio collection & transcription
- Audio categories: Classical, Pop music, and Rhythmic
- Project Volume: Classical (6hr), Pop music (10hr), and Rhythmic (4hr)
Use Case
Classical (6hr), Pop music (10hr), and Rhythmic (4hr)
Data Set Details
Total hours
20
Sample Rate
48 kHz
Audio Channel
Mono
Recording Platform
Mobile App
Audio Format
.wav
Transcription Format
.json
WER (%)
5
Data Set Demographics
Country
US
Language
US English
Gender
Female 19%, Male 28%
Number of Speakers
–
Age
18-50
Featured Clients
Empowering teams to build world-leading AI products.
Can’t find what you are looking for?
New off-the-shelf datasets are being collected across all data types
Contact us now to let go of your audio/speech training data collection worries