Professional single-speaker recordings with word-level timestamps and emotion annotations
Languages
English
Quality Check
100% Verified
Professional English monologue recordings featuring spontaneous free speech and formal presentations. Includes personal narratives, workplace communications, medical notes, educational talks, and motivational speeches. Features word-level timestamps and per-utterance emotion labels with confidence scores. Ideal for training ASR, TTS, emotion recognition, and voice cloning models.
WAV audio files (48kHz 24-bit mono), JSON transcripts with word-level timestamps, Per-utterance emotion labels with confidence scores, Speaker metadata
48kHz sample rate, 24-bit depth, 1152 kbps, mono
JSON with word timestamps, speaker labels, emotion annotations