Stereo multi-speaker Telugu dialogue recordings with speaker diarization and emotion annotations
Languages
Telugu
Quality Check
100% Verified
Natural Telugu conversational recordings featuring spontaneous discussions and casual dialogues between native Telugu speakers from major Telugu-speaking regions. Stereo recordings with dedicated left/right audio channels per speaker for clean speaker separation—recorded via LiveKit with isolated per-participant tracks. Covers diverse conversation topics with representation from Telangana, Andhra Pradesh, and Rayalaseema dialect regions. Features speaker diarization, word-level timestamps, and per-utterance emotion labels. Ideal for training multi-speaker Telugu ASR, speaker extraction, and conversational AI models. Complements our Telugu Expressive TTS Voice dataset for single-speaker content.
Stereo WAV files with L/R speaker separation (48kHz, 16-bit), JSON transcripts with speaker diarization labels, Word-level timestamps per speaker, Per-utterance emotion labels with confidence scores, Speaker metadata with region information
48kHz sample rate, 16-bit depth, 1536 kbps, stereo with L/R channel separation
JSON with speaker labels, word timestamps, emotion annotations