Dataset Catalog

Curated datasets for production AI

License ready-to-ship datasets or brief us on a custom build. Every collection is rights-cleared, quality audited, and delivered with enterprise support.

Luel

What's inside every delivery

Vetted sourcing, consent logs, and QA checks to reduce risk and shorten your path to training.

Compliance on autopilot

Consent releases, PII audits, and audit logging baked in for every dataset.

Structured delivery

Manifests with clip metadata, transcripts, QA scores, and direct download links.

Custom augmentation

Annotations, translations, balancing, or rubric-based scoring, delivered end-to-end.

  • Rights-cleared
  • Quality audited
  • Enterprise support

Backed by leading researchers and institutions

Premium collections, ready to license

Browse our curated datasets spanning audio, video, speech, and sensor data. Each collection is rights-cleared and production-ready.

Browse curated datasets

Filter by modality to find the best fit for your training needs.

Speech · Custom
Japanese Conversational Speech

Multi-speaker Japanese dialogue with stereo speaker separation and emotion annotations

Languages

Japanese
  • Stereo speaker separation: L/R channel isolation for clean speaker extraction
  • Dense per-utterance emotion labels with confidence scores
Speech · Enterprise
English Conversational Speech

Multi-speaker dialogue recordings with speaker diarization and emotion annotations

Languages

English
  • Speaker diarization with labeled turns
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
Speech · Enterprise
French Conversational Speech

Multi-speaker French dialogue recordings with speaker diarization and emotion annotations

Languages

French
  • Multi-speaker conversations with complex diarization
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories including Anger, Doubt, Excitement, Determination
Speech · Enterprise
German Conversational Speech

Multi-speaker German dialogue recordings with speaker diarization and emotion annotations

Languages

German
  • Speaker diarization with labeled turns
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
Speech · Enterprise
English Monologue Speech

Professional single-speaker recordings with word-level timestamps and emotion annotations

Languages

English
  • Word-level timestamps for each utterance
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
Speech · Enterprise
French Monologue Speech

Professional single-speaker French recordings with word-level timestamps and emotion annotations

Languages

French
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
Speech · Enterprise
German Monologue Speech

Professional single-speaker German recordings with word-level timestamps and emotion annotations

Languages

German
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
Speech · Enterprise
Japanese Monologue Speech

Professional single-speaker Japanese recordings with word-level timestamps and emotion annotations

Languages

Japanese
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Interest, Confusion, Amusement, Calmness, and more
Speech · Custom
Doctor-Patient Consultation

Clinical consultation dialogues between doctors and patients

Languages

English, Urdu
  • Fully transcribed clinical dialogues
  • Diverse hospital specialties: surgeons, endocrinologists, cardiologists, neurologists, and others
  • Realistic clinical dialogue patterns
Speech · Custom
Telugu Expressive TTS Voice

Natural Telugu speech recordings from native speakers across major regions

Languages

Telugu
  • Fully transcribed with phoneme-level alignment
  • Native Telugu speakers across major regions
  • Comprehensive emotion and style coverage
Speech · Custom
Spanish Finance Conversation

Customer service conversations in finance & banking contexts

Clips

9,000+

Languages

Spanish
  • Dual-channel recording with clear speaker separation
  • Fully transcribed with speaker diarization
  • Multiple conversation types and scenarios
Video · Sensor · Custom
Egocentric Vision for Accessibility AI

First-person video from accessibility users with rich metadata

  • Real-world first-person perspectives
  • Rich multimodal data: video, audio, OCR, motion
  • Diverse daily scenarios and environments
Speech · Custom
Nighttime Traffic Audio Narrations

Urban audio narrations with ambient noise profiling

Languages

English
  • Fully transcribed narrations
  • Real-world urban noise environments
  • Diverse noise profiles and locations
Speech · Custom
Spanish-English Contact Center ASR

Bilingual Spanish-English contact center conversations

Languages

Spanish, English
  • Fully transcribed bilingual Spanish-English conversations
  • Dual-channel recordings with speaker separation

Want to see more? This is just a sample of our available datasets. Browse our complete catalog, explore detailed specifications, request samples, and get custom collection quotes. Contact us to access the full dataset list.

Get in Touch

Need something bespoke? Let's scope it together.

Whether you're training speech-to-text models, vision systems, or multimodal assistants, Luel's collection network and QA pipeline plug directly into your roadmap.

  • End-to-end vendor: sourcing, QA, legal, and delivery
  • Flexible licensing models: flat fee, per minute, or revenue share
  • Dedicated account team for enterprises with recurring needs

Let's work together

Tell us a bit about the data you need and how we can help.