Overview
First-person smartphone recordings of contributors cleaning kitchens: loading dishwashers, handwashing dishes, wiping counters, and storing cookware. Each session includes narration synchronized to the video that describes each action, along with object bounding boxes and step timestamps. The data is intended for training robotic manipulation and instructional models.
Highlights
- Multi-angle coverage: primary POV plus optional static wide shot
- Action narration with on-screen captions, usable for action recognition and video-text alignment
- Tool and cleaning-supply metadata (brand, material, observed state)
Deliverables
Files: 4K MP4 (POV), 1080p MP4 (static wide), JSON action timeline, YOLO-format bounding boxes
Notes: step-by-step transcripts with timecodes; sessions recorded both with and without gloves
Labels: action_class, object_state, surface_type, cleanliness_score
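
Because the bounding boxes ship in YOLO format, a short sketch of reading one label line may help. It assumes the common YOLO text convention of `class_id x_center y_center width height`, with the four coordinates normalized to the 0-1 range. The listing does not say which variant is used, so check a sample file first. The names `PixelBox` and `parse_yolo_line` are hypothetical, and the 3840x2160 frame size in the usage lines is an assumption about the 4K POV video (a portrait recording would swap the two values).

```python
from dataclasses import dataclass


@dataclass
class PixelBox:
    """Axis-aligned box in pixel coordinates (hypothetical helper type)."""
    class_id: int
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def parse_yolo_line(line: str, frame_width: int, frame_height: int) -> PixelBox:
    """Convert one normalized YOLO label line to pixel corner coordinates.

    Assumes the layout: class_id x_center y_center width height,
    with coordinates normalized to the frame size.
    """
    parts = line.split()
    if len(parts) != 5:
        raise ValueError(f"expected 5 fields, got {len(parts)}: {line!r}")

    class_id = int(parts[0])
    x_center, y_center, width, height = (float(v) for v in parts[1:])

    # Scale the normalized center/size to pixels, then derive the corners.
    return PixelBox(
        class_id=class_id,
        x_min=(x_center - width / 2) * frame_width,
        y_min=(y_center - height / 2) * frame_height,
        x_max=(x_center + width / 2) * frame_width,
        y_max=(y_center + height / 2) * frame_height,
    )


# Usage: a centered box a quarter of the frame wide and half the frame tall,
# on an assumed 3840x2160 landscape frame.
box = parse_yolo_line("0 0.5 0.5 0.25 0.5", 3840, 2160)
print(box)
```

The mapping from `class_id` to an object name is not described in the listing; it would typically come from a class list supplied alongside the label files.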