The Pipeline
Every video follows the same automated pipeline. No manual editing. No desktop software. Just APIs, prompts, and a Mac.
Pick 6 best photos from the MLS listing. The system works with any standard listing photography — no special shots required. Exterior, living room, kitchen, bedrooms, and a feature shot (pool, view, fireplace).
Input: 6 JPGsPhotos are cropped to 16:9 cinematic aspect ratio (1536×864). MLS watermarks are automatically removed via intelligent cropping. Colors are normalized for consistent tone across all 6 frames.
AutomaticEach photo is submitted to Google's Veo 3.1 video generation model with a custom cinematic prompt. The AI animates each still into a smooth, realistic video clip with specific camera movement:
Each clip runs 5–8 seconds. Camera movement is anchored to real objects in the photo — the stone mantle, the granite countertop, the timber beams — preventing AI drift or hallucinated objects.
Google Veo 3.1A property-specific voiceover script is generated (50–80 words, 30–50 seconds). Scripts follow a proven structure:
The script is voiced by ElevenLabs AI (multiple voice options). Background music is generated to match the property aesthetic — warm acoustic for mountain homes, ambient electronic for urban condos, airy percussion for beachfront.
ElevenLabsA single command assembles the final video. All clips normalized to 1920×1080, 30fps, H.264. Concatenated in narrative sequence — exterior → interior rooms → feature → closing. Voiceover mixed at full volume. Music at −12dB with fade in/out.
FFmpegFinal video reviewed for AI artifacts, timing alignment, and audio balance. Exported as broadcast-ready 1080p MP4. Upload directly to MLS, Zillow, YouTube, Instagram, TikTok — no conversion needed.
Output: 1080p MP4The secret sauce
The difference between a slideshow and cinema is in the prompts. Each one is crafted with:
Tech stack
The same AI models powering Google and ElevenLabs production workflows.
Video generation — animates still photos into cinematic clips with realistic camera movement.
Voice + music generation — professional AI voiceover and property-matched background scores.
Assembly pipeline — clip normalization, concatenation, multi-track audio mixing, broadcast encoding.
Comparison
| Traditional Video | Reel Houses | |
|---|---|---|
| Cost per video | $500 – $2,000+ | $49 |
| Turnaround | 3–7 days | < 1 hour |
| Crew required | Videographer + editor | None |
| Scheduling | Coordinate staging, weather, time | Works instantly |
| Voiceover | $150–$500 extra | Included |
| Licensed music | $50–$200 extra | Included |
| Reshoots | Reschedule entire shoot | Re-run with new photos |
| Scalability | 1 video per shoot | Dozens per day |
Send us 6 photos. We'll send back a cinematic tour.
Get started →