| Episode | Status |
|---|---|
Tim McAleer is a producer at Ken Burns’s Florentine Films who is responsible for the technology and processes that power their documentary production. Rather than using AI to generate creative content...
Tim McAleer, a producer at Ken Burns's Florentine Films, demonstrates how he built custom AI-powered tools to automate the tedious parts of documentary production. Rather than using AI for creative generation, Tim focuses on solving data management challenges: automatically extracting metadata from tens of thousands of archival images and videos, building iOS apps for field research, and creating OCR tools for historical documents. His approach shows how AI can eliminate manual data entry while maintaining journalistic accuracy through metadata guardrails and semantic search capabilities.
Tim explains the core challenge in documentary post-production: managing hundreds of hours of footage and tens of thousands of photos across multiple file types. For the Muhammad Ali series alone, they gathered 20,000 still images and over 100 hours of footage. His goal was to automate the manual data entry process that had been done for years.
Tim demonstrates live coding with Cursor and Claude to build a Python script that submits images to OpenAI for description. The breakthrough came when ChatGPT added image upload capability. He shows how to scrape embedded metadata from archival images and use it as guardrails to prevent AI hallucination, ensuring journalistic accuracy.
Tim shows his evolved system: a REST API running on a Mac mini that processes every asset added to their database. The five-step 'autolog' process gathers file specs, copies files, parses metadata, scrapes URLs for additional context, and generates descriptions. For video, he samples frames every 5 seconds using cheap models, then sends consolidated data to reasoning models.
Beyond readable metadata, Tim generates vector embeddings using CLIP for images and OpenAI text models for descriptions, then fuses them. This enables semantic discovery - searching for 'dog' finds 'puppy' - replacing exact text search. The 'Find Similar' feature uses reverse image search within their collection to discover visually similar assets.
Tim vibe-coded an iOS app called Flip Flop to solve field research chaos. Researchers photograph archive materials (front and back), but files get out of order. The app pairs fronts with backs, immediately transcribes text from the back, and embeds all metadata directly into the image file's EXIF data - making it portable across any system.
Tim built OCR Party, a Mac menu bar app for selectively transcribing parts of historical documents. Instead of OCR-ing entire newspapers, editors crop just the relevant article. The app handles poor quality images, creases, handwriting, and multiple languages - tasks traditional OCR engines fail at. Includes option for macOS Vision or AI API for user trust.
Tim's approach to learning AI tools mirrors his experience with creative software like Photoshop and Premiere. Both require navigating complex menus via YouTube and Reddit. He emphasizes that creative professionals are better suited for vibe coding than they realize - coding now feels more like creation than technical work.
Tim distinguishes between AI for tooling (ready today) and creation (not professional-grade yet). He's cautious about generating fake archival footage in nonfiction - a journalistic integrity issue. While acknowledging job displacement concerns in commercial video, he advocates learning the tools regardless. Video generation works well for storyboarding without displacing final production.
Tim's voice-mode prompting strategy involves being polite to AI and using 'resume work' prompts when stuck. He asks the AI to summarize everything for another developer, which reveals misunderstandings. This summary shows where communication broke down, allowing him to prune and restart in a fresh chat rather than fighting the same conversation.
“Nobody wanted to do this work”: How Emmy Award–winning filmmakers use AI to automate the tedious parts of documentaries
Ask me anything about this podcast episode...
Try asking: