The Setup

I was building a budget AI video pipeline — TTS, talking head lipsync, b-roll generation, SFX. Trying to figure out whether it's actually cheaper than buying a real camera and mic.

The AI I was talking to was great. Enthusiastic. Knowledgeable. Every answer started with "YES!", "100%", "You nailed it." We were on a roll.

Here's the flow we landed on for a 5-min YouTube video:

Step