TL;DRAI

Developer built Suflo, a copilot for calls delivering real-time AI hints; core challenges: system audio capture and latency. For tech managers: real-time AI requires first-token streaming, server-side keys, and platform-permission issues often exceed core engineering cost.

Half a year ago I asked a simple question: during an online call, could a short, to-the-point hint appear on my screen in a second or two — while the other person is still talking? Not an after-the-fact transcript, but help in the moment.

The result is a desktop assistant (macOS + Windows). Below is an honest breakdown of what turned out to be hard, and which solutions worked. Engineering only, no marketing.

Architecture in one paragraph

On the device there are only two things: audio capture and a thin UI overlay. All the "brains" (provider keys, prompts, model selection) live on the server. The client gets a short-lived per-session token and streams audio; the server returns the transcript and the generated answer. I picked this split not for "security theater" but because otherwise keys and prompts would have to be baked into the binary — and both leak instantly.

Hard part #1: system audio, not the microphone

dev.to

Building a real-time desktop AI copilot for calls: the hard parts

Half a year ago I asked a simple question: during an online call, could a short, to-the-point hint...

domenica 21 giugno 2026 New tab

TL;DRAI

691 words~3 min read

The result is a desktop assistant (macOS + Windows). Below is an honest breakdown of what turned out to be hard, and which solutions worked. Engineering only, no marketing.

Architecture in one paragraph

Hard part #1: system audio, not the microphone

Building a real-time desktop AI copilot for calls: the hard parts

Building a real-time desktop AI copilot for calls: the hard parts

Other newsrooms on this story

Related reading

The hardest part of my AI meeting app was the audio, not the AI

I Built Two AI Desktop Tools with Python — No API Keys, No Cloud Costs

How to prompt Copilot or ChatGPT to accurately troubleshoot your PC - without…

Deskbrid: A Linux Desktop HAL Built Entirely by AI Agents

I Built a Desktop AI Assistant That Controls Your Computer — Here's How

I Built an AI Assistant That Lives in My Telegram — Here's What 6 Months Taught…

Other newsrooms on this story

Related reading

The hardest part of my AI meeting app was the audio, not the AI

I Built Two AI Desktop Tools with Python — No API Keys, No Cloud Costs

How to prompt Copilot or ChatGPT to accurately troubleshoot your PC - without…

Deskbrid: A Linux Desktop HAL Built Entirely by AI Agents

I Built a Desktop AI Assistant That Controls Your Computer — Here's How

I Built an AI Assistant That Lives in My Telegram — Here's What 6 Months Taught…