I built a caption editor that runs 100% in the browser - Whisper on WebGPU, MP4 export with WebCodecs, no server

Every "add captions to your short" tool works the same way: you upload your clip to their servers, they transcribe and render it in the cloud, and they meter your exports. That means an upload wait, a queue, file-size caps, a per-export bill, and your footage sitting on someone else's disk.

I wanted to know if you could do the whole thing in the browser instead. Turns out you can, and the result (CapStudio) has a strange property for a video tool: it costs me almost nothing to run, because there is no render farm and no transcription API. The only server is auth, billing, and syncing a tiny project file. That is the entire reason one person can run it.

Here is how the pieces fit together.

Transcription: Whisper on WebGPU, in a tab

Transcription runs locally with @huggingface/transformers (transformers.js v4), which can execute Whisper on WebGPU. The clip's audio is decoded to a mono 16kHz Float32 buffer with decodeAudioData + an OfflineAudioContext, then fed to the pipeline.

Here is how the pieces fit together.

Transcription: Whisper on WebGPU, in a tab

I built a caption editor that runs 100% in the browser - Whisper on WebGPU, MP4 export with WebCodecs, no server

I built a caption editor that runs 100% in the browser - Whisper on WebGPU, MP4 export with WebCodecs, no server

Related reading

Free Video subtitles in seconds — no signup, no watermark, no upload

A Privacy-First Video Captioner That Runs in Your Browser

I built 12 free browser-based tools for creators — here's what I learned

Auto-Add Captions to Every Video Your Team Uploads (n8n + FFmpeg)

I built an AI video clip finder that runs 100% in your browser — no uploads, no…

How I Built 100 Browser-Based Image Tools With No Server (FFmpeg WASM, PDF-lib,…

Related reading

Free Video subtitles in seconds — no signup, no watermark, no upload

A Privacy-First Video Captioner That Runs in Your Browser

I built 12 free browser-based tools for creators — here's what I learned

Auto-Add Captions to Every Video Your Team Uploads (n8n + FFmpeg)

I built an AI video clip finder that runs 100% in your browser — no uploads, no…

How I Built 100 Browser-Based Image Tools With No Server (FFmpeg WASM, PDF-lib,…