The company behind the world’s largest stablecoin is on a hiring spree, and it has nothing to do with dollar reserves. Tether is actively recruiting inference engineers for its QVAC initiative, a project designed to run AI models locally on consumer devices like smartphones and laptops, no cloud required.

The open roles span multiple seniority levels, including Lead AI Inference Engineer, Senior AI Inference Engineer, and AI Inference Engineer positions within the QVAC division. Candidates are expected to bring expertise in frameworks like llama.cpp and ggml, both widely used in the open-source community for running large language models on modest hardware. The focus is on optimizing Tether’s C++ runtime layer for edge devices.

QVAC: Tether’s bet on local-first AI

QVAC was initially unveiled in May 2025 as a decentralized platform for running AI agents directly on user hardware. The QVAC Fabric LLM now supports inference on consumer-grade devices across various GPU architectures, spanning smartphones to desktops.

The most tangible proof of progress came on May 7, 2026, with the launch of QVAC MedPsy. This is a medical language model built specifically for local execution on smartphones and wearables. Tether claims it delivers performance matching or exceeding traditional cloud-based models.