TL;DR

Tether is scaling QVAC, its on-device AI initiative for consumer hardware: it is hiring inference engineers, has launched the QVAC MedPsy medical model, and is offering uncapped developer grants. For managers, this signals a viable alternative to cloud AI, one that offers control over privacy and infrastructure independence from Big Tech platforms.

The company behind the world’s largest stablecoin is on a hiring spree, and it has nothing to do with dollar reserves. Tether is actively recruiting inference engineers for its QVAC initiative, a project designed to run AI models locally on consumer devices like smartphones and laptops, no cloud required.

The open roles span multiple seniority levels, including Lead AI Inference Engineer, Senior AI Inference Engineer, and AI Inference Engineer positions within the QVAC division. Candidates are expected to bring expertise in frameworks like llama.cpp and ggml, both widely used in the open-source community for running large language models on modest hardware. The focus is on optimizing Tether’s C++ runtime layer for edge devices.

QVAC: Tether’s bet on local-first AI

QVAC was first unveiled in May 2025 as a decentralized platform for running AI agents directly on user hardware. The QVAC Fabric LLM now supports inference on consumer-grade devices across various GPU architectures, from smartphones to desktops.

The most tangible proof of progress came on May 7, 2026, with the launch of QVAC MedPsy, a medical language model built specifically for local execution on smartphones and wearables. Tether claims it delivers performance matching or exceeding that of traditional cloud-based models.

cryptobriefing.com

Tether AI hires inference engineers to advance local AI projects

Tether is recruiting C++ inference engineers for its QVAC platform, a local-first AI initiative focused on on-device processing, privacy, and decentralized development.

Monday, June 1, 2026

Related reading

Tether AI open-sources brain-to-text engine, prioritizes user privacy with QVAC

Tether releases open source version of Google's TurboQuant to cut AI memory use

Tether AI open-sources TurboQuant, reducing LLM KV cache memory use by 5x

Tether Brings AI Memory Compression To Consumer Devices

Democratizing AI adoption with Tether's Bitnet LLM fine-tuning framework

AMD acqui-hires the employees behind Untether AI | TechCrunch