TL;DR – We’re excited to introduce the Voyage 4 series, a new generation of text embedding models featuring industry-first shared embedding spaces. The series includes voyage-4-large, voyage-4, voyage-4-lite, and the open-weighted voyage-4-nano. All models produce compatible embeddings, allowing customers to mix and match models for query and document embedding based on their specific accuracy, latency, and cost requirements. Furthermore, voyage-4-large leverages a mixture-of-experts (MoE) model architecture to deliver state-of-the-art retrieval accuracy while maintaining serving costs 40% lower than comparable dense models.
Today, we’re excited to announce the Voyage 4 model family. These models serve two key use cases: existing customers seeking more accurate retrieval, and developers building context-engineered agents that require high retrieval accuracy with low latency and cost for high-volume reads (e.g., from shared memory):
voyage-4-large. Our new flagship embedding model leveraging a mixture-of-experts (MoE) architecture to establish a new state-of-the-art while maintaining serving costs 40% lower than comparable dense models. This is the first production-grade embedding model to utilize MoE architecture.






