mxbai-embed-large-v1
Mixedbread's open-source 335M embedding model. Top MTEB benchmark for English retrieval at release.
mxbai-embed-large-v1 is embeddings AI model from Custom, priced at €0.000 per 1M input tokens with a 512 tokens context window.
Pricing
API Integration
Use our OpenAI-compatible API to integrate mxbai-embed-large-v1 into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
const vectors = await rw.run("mxbai-embed-large-v1", "Hello world", { type: "embed" });
console.log(vectors[0].length); // embedding dimensions
// Or use the embed() method for full control
const res = await rw.embed("mxbai-embed-large-v1", ["Hello", "World"]);
for (const item of res.data) {
console.log(item.embedding.length);
}Deep dive — Mixedbread AI's mxbai-embed-large-v1
Mixedbread AI (also stylised mxbai) was founded in 2023 in Berlin by Sean Lee, Aamir Shakir, Julius Lipp and Rui Huang with the goal of building best-in-class open-source retrieval and embedding models. The team released several iterations of the mxbai-embed and mxbai-rerank series under Apache 2.0 licence on Hugging Face and is widely cited as one of the few well-funded open-weights embedding labs alongside Jina AI and Nomic AI. mxbai-embed-large-v1 launched in March 2024 and immediately ranked at the top of the MTEB English leaderboard among models under 1B parameters, while remaining fully open under Apache 2.0. The company raised a seed round in 2024 from BlueYard Capital and angel investors and offers a hosted API as a paid product complementing the free open weights.
Visit Mixedbread AI →mxbai-embed-large-v1 is a 335M-parameter Transformer bi-encoder built on top of the bert-large-uncased backbone (24 layers, 1024 hidden dim, 16 heads). It outputs a 1024-dim vector and is trained for English text embedding using the AnglE loss (Angle-Optimized Text Embedding) plus contrastive InfoNCE and a curated mix of supervised text pairs from NLI, MS MARCO, HotpotQA, FEVER, NQ and SQuAD. The model supports Matryoshka representation learning, meaning the leading 64 / 128 / 256 / 512 / 768 dimensions are independently meaningful and can be truncated for storage savings with minimal quality loss. Maximum input length is 512 tokens, and inputs longer than this must be chunked. The training emphasised generalisation rather than benchmark over-fitting, and the model is reported to remain competitive without any prompt engineering. Weights are Apache-2.0 licensed and the model runs efficiently on consumer GPUs.
- Parameters
- 335M
- Context
- 512 tokens
- Top-tier MTEB English score for an open-weights 335M model
- 1024-dim vectors with Matryoshka truncation to 64/128/256/512 dims
- AnglE loss for improved similarity isotropy
- Open weights under Apache 2.0 with no use restriction
- Runs on a single 8 GB GPU or in CPU mode for low-volume tasks
- Strong on retrieval, clustering and STS benchmarks
- Best for: open-source RAG, on-premise embedding pipelines, cost-sensitive SaaS
Supervised training on a curated mix of English text pairs from NLI, MS MARCO, HotpotQA, FEVER, Natural Questions and SQuAD, plus contrastive negatives mined from large web corpora.
License: Apache 2.0 for code and weights; commercial use permitted without restriction.
Known limitations
- English only (multilingual support requires a separate Mixedbread checkpoint)
- 512-token context limit
- 1024-dim full vectors heavier than 384-dim alternatives
- 335M parameters slower than smaller distilled models for high-throughput inference
- AnglE loss sensitive to input normalisation choices
Frequently asked questions
Related Models
View all EmbeddingsText Embedding 3 Large
OpenAI's most powerful embedding model. 3072 dimensions for maximum accuracy.
Voyage AI voyage-3
Voyage's general-purpose embedding model. 1024 dims, 32k context, strong retrieval performance.
Cohere embed-multilingual-v3
Cohere's multilingual embedding model. Supports 100+ languages with separate search and classification modes.
Jina Embeddings v3 (Multilingual)
Jina's frontier multilingual embedding model. 570M params, 8192 ctx, 89 languages, Matryoshka dims 128-1024.
Start using mxbai-embed-large-v1 today
Get started with free credits. No credit card required. Access mxbai-embed-large-v1 and 100+ other models through a single API.