mxbai-embed-large-v1

Custom
Embeddings

Mixedbread's open-source 335M embedding model. Top MTEB benchmark for English retrieval at release.

Embed with mxbai-embed-large-v1
Vectorize text and preview the first 8 dimensions as a bar chart.
Sign in to try this model with €5 free credits.
Sign in
Outputs a high-dimensional vector you can plug into RAG or search.
Vector preview appears here.
TL;DR·Last updated May 16, 2026

mxbai-embed-large-v1 is embeddings AI model from Custom, priced at €0.000 per 1M input tokens with a 512 tokens context window.

Try mxbai-embed-large-v1
Direct API access coming soon

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate mxbai-embed-large-v1 into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

const vectors = await rw.run("mxbai-embed-large-v1", "Hello world", { type: "embed" });
console.log(vectors[0].length); // embedding dimensions

// Or use the embed() method for full control
const res = await rw.embed("mxbai-embed-large-v1", ["Hello", "World"]);
for (const item of res.data) {
  console.log(item.embedding.length);
}
Specifications
Context window
512 tokens
Developer
Custom
Category
Embeddings
Supported Formats
text
Tags
mixedbread
embedding
open-weights
english
mteb
pricing-tbd

Deep dive — Mixedbread AI's mxbai-embed-large-v1

About Mixedbread AI
Founded 2023 · Berlin, Germany

Mixedbread AI (also stylised mxbai) was founded in 2023 in Berlin by Sean Lee, Aamir Shakir, Julius Lipp and Rui Huang with the goal of building best-in-class open-source retrieval and embedding models. The team released several iterations of the mxbai-embed and mxbai-rerank series under Apache 2.0 licence on Hugging Face and is widely cited as one of the few well-funded open-weights embedding labs alongside Jina AI and Nomic AI. mxbai-embed-large-v1 launched in March 2024 and immediately ranked at the top of the MTEB English leaderboard among models under 1B parameters, while remaining fully open under Apache 2.0. The company raised a seed round in 2024 from BlueYard Capital and angel investors and offers a hosted API as a paid product complementing the free open weights.

Visit Mixedbread AI →
Architecture
Transformer bi-encoder with AnglE loss and Matryoshka representation learning

mxbai-embed-large-v1 is a 335M-parameter Transformer bi-encoder built on top of the bert-large-uncased backbone (24 layers, 1024 hidden dim, 16 heads). It outputs a 1024-dim vector and is trained for English text embedding using the AnglE loss (Angle-Optimized Text Embedding) plus contrastive InfoNCE and a curated mix of supervised text pairs from NLI, MS MARCO, HotpotQA, FEVER, NQ and SQuAD. The model supports Matryoshka representation learning, meaning the leading 64 / 128 / 256 / 512 / 768 dimensions are independently meaningful and can be truncated for storage savings with minimal quality loss. Maximum input length is 512 tokens, and inputs longer than this must be chunked. The training emphasised generalisation rather than benchmark over-fitting, and the model is reported to remain competitive without any prompt engineering. Weights are Apache-2.0 licensed and the model runs efficiently on consumer GPUs.

Parameters
335M
Context
512 tokens
What it can do
  • Top-tier MTEB English score for an open-weights 335M model
  • 1024-dim vectors with Matryoshka truncation to 64/128/256/512 dims
  • AnglE loss for improved similarity isotropy
  • Open weights under Apache 2.0 with no use restriction
  • Runs on a single 8 GB GPU or in CPU mode for low-volume tasks
  • Strong on retrieval, clustering and STS benchmarks
  • Best for: open-source RAG, on-premise embedding pipelines, cost-sensitive SaaS
Training & License

Supervised training on a curated mix of English text pairs from NLI, MS MARCO, HotpotQA, FEVER, Natural Questions and SQuAD, plus contrastive negatives mined from large web corpora.

License: Apache 2.0 for code and weights; commercial use permitted without restriction.

Known limitations
  • English only (multilingual support requires a separate Mixedbread checkpoint)
  • 512-token context limit
  • 1024-dim full vectors heavier than 384-dim alternatives
  • 335M parameters slower than smaller distilled models for high-throughput inference
  • AnglE loss sensitive to input normalisation choices

Frequently asked questions

Start using mxbai-embed-large-v1 today

Get started with free credits. No credit card required. Access mxbai-embed-large-v1 and 100+ other models through a single API.