How much does mxbai-embed-large-v1 cost via Railwail?

No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of mxbai-embed-large-v1?

mxbai-embed-large-v1 supports a 512 tokens context window — enough for short prompts and chat.

How fast is mxbai-embed-large-v1?

Latency depends on prompt length and load — typically 200ms to 2s for short prompts. We measure p50/p95 in real-time on /rankings.

Is mxbai-embed-large-v1 better than BGE Large EN v1.5?

It depends on your use case. mxbai-embed-large-v1 (Custom) and BGE Large EN v1.5 (huggingface) are both strong choices in embeddings. Compare them side-by-side at /compare/mxbai-embed-large-v1-vs-bge-large-en-v1-5.

mxbai-embed-large-v1

Name: mxbai-embed-large-v1
Brand: Custom
SKU: mxbai-embed-large-v1
Availability: InStock

Custom

Embeddings

Mixedbread's open-source 335M embedding model. Top MTEB benchmark for English retrieval at release.

Embed with mxbai-embed-large-v1

Vectorize text and preview the first 8 dimensions as a bar chart.

Outputs a high-dimensional vector you can plug into RAG or search.

Vector preview appears here.

TL;DR·Last updated June 24, 2026

mxbai-embed-large-v1 is embeddings AI model from Custom, priced at €0.000 per 1M input tokens with a 512 tokens context window.

Try mxbai-embed-large-v1

Input Text

Direct API access coming soon

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate mxbai-embed-large-v1 into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

const vectors = await rw.run("mxbai-embed-large-v1", "Hello world", { type: "embed" });
console.log(vectors[0].length); // embedding dimensions

// Or use the embed() method for full control
const res = await rw.embed("mxbai-embed-large-v1", ["Hello", "World"]);
for (const item of res.data) {
  console.log(item.embedding.length);
}

Specifications

Context window

512 tokens

Developer

Custom

Deep dive — Mixedbread AI's mxbai-embed-large-v1

About Mixedbread AI

Founded 2023 · Berlin, Germany

Mixedbread AI (also stylised mxbai) was founded in 2023 in Berlin by Sean Lee, Aamir Shakir, Julius Lipp and Rui Huang with the goal of building best-in-class open-source retrieval and embedding models. The team released several iterations of the mxbai-embed and mxbai-rerank series under Apache 2.0 licence on Hugging Face and is widely cited as one of the few well-funded open-weights embedding labs alongside Jina AI and Nomic AI. mxbai-embed-large-v1 launched in March 2024 and immediately ranked at the top of the MTEB English leaderboard among models under 1B parameters, while remaining fully open under Apache 2.0. The company raised a seed round in 2024 from BlueYard Capital and angel investors and offers a hosted API as a paid product complementing the free open weights.

Visit Mixedbread AI →

Architecture

Transformer bi-encoder with AnglE loss and Matryoshka representation learning

mxbai-embed-large-v1 is a 335M-parameter Transformer bi-encoder built on top of the bert-large-uncased backbone (24 layers, 1024 hidden dim, 16 heads). It outputs a 1024-dim vector and is trained for English text embedding using the AnglE loss (Angle-Optimized Text Embedding) plus contrastive InfoNCE and a curated mix of supervised text pairs from NLI, MS MARCO, HotpotQA, FEVER, NQ and SQuAD. The model supports Matryoshka representation learning, meaning the leading 64 / 128 / 256 / 512 / 768 dimensions are independently meaningful and can be truncated for storage savings with minimal quality loss. Maximum input length is 512 tokens, and inputs longer than this must be chunked. The training emphasised generalisation rather than benchmark over-fitting, and the model is reported to remain competitive without any prompt engineering. Weights are Apache-2.0 licensed and the model runs efficiently on consumer GPUs.

Parameters: 335M
Context: 512 tokens

What it can do

Top-tier MTEB English score for an open-weights 335M model
1024-dim vectors with Matryoshka truncation to 64/128/256/512 dims
AnglE loss for improved similarity isotropy
Open weights under Apache 2.0 with no use restriction
Runs on a single 8 GB GPU or in CPU mode for low-volume tasks
Strong on retrieval, clustering and STS benchmarks
Best for: open-source RAG, on-premise embedding pipelines, cost-sensitive SaaS

Training & License

Supervised training on a curated mix of English text pairs from NLI, MS MARCO, HotpotQA, FEVER, Natural Questions and SQuAD, plus contrastive negatives mined from large web corpora.

License: Apache 2.0 for code and weights; commercial use permitted without restriction.

Known limitations

English only (multilingual support requires a separate Mixedbread checkpoint)
512-token context limit
1024-dim full vectors heavier than 384-dim alternatives
335M parameters slower than smaller distilled models for high-throughput inference
AnglE loss sensitive to input normalisation choices

Research papers

Frequently asked questions

Related Models

View all Embeddings

BGE Large EN v1.5

huggingface

BAAI (Beijing Academy of AI) open-weight English embedding model with 335M parameters. Returns 1024-dim vectors and was a top MTEB English retrieval model on release. The v1.5 update improved similarity distribution so it works well without a query instruction prefix for symmetric tasks. A widely used open alternative to hosted embeddings.

€1.00

BGE-M3 (Multilingual)

huggingface

BAAI multilingual embedding model covering 100+ languages with an 8192-token context. M3 stands for its multi-functionality (dense, sparse and ColBERT-style multi-vector retrieval), multilinguality and multi-granularity over long documents. Returns 1024-dim dense vectors and is a strong open choice for cross-lingual and long-text retrieval.

€1.00

ESM-2 650M (Protein Embeddings)

huggingface

Meta AI 650M-parameter protein language model trained on UniRef50 sequences. Feed it an amino-acid sequence and the per-residue hidden states act as learned protein embeddings, used for structure prediction, variant-effect and function tasks. This 33-layer checkpoint is the common balance of quality and cost in the ESM-2 family.

€2.00

Nomic Embed Text v1.5

huggingface

Nomic AI open embedding model with a fully reproducible training pipeline (open weights, data and code). Supports an 8192-token context and Matryoshka representation learning, so you can truncate the 768-dim output down to 64 dims with graceful quality loss. Uses task prefixes like search_query and search_document.

€1.00

Start using mxbai-embed-large-v1 today

Get started with free credits. No credit card required. Access mxbai-embed-large-v1 and 100+ other models through a single API.

Get Started Free Browse All Models

mxbai-embed-large-v1

Pricing

API Integration

Deep dive — Mixedbread AI's mxbai-embed-large-v1

Research papers

Frequently asked questions

What is mxbai-embed-large-v1?

How much does mxbai-embed-large-v1 cost via Railwail?

What is the context window of mxbai-embed-large-v1?

How fast is mxbai-embed-large-v1?

Is mxbai-embed-large-v1 better than BGE Large EN v1.5?

Related Models

BGE Large EN v1.5

BGE-M3 (Multilingual)

ESM-2 650M (Protein Embeddings)

Nomic Embed Text v1.5

Start using mxbai-embed-large-v1 today