Jina Embeddings v3 (Multilingual)

Custom
Embeddings

Jina's frontier multilingual embedding model. 570M params, 8192 ctx, 89 languages, Matryoshka dims 128-1024.

Embed with Jina Embeddings v3 (Multilingual)
Vectorize text and preview the first 8 dimensions as a bar chart.
Sign in to try this model with €5 free credits.
Sign in
Outputs a high-dimensional vector you can plug into RAG or search.
Vector preview appears here.
TL;DRΒ·Last updated May 16, 2026

Jina Embeddings v3 (Multilingual) is embeddings AI model from Custom, priced at €0.020 per 1M input tokens with a 8.2K tokens context window.

Try Jina Embeddings v3 (Multilingual)
Direct API access coming soon

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Jina Embeddings v3 (Multilingual) into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

const vectors = await rw.run("jina-embeddings-v3-multilingual", "Hello world", { type: "embed" });
console.log(vectors[0].length); // embedding dimensions

// Or use the embed() method for full control
const res = await rw.embed("jina-embeddings-v3-multilingual", ["Hello", "World"]);
for (const item of res.data) {
  console.log(item.embedding.length);
}
Specifications
Context window
8,192 tokens
Developer
Custom
Category
Embeddings
Supported Formats
text
Tags
jina
embedding
multilingual
matryoshka
task-lora

Deep dive β€” Jina AI's Jina Embeddings v3 (Multilingual)

About Jina AI
Founded 2020 Β· Berlin, Germany

Jina AI was founded in February 2020 in Berlin by Han Xiao (CEO, ex-Tencent and Zalando) together with co-founders Maximilian Werk, Christina Reher and Vincent Zhang. The company started as an open-source neural search framework (Jina, DocArray) and later pivoted to building proprietary multimodal embedding and reranking models offered through a hosted API. Jina has raised over $40M from investors including Canaan Partners, Mango Capital, Yunqi Partners and SAP, and ships its products under a freemium model. The Jina Embeddings v3 family was released in September 2024 and was the first open-weights embedding model to feature task-specific Low-Rank Adaptation (LoRA) heads selected at inference time, plus an 8,192-token context window. The release was widely covered as a top-tier multilingual embedding alternative on the MTEB leaderboard.

Visit Jina AI β†’
Architecture
Transformer bi-encoder with task-specific LoRA heads and Matryoshka representation learning

Jina Embeddings v3 is a 570M-parameter Transformer bi-encoder based on the XLM-RoBERTa architecture with several upgrades: rotary position embeddings, FlashAttention 2 and an extended context window of 8,192 tokens. It supports 89 languages with strong cross-lingual retrieval. A distinguishing feature is a set of five task-specific LoRA adapters (retrieval.query, retrieval.passage, separation, classification, text-matching) that are swapped at inference time by passing a 'task' parameter, which improves quality on each downstream task without retraining the base model. The output is a 1024-dimensional vector with Matryoshka representation learning, so truncation to 256 / 512 / 768 dimensions remains semantically meaningful and allows a quality vs. storage trade-off. Training used a multi-stage curriculum on multilingual text-pair data, search-query/document pairs and curated NLI data. Weights are released on Hugging Face under CC-BY-NC 4.0 for research use; commercial use is permitted via the hosted API or a paid commercial licence.

Parameters
570M
Context
8.2K tokens
What it can do
  • 89 languages with strong cross-lingual retrieval
  • 8,192-token context window for long-document embedding
  • Task-specific LoRA adapters (query, passage, classification, clustering, similarity)
  • Matryoshka representation learning: truncate to 256/512/768 dims with graceful degradation
  • Top-tier MTEB performance for a model under 1B parameters
  • Open weights on Hugging Face for research; commercial via API or paid licence
  • Best for: multilingual long-document RAG, semantic search, embedding-heavy SaaS
Training & License

Multi-stage training on multilingual text pairs, search-query/document pairs, NLI data and curated synthetic data. Exact token count not disclosed.

License: Weights under CC-BY-NC 4.0 on Hugging Face (research only). Commercial use via Jina API or a paid commercial licence.

Known limitations
  • Open weights require commercial licence for paid products
  • 1024-dim default may be wasteful without Matryoshka truncation
  • Task adapter parameter required for best quality on each task
  • Long-context retrieval still weaker than chunk-based pipelines on some benchmarks
  • Hosted API latency higher than OpenAI text-embedding-3 on small inputs

Frequently asked questions

Start using Jina Embeddings v3 (Multilingual) today

Get started with free credits. No credit card required. Access Jina Embeddings v3 (Multilingual) and 100+ other models through a single API.