Text Embedding 3 Large

Popular
OpenAI
Embeddings

OpenAI's most powerful embedding model. 3072 dimensions for maximum accuracy.

Embed with Text Embedding 3 Large
Vectorize text and preview the first 8 dimensions as a bar chart.
Sign in to try this model with €5 free credits.
Sign in
Outputs a high-dimensional vector you can plug into RAG or search.
Vector preview appears here.
TL;DRΒ·Last updated March 4, 2026

Text Embedding 3 Large is embeddings AI model from OpenAI, priced at €1.30 per 1M input tokens with a unknown context window.

Try Text Embedding 3 Large
Sign in to generate β€” 50 free credits on sign-up

Examples

See what Text Embedding 3 Large can generate

FAQ Matching

Input:

"How do I reset my password?"

Similar matches:

Steps to change your account password

96%

I forgot my login credentials

91%

Account recovery and password reset guide

89%

Code Search

Input:

"React useEffect cleanup function memory leak"

Similar matches:

Preventing memory leaks in React component lifecycle

93%

useEffect return function for subscription cleanup

90%

React hooks best practices for side effects

84%

Pricing

Price per Generation
Per generationFree

API Integration

Use our OpenAI-compatible API to integrate Text Embedding 3 Large into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

const vectors = await rw.run("text-embedding-3-large", "Hello world", { type: "embed" });
console.log(vectors[0].length); // embedding dimensions

// Or use the embed() method for full control
const res = await rw.embed("text-embedding-3-large", ["Hello", "World"]);
for (const item of res.data) {
  console.log(item.embedding.length);
}
Specifications
Avg. latency
600ms
Developer
OpenAI
Category
Embeddings
Tags
high-quality

Deep dive β€” OpenAI's Text Embedding 3 Large

About OpenAI
Founded 2015 Β· San Francisco, California, USA

OpenAI was founded in December 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever, Wojciech Zaremba and John Schulman, and restructured to capped-profit OpenAI LP in 2019. The embedding model family started with text-embedding-ada-001 in 2021, was unified into text-embedding-ada-002 in December 2022 (still the most-used embedding model on Earth at one point) and replaced in January 2024 by text-embedding-3-small and text-embedding-3-large. The v3 release was OpenAI's first to support Matryoshka-style dimension reduction (sale of arbitrary 256-3072 dim vectors from the same model) and beat ada-002 by 20+ percentage points on the MIRACL multilingual retrieval benchmark while costing roughly the same.

Visit OpenAI β†’
Architecture
Transformer bi-encoder with Matryoshka representation learning

OpenAI text-embedding-3-large is the flagship embedding model in the v3 generation. It produces 3,072-dimensional vectors by default with full Matryoshka representation support, so callers can request any dimension between 256 and 3,072 in the API and OpenAI will truncate-and-renormalise without re-running the model. The model accepts up to 8,191 tokens per input and was trained with a contrastive retrieval objective on a curated multilingual web corpus, including search-query/document pairs and large-scale instruction-tuned pairs. It scores 64.6% on MTEB and 54.9% on MIRACL multilingual retrieval, a step up from 61.0% / 31.4% for ada-002. Pricing is $0.00013 per 1k tokens. Output vectors are L2-normalised. Like all OpenAI models the system is closed-source and hosted only. OpenAI has not published a technical paper for the v3 family beyond a launch blog.

Parameters
Undisclosed
Context
8.2K tokens
What it can do
  • 3,072-dim vectors with Matryoshka truncation to any 256-3072 size
  • 8,191-token context window for long-document embedding
  • Multilingual coverage across 100+ languages
  • State-of-the-art MIRACL multilingual retrieval (~55%)
  • L2-normalised vectors with cosine similarity
  • Drop-in upgrade from ada-002 via the same /embeddings endpoint
  • Best for: production RAG, multilingual search, retrieval-heavy SaaS
Training & License

Not disclosed. OpenAI describes a 'curated multilingual web corpus' with search-query/document pairs and instruction-tuned pairs.

License: Proprietary commercial API. Generated embeddings may be stored and used commercially under the OpenAI Usage Policy.

Known limitations
  • Closed weights, hosted only
  • Cannot fine-tune the model
  • 3,072-dim full vectors are storage-heavy without truncation
  • Worse-than-Cohere v3 on some low-resource languages
  • 8,191-token cap may force chunking for very long documents

Frequently asked questions

Start using Text Embedding 3 Large today

Get started with free credits. No credit card required. Access Text Embedding 3 Large and 100+ other models through a single API.