How much does GPT-4o cost via Railwail?

Input: €25.00 per 1M tokens. Output: €100.00 per 1M tokens. No monthly minimum, no subscription. Start with €5 free credits.

What is the context window of GPT-4o?

GPT-4o supports a 128K tokens context window — enough for long books, technical manuals, and extended analysis.

Average response latency: 2.0s (p50 across recent Railwail traffic). See live p50/p95 metrics on /rankings.

Is GPT-4o better than Bio_ClinicalBERT?

It depends on your use case. GPT-4o (OpenAI) and Bio_ClinicalBERT (huggingface) are both strong choices in text & chat. Compare them side-by-side at /compare/gpt-4o-vs-bio-clinicalbert.

GPT-4o

Name: GPT-4o
Brand: OpenAI
SKU: gpt-4o
Price: 0.025 EUR
Availability: InStock

Popular

OpenAI

Text & Chat

OpenAI's most capable multimodal model. Excellent for complex reasoning, coding, and creative tasks.

Try GPT-4o now

Send a single prompt and stream a response inline. Hit Cmd+Enter to submit.

Press Cmd+Enter to send

Response appears here.

TL;DR·Last updated March 4, 2026

GPT-4o is text & chat AI model from OpenAI, priced at €25.00 per 1M input tokens with a 128K tokens context window.

Try GPT-4o

System Prompt

Message

Temperature

0.7

Max Tokens

Examples

See what GPT-4o can generate

Creative Writing

Write a haiku about the first snowfall of winter

Silent flakes descend, blanketing the sleeping earth— winter whispers soft.

Complex Analysis

Explain the difference between supervised and unsupervised machine learning in simple terms

Supervised learning is like studying with an answer key—you feed the model labeled examples so it learns to predict the correct output. Unsupervised learning is more like exploring a dataset without labels, where the model discovers hidden patterns, groupings, or structures on its own. Think of it as the difference between a teacher grading your homework versus sorting a pile of unlabeled photos into albums by similarity.

Pricing

Price per Generation

Per generationFree

API Integration

Use our OpenAI-compatible API to integrate GPT-4o into your application.

Install

npm install railwail

JavaScript / TypeScript

import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("gpt-4o", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("gpt-4o", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("gpt-4o", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);

Specifications

Context window

128,000 tokens

Max output

16,384 tokens

Avg. latency

2.0s

Developer

OpenAI

Deep dive — OpenAI's GPT-4o

About OpenAI

Founded 2015 · San Francisco, USA

OpenAI was founded in December 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever, Wojciech Zaremba and John Schulman as a non-profit AI research lab with a $1B pledged commitment. The lab restructured into a capped-profit company (OpenAI LP) in 2019 to attract capital from Microsoft, which has invested over $13 billion. OpenAI's most-cited papers include 'Improving Language Understanding by Generative Pre-Training' (GPT-1, 2018), 'Language Models are Few-Shot Learners' (GPT-3, 2020), the GPT-4 Technical Report (2023) and the InstructGPT/RLHF paper (2022). The ChatGPT consumer product, launched in November 2022, reached 100M weekly active users faster than any consumer software in history. Product milestones include GPT-3.5, GPT-4 (March 2023), GPT-4 Turbo, GPT-4o (May 2024), the o1/o3 reasoning family (late 2024/2025), GPT-4.1 (April 2025) and the Sora video model. Sam Altman is CEO. The company's last reported valuation in 2025 exceeded $300 billion.

Visit OpenAI →

Architecture

Decoder-only Transformer (natively multimodal)

GPT-4o (the 'o' stands for 'omni') was released in May 2024 as OpenAI's first natively multimodal flagship: a single neural network trained end-to-end on text, audio and images, replacing the earlier cascade of separate ASR, LLM and TTS pipelines used by ChatGPT Voice. Inputs and outputs can be any combination of text, audio and image tokens, and the model emits audio with a median response latency of 320ms for spoken conversation. Architecturally it is a decoder-only Transformer with a unified tokenizer that mixes BPE text tokens, image patches and discretised audio frames. Pretraining used a multi-trillion-token web-scale corpus filtered for quality, plus large image-text pair datasets and licensed audio. Post-training applied RLHF with human raters, model-graded rewards and red-teaming under OpenAI's Preparedness Framework. GPT-4o introduced a new tokenizer with substantially better compression for non-English scripts, reducing token counts by 1.1x-4.4x across languages such as Hindi, Arabic, Korean and Tamil. The context window is 128K tokens with 16K maximum output. Function calling, JSON mode, vision input and Structured Outputs (with strict schema adherence) are first-class. The model card discloses Preparedness Framework evaluations against CBRN, cyber and persuasion risks.

Parameters: Undisclosed (estimated in the hundreds of billions, dense)
Context: 128K tokens

What it can do

Natively multimodal: text, image and audio in and out
Real-time voice conversation with sub-second latency
128K context window with 16K max output
Improved non-English tokenizer (up to 4.4x fewer tokens for some languages)
Vision: charts, diagrams, screenshots, OCR
Function calling and parallel tool calls
Structured Outputs with strict JSON schema
Strong coding performance with file editing via tools
Singing, emotional speech and laughter in audio mode
Multilingual fluency across 50+ languages
Best for: real-time voice agents, multimodal assistants, general-purpose chat, vision tasks.

Training & License

Trained on a multi-trillion-token mixture of web text, code, books, licensed third-party data and large image-text and audio datasets. Knowledge cutoff is October 2023. OpenAI does not disclose exact dataset composition.

License: Proprietary, available via OpenAI API and Azure OpenAI Service. Commercial use allowed under OpenAI Terms of Use.

Known limitations

Can hallucinate citations and factual details
Voice mode sometimes mimics user voice unexpectedly
Knowledge cutoff October 2023 without tools
Performance on hardest reasoning trails o-series models
Audio output is not available in all API regions

Research papers

Frequently asked questions

Related Models

View all Text & Chat

Bio_ClinicalBERT

huggingface

The original Bio_ClinicalBERT from Alsentzer et al., a BERT model initialized from BioBERT and further pretrained on all MIMIC-III clinical notes. Served as a fill-mask endpoint it predicts masked tokens in clinical text and produces clinical embeddings. It is the standard encoder backbone behind many downstream clinical NLP fine-tunes.

€1.00

Biomedical NER (all entities)

huggingface

Token-classification model from d4data that tags 84 biomedical entity types in clinical and medical text, including disease, sign, symptom, medication, dosage, lab value, body part and procedure. Trained on the Maccrobat clinical case corpus on a DistilBERT base, so it runs cheaply for high-volume tagging.

€1.00

Claude Opus 4

Anthropic

Anthropic's most powerful model. Exceptional at complex analysis, agentic tasks, and extended reasoning.

Free

Claude Opus 4.8