GPT-4.1

New
Popular
OpenAI
Text & Chat

OpenAI's newest flagship model. Improved reasoning, instruction following, and coding over GPT-4o.

Examples

See what GPT-4.1 can generate

System Design

Design a URL shortener service. What are the key components and how would you handle 10 million requests per day?

A URL shortener at 10M req/day needs: 1) An API layer behind a load balancer to accept long URLs and return short codes. 2) A base62 encoding scheme (a-z, A-Z, 0-9) generating 6-character codes, yielding 56 billion unique URLs. 3) A distributed key-value store like Redis for fast lookups with a persistent database (PostgreSQL) as the source of truth. 4) A caching layer (CDN + Redis) since reads vastly outnumber writes. 5) Rate limiting per IP/API key to prevent abuse. At ~115 req/sec average, a few application servers with horizontal scaling can handle the load comfortably.
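
For illustration, here is a minimal sketch of the base62 step described above: encoding a numeric database ID into a short code and back. The function names and the 6-character padding are illustrative assumptions, not part of the railwail API.

// Base62 sketch: map a numeric ID to a short code and back (TypeScript).
const ALPHABET = "0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ";

function encodeBase62(id: number, minLength = 6): string {
  let code = "";
  do {
    code = ALPHABET[id % 62] + code;
    id = Math.floor(id / 62);
  } while (id > 0);
  // Pad to a fixed width; 62^6 ≈ 56.8 billion distinct 6-character codes.
  return code.padStart(minLength, ALPHABET[0]);
}

function decodeBase62(code: string): number {
  return [...code].reduce((acc, ch) => acc * 62 + ALPHABET.indexOf(ch), 0);
}

console.log(encodeBase62(123456789));               // "08m0Kx"
console.log(decodeBase62(encodeBase62(123456789))); // 123456789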

Instruction Following

Write exactly 3 sentences about black holes. Each sentence must contain exactly 10 words. Do not use the word 'space'.

Black holes form when massive stars collapse under their gravity. Light cannot escape the intense gravitational pull that they create. Scientists study these mysterious objects using powerful telescopes and math.
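
As a quick sanity check, the prompt's constraints can be verified programmatically; the snippet below is illustrative only and not part of any railwail tooling.

// Verify: exactly 3 sentences, exactly 10 words each, and no use of the word "space".
const output =
  "Black holes form when massive stars collapse under their gravity. " +
  "Light cannot escape the intense gravitational pull that they create. " +
  "Scientists study these mysterious objects using powerful telescopes and math.";

const sentences = output.match(/[^.!?]+[.!?]/g) ?? [];
const wordCounts = sentences.map((s) => s.trim().split(/\s+/).length);

console.log(sentences.length === 3);                    // true
console.log(wordCounts.every((count) => count === 10)); // true
console.log(!/\bspace\b/i.test(output));                // true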


Pricing

Price per Generation
Per generation: Free

API Integration

Use our OpenAI-compatible API to integrate GPT-4.1 into your application.

Install
npm install railwail
JavaScript / TypeScript
import railwail from "railwail";

const rw = railwail("YOUR_API_KEY");

// Simple — just pass a string
const reply = await rw.run("gpt-4-1", "Hello! What can you do?");
console.log(reply);

// With message history
const reply2 = await rw.run("gpt-4-1", [
  { role: "system", content: "You are a helpful assistant." },
  { role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);

// Full response with usage info
const res = await rw.chat("gpt-4-1", [
  { role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);
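
Because the API is OpenAI-compatible, you should also be able to point the official OpenAI SDK at it. The base URL below is a placeholder assumption; use the endpoint shown in your dashboard.

// Using the official OpenAI SDK against the OpenAI-compatible endpoint.
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: "YOUR_API_KEY",
  baseURL: "https://api.railwail.example/v1", // placeholder endpoint
});

const res2 = await client.chat.completions.create({
  model: "gpt-4-1",
  messages: [{ role: "user", content: "Hello!" }],
  temperature: 0.7,
  max_tokens: 500,
});

console.log(res2.choices[0].message.content);
console.log(res2.usage);
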
Specifications
Context window: 1,000,000 tokens
Max output: 32,768 tokens
Avg. latency: 2.5s
Provider: OpenAI
Category: Text & Chat
Tags: popular, coding, reasoning

Start using GPT-4.1 today

Get started with free credits. No credit card required. Access GPT-4.1 and 100+ other models through a single API.