Reka Core
Reka's frontier multimodal model supporting text, image, video and audio inputs.
Reka Core is multimodal AI model from Custom, priced at €10.00 per 1M input tokens with a 128K tokens context window.
0.7
Pricing
API Integration
Use our OpenAI-compatible API to integrate Reka Core into your application.
npm install railwailimport railwail from "railwail";
const rw = railwail("YOUR_API_KEY");
// Simple — just pass a string
const reply = await rw.run("reka-core", "Hello! What can you do?");
console.log(reply);
// With message history
const reply2 = await rw.run("reka-core", [
{ role: "system", content: "You are a helpful assistant." },
{ role: "user", content: "Explain quantum computing simply." },
]);
console.log(reply2);
// Full response with usage info
const res = await rw.chat("reka-core", [
{ role: "user", content: "Hello!" },
], { temperature: 0.7, max_tokens: 500 });
console.log(res.choices[0].message.content);
console.log(res.usage);Deep dive — Reka AI's Reka Core
Reka AI was founded in mid-2022 by Dani Yogatama (CEO, ex-Google DeepMind, ex-Facebook), Yi Tay (Chief Scientist, ex-Google Brain), Donovan Ong (ex-Apple) and Qi Liu, with engineering teams in San Francisco, London and Singapore. The founders had previously worked on large-scale language models, mixture-of-experts and efficient long-context architectures inside DeepMind and Brain. Reka raised a $58M Series A in 2023 led by DST Global Partners and reportedly closed a Series B at a near-unicorn valuation in 2024. The Reka model line is intentionally multimodal-first and includes Edge (small), Flash (mid) and Core (flagship), all trained jointly on text, image, video and audio. Reka Core launched in April 2024 with the public Reka Core technical report, positioning it as the first multimodal-from-the-ground-up frontier model from a startup outside the FAANG/DeepMind orbit.
Visit Reka AI →Reka Core is the flagship of Reka's multimodal frontier family. According to the publicly released Reka Core technical report, the model is a decoder-only Transformer trained from scratch on a multimodal corpus comprising text, code, images, video frames and audio clips, with separate modality encoders that project into a shared token embedding space. Training was carried out on a custom cluster using Pathways-style infrastructure and a curriculum that first pretrains on text and code, then progressively introduces images, audio and video. The context window is 128K tokens, video input is supported up to several minutes, and audio is processed via a learned audio encoder (described as ImageBind-style on the audio side). The training compute is reported as roughly an order of magnitude less than GPT-4 while reaching competitive scores on MMMU, Perception Test and VideoMME. Core is offered exclusively through the Reka API, Reka Playground and Snowflake Cortex; weights are not released.
- Parameters
- Undisclosed (described as 'flagship', estimated tens of billions)
- Context
- 128K tokens
- True multimodal input: text, image, video and audio in the same context
- 128K-token context window
- Video understanding up to several minutes
- Audio understanding for transcription and sound classification
- Function calling, JSON output and tool use
- Available on Reka API, Reka Playground and Snowflake Cortex
- Multilingual coverage across 32+ languages
- Best for: multimodal assistants, video QA, audio-grounded chat, frontier features at startup pricing
Decoder-only multimodal training over a curated corpus of text, code, images, video frames and audio with progressive curriculum. Compute reported as roughly an order of magnitude less than GPT-4.
License: Proprietary commercial API. Generated outputs may be used commercially under the Reka terms.
Known limitations
- Closed weights, hosted only
- Quality below GPT-4o and Claude 3.5 Sonnet on hardest text reasoning
- Smaller ecosystem and tooling than OpenAI / Anthropic
- Video processing latency higher than competitors
- No fine-tuning available to external customers
Frequently asked questions
Related Models
View all MultimodalClaude Opus 4.7
Anthropic's April 2026 flagship. 87.6% on SWE-bench Verified, 3x higher image resolution, output self-verification, vision + reasoning.
Claude Sonnet 4.6
Anthropic's balanced mid-tier model from February 2026. Best price/performance for production workloads: 5x cheaper than Opus, near-flagship quality.
Depth Anything v2
Monocular depth-estimation model trained on 595k labeled and 62M unlabeled images. Strong zero-shot generalization in indoor and outdoor scenes.
GPT-5.4
OpenAI's unified flagship combining GPT and o-series reasoning into one model. 1M context, multimodal, top SWE-Bench Pro and OSWorld scores.
Start using Reka Core today
Get started with free credits. No credit card required. Access Reka Core and 100+ other models through a single API.