Blog — NexToken

Engineering 2026-06-21 · 5 min read

One image_url, 40+ models: multimodal vision passthrough

Send images to vision-capable models through the same OpenAI-compatible endpoint you already use — a backward-compatible content union, a per-provider image-translation layer, an edge vision gate, and image moderation. Zero changes for callers sending plain text.

Read post →

Release Notes 2026-05-12 · 6 min read

May 2026 — 23 shipments, zero breaking changes, 30–90% cache savings

Upstream prompt-cache pass-through, semantic cache, smart router, batch API, content moderation, PII redaction, vision token math fix, 7 new models behind 3 new providers, prompt templates, fine-tune lifecycle, reserved throughput, and Prometheus/Grafana observability — all additive. Existing OpenAI SDK code keeps working without a single line changed.

Read post →

Engineering, pricing, and product notes.

One image_url, 40+ models: multimodal vision passthrough

May 2026 — 23 shipments, zero breaking changes, 30–90% cache savings