Model Catalog · Live Pricing

40+ Models.
One API Key.

Every price shown is what you actually pay — NexToken's 20% routing margin already included. Switch your billing tier below to see discounted rates.

Billing tier
41
Public models
12
Providers
2
NexToken-native
SG
Self-hosted region

NexToken Native Models

Proprietary · Singapore GPU
★ Default
nex-pro
Chat, code, content, summarisation. Self-hosted on NexToken's Singapore GPU. Strong Chinese + English. OpenAI tool-calling compatible.
$0.12
/ 1M input tokens
$0.48
/ 1M output tokens
chat tools stream 32K ctx ~96% cheaper than GPT-4o
Embeddings
nex-embed-zh
Chinese-strong embeddings. BGE-M3, 1024-dim vectors. Drop-in replacement for text-embedding-3-small — at a fraction of the cost.
$0.012
/ 1M tokens
no output tokens
/v1/embeddings 512 ctx ~33% cheaper than text-embedding-3-small
Showing all 41 models
Sort by
Model Input / 1M Output / 1M Context Capabilities
How pricing works: All prices shown are retail — what you pay. NexToken charges provider cost × 1.20 (20% routing margin) at the Developer tier. Pro, Business, and Enterprise customers receive volume discounts — use the tier switcher above to see your rate. Pricing tiers are based on monthly token spend and reset on the 1st of each month. Loyalty bonuses (+3–12% wallet credits) are separate and apply at top-up time. Full pricing breakdown →