Question 1

How does HiWay2LLM reduce my costs?

Accepted Answer

Most LLM requests don't need the most powerful (and expensive) model. A simple "hello" doesn't need Claude Opus 4.7 at $25/M output tokens - Haiku 4.5 at $5/M handles it perfectly. HiWay2LLM analyzes every request in under 1 millisecond and routes it to the cheapest model in your BYOK roster that can handle it. On typical mixes, customers save 40-60% without changing their code or prompts.

Question 2

Will the quality of responses decrease?

Accepted Answer

No. HiWay2LLM only routes simple requests (greetings, short questions, confirmations) to cheaper models. Complex tasks - code generation, multi-step reasoning, agentic tool use - still go to the most powerful models. You can also override routing at any time with the X-Force-Model header if you need a specific model for a request.

Question 3

How long does it take to integrate?

Accepted Answer

About 2 minutes. You change one line of code - your base_url. That's it. HiWay2LLM is compatible with any LLM SDK: OpenAI, Anthropic, LangChain, Vercel AI SDK, n8n, curl, and anything that speaks the standard API format. No SDK to install, no config file to maintain.

Question 4

What LLM providers are supported?

Accepted Answer

Anthropic (Haiku 4.5, Sonnet 4.6, Opus 4.7), OpenAI (GPT-4o-mini, GPT-4o, GPT-5), Google (Gemini 2.5 Flash Lite, Flash, Pro), Mistral (Small, Large), and DeepSeek (V3, R1). You plug in your own keys for the providers you want to use - HiWay2LLM automatically picks the best price/quality for each request across your enabled set.

Question 5

Do you store my prompts or responses?

Accepted Answer

No. Zero prompt logging is a core architectural principle, not just a policy. Your prompts pass through our routing proxy in memory only, are forwarded to the LLM provider, and immediately discarded. No prompt data is ever written to disk. We only store metadata: token counts, model selected, cost, and routing latency.

Question 6

How does pricing work?

Accepted Answer

Token packs with three billing modes - Free (2M tokens/mo, no card), Spark ($5.50 once · $5.25/mo · $59.40/yr, 10M tokens), Boost ($25 once · $23.75/mo · $270/yr, 50M tokens), Pro ($85 once · $80.75/mo · $918/yr, 200M tokens), Scale ($360 once · $342/mo · $3,888/yr, 1B tokens), Enterprise on request. Inference is billed separately by your LLM providers on your own accounts - HiWay2LLM applies zero markup. You can switch packs or cancel any time from the dashboard.

Question 7

What happens when my costs spike?

Accepted Answer

HiWay2LLM watches your spend in real time and fires burn-rate alerts when a key, agent or workspace drifts above baseline. You get email + Slack notifications the moment something looks off - before the monthly bill does. You set the thresholds; we surface the signal.

Question 8

What if HiWay2LLM goes down?

Accepted Answer

We target 99.9% uptime. If our routing proxy is unavailable, your requests will fail with a clear error (502). We recommend implementing a simple fallback in your code that routes directly to your provider if HiWay2LLM is unreachable. This takes 3 lines of code.

Question 9

Can I force a specific model for certain requests?

Accepted Answer

Yes. Add the X-Force-Model header to any request to bypass smart routing. For example: X-Force-Model: anthropic/claude-opus-4-7 will always use Opus 4.7 regardless of the complexity score. Useful for critical requests where you always want the best model.

Question 10

Is this GDPR compliant?

Accepted Answer

Yes. We're a French company (Hiway2llm.com) hosted on EU servers (OVH, France). We don't store personal data beyond your email. We don't store prompts. We comply with GDPR and the EU AI Act. A Data Processing Agreement (DPA) is available for enterprise clients.

Question 11

How does this compare to OpenRouter?

Accepted Answer

OpenRouter is a multi-provider API gateway - you manually choose which model to use. HiWay2LLM is a smart router - it automatically picks the best model for each request based on complexity analysis. OpenRouter adds cost (their fee + no routing savings). HiWay2LLM saves cost (routing to cheaper models offsets the flat subscription fee).

Question 12

Can I self-host HiWay2LLM?

Accepted Answer

We offer a fully managed SaaS - no infrastructure to maintain. For enterprise clients with specific compliance or data residency requirements, we offer private deployment options. Contact us to discuss.

Fonctionnalité	FreeRoutage de base · 10M/mois	ScaleMarkup 12,5 → 10%	EnterpriseSur devis
USAGE & QUOTAS
Tokens inclus	par pack acheté	1B / achat	custom
Auto-reload
Sièges équipe	3	25	∞
Workspaces	1	5	∞
Conservation analytics	30j	1 an	∞
MOTEUR DE ROUTAGE
Smart routing (model=auto)
BYOK fournisseurs
0 % marge sur l'inférence
Fallback automatique
Guardian anti-loop
CORTEX alertes Inbox
CONTRÔLES AVANCÉS
Cache sémantique
A/B testing modèles
Journal d'audit
CORTEX complet (5 phases)
SSO (Google, Microsoft)
Masquage PII
Self-hosted
Règles routage custom
SUPPORT & CONFORMITÉ
Canal de support	Email	Priority	SLA 99.99%
DPA (RGPD)
Financement disponible
Ingénieur dédié

Feature	HiWay2LLM	OpenRouter	Portkey	LiteLLM	Requesty
Bring your own keys (BYOK)
Smart routing by request complexity
OpenAI-compatible API
Automatic fallback across providers
Prompt caching (Anthropic / OpenAI)
Per-workspace analytics + audit log
Burn-rate alerts (budget spikes)
EU hosting by default (GDPR)				self-host
Zero prompt logging
AI self-management (CORTEX)
Pricing model	flat €/mo	% markup	flat + % markup	self-host / SaaS	% markup

Use the best model.
Pay for the cheapest.

One thin layer between your app and the models

Get started in 3 steps

Sign up

Add your provider keys

Change one line. Ship.

Change one line. Save 50%.

Light

Standard

Heavy

Not just routing. Intelligence.

< 1ms Smart Routing

Control Layer - Anti-drift

Burn-rate Alerting

Advanced Budget Controls

Usage Reporting

200+ Models, Every Modality

1 Line Integration

Zero Prompt Logging

CORTEX AI Orchestrator

Prompt security built in.

Ship with an SDK. Today.

CLI

Python

TypeScript

Simple plans. Your keys, our brain.

Estimez votre économie réelle

Ce qui est inclus

Stop overpaying for
"bonjour"

Compared to OpenRouter, Portkey, LiteLLM

Frequently Asked Questions

Use the best model.Pay for the cheapest.