Now Live · Smart LLM Routing · BYOK

Use the best model.
Pay for the cheapest.

HiWay2LLM analyzes every request in <1ms and routes it to the optimal model across your own API keys. Simple messages go to cheap models. Complex tasks go to powerful ones. You save 40-60% on typical mixes, at zero markup.

<1ms

Routing latency

0%

Inference markup

0

Prompts stored

5

Providers supported

How it fits together

One thin layer between your app and the models

HiWay2LLM sits between your code and the LLM providers. Your keys. Your data. Our routing intelligence.

Customer chatbot
Autonomous agent
RAG pipeline
CLI / script
1. request
4. response
Routing layer
HiWay2LLM
Smart routing
Picks the cheapest capable model per request.
BYOK vault
Your provider keys, AES-GCM encrypted per workspace.
0% markup
Providers bill you directly. We take nothing on inference.
Guardian
Anti-loop + burn-rate kill-switch before a bad call ships.
Sub-millisecond routing
< 1 ms
2. routed
3. stream
Anthropic · BYOK
OpenAI · BYOK
Google · BYOK
Mistral · BYOK
Groq · BYOK
xAI · BYOK
40-60%
typical savings vs always-flagship
0%
markup on inference — ever
< 1 ms
routing decision latency
10+
providers supported, OpenAI-compatible API

Get started in 3 steps

From signup to your first routed request in under 2 minutes.

1

Sign up

Create an account in 30 seconds. Email + password, free tier activates immediately — 2.5K routed requests/month, no credit card.

2

Add your provider keys

Plug in your own Anthropic, OpenAI, Google, Mistral or DeepSeek API keys. They stay encrypted on our side and you keep billing with your providers directly. Zero markup on inference.

3

Change one line. Ship.

Point your SDK's base_url at HiWay2LLM. One endpoint reaches every model you've enabled, and the router picks the cheapest model that can handle each request. OpenAI-compatible. Works with any SDK.

HIWAY_API_KEY=••••••••••••••

Change one line. Save 50%.

Point your existing code to HiWay2LLM. We handle the rest.

app.py
from openai import OpenAI
- client = OpenAI(base_url="https://api.anthropic.com/v1")
+ client = OpenAI(base_url="https://app.hiway2llm.com/v1")
# That's it. Same code. 50% cheaper.

Light

Haiku 4.5 / GPT-4o-mini / Gemini 2.5 Flash Lite

65% of requests

Standard

Sonnet 4.6 / GPT-4o / Gemini 2.5 Flash

28% of requests

Heavy

Opus 4.7 / GPT-5 / Gemini 2.5 Pro

7% of requests

Not just routing. Intelligence.

7 analyzers, burn-rate alerting, and multi-provider optimization — built for production with your own keys.

< 1ms Smart Routing

7 analyzers detect intent, complexity, tools, and code in under a millisecond. No LLM call for routing — pure CPU.

Control Layer — Anti-drift

Baseline every agent; detect prompt inflation, silent escalations to premium models, and pricing drift. Alerts, rollback, per-agent budgets. Built for CTOs who want total control of their LLM spend.

Burn-rate Alerting

We watch your spend in real time. Burn-rate thresholds, anomaly detection, and per-key alerts fire the moment something looks off — before your monthly bill does.
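The core of a burn-rate threshold can be sketched in a few lines. The semantics here (alert when the current window exceeds a multiple of baseline) are our assumption for illustration, not HiWay2LLM's documented algorithm:

```python
# Burn-rate check sketch: alert when spend in the current window exceeds
# the baseline by a configured multiple. Semantics assumed for illustration.
def burn_rate_alert(window_spend: float, baseline: float,
                    multiple: float = 2.0) -> bool:
    """True when the current window burns `multiple`x faster than baseline."""
    return window_spend > baseline * multiple

burn_rate_alert(45.0, 10.0)  # → True  (4.5x baseline)
burn_rate_alert(15.0, 10.0)  # → False (1.5x baseline)
```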

Advanced Budget Controls

No LLM provider offers this. Set daily/monthly caps, per-model limits, off-hours rules, and automatic degradation.

Usage Reporting

Per-user CSV exports, daily breakdowns by model, token-level cost attribution. Plug it into your invoicing or your accounting in two clicks.

Multi-Provider Optimization

Anthropic, OpenAI, Google, Mistral, DeepSeek — we pick the best price/quality on each request across all the providers you've enabled.

1 Line Integration

Change your base_url. That's it. Compatible with any LLM SDK — OpenAI, Anthropic, LangChain, Vercel AI, n8n.

Zero Prompt Logging

Your prompts never touch our disk. Architectural guarantee. GDPR and EU AI Act compliant.

Open source · MIT

Ship with an SDK. Today.

30-second CLI, OpenAI-compatible Python + TypeScript SDKs. Zero vendor lock-in — you can leave HiWay without touching a line of application code.

Recommended

CLI

One-line install, signup from the terminal, first call without writing code. Perfect for kicking the tires before integrating.

npm i -g @hiway2llm/cli
hw signup
hw chat "explain this in 3 bullets"

Python

Drop-in import. Every method from the OpenAI SDK works — we just route to the right model.

pip install hiway2llm

from hiway2llm import Hiway
cli = Hiway(api_key="hw_live_...")
cli.chat("Say hi")

TypeScript

Native fetch client, works in Node and Edge runtimes (Vercel, Cloudflare Workers).

npm i @hiway2llm/client

import { Hiway } from "@hiway2llm/client";
const h = new Hiway({ apiKey: "hw_live_..." });
await h.chat("Say hi");

Simple plans. Your keys, our brain.

Pay for routing intelligence, not for inference. Inference is billed by your own providers at their published prices — we never touch it.

Free

$0/mo

2.5K routed requests / month

Try it with your own keys. No credit card. Upgrade when you're ready.

Start free

Build

$15/mo

100K routed requests / month

Typical savings 40-60%

For solo devs and side-projects. All routing, all providers, real-time alerts.

Subscribe
Popular

Scale

$39/mo

500K routed requests / month

Typical savings 40-60%

For production workloads. Higher quotas, priority support, advanced controls.

Subscribe

Business

$249/mo

5M routed requests / month

Typical savings 40-60%

For teams shipping at scale. SSO, audit log, per-agent budgets, dedicated Slack.

Subscribe

Compare plans

The full detail. Quotas, advanced features, support tiers, side by side.

Feature
Free · $0/mo
Build · $15/mo
Scale · $39/mo
Business · $249/mo
Enterprise · On request
Usage & limits
Routed requests / month: 2,500 · 100,000 · 500,000 · 5,000,000 · 25M+
Burst limit: 30/min · 60/min · 300/min · 600/min
Overage price: $8/100k · $5/100k · $3/100k · custom
Team seats: 1 · 3 · 10 · 25
Workspaces: 1 · 1 · 3
Analytics retention: 7 days · 30 days · 90 days · 1 year
Routing engine
Smart routing (model=auto)
BYOK provider vault
0% markup on inference
Automatic provider fallback
Guardian anti-loop + kill-switch
Advanced controls
Semantic cache
A/B testing on models
Audit logs
SSO (Google, Microsoft)
PII masking
Self-hosted option
Custom routing rules
Support & compliance
Support channel: Community · Email · Priority · SLA 99.9% · SLA 99.99%
DPA (GDPR)
Dedicated support engineer

Inference is always billed to you directly by the LLM providers on your own keys. Subscription prices above do not include inference.

EVERY PLAN INCLUDES

Smart routing across all your BYOK providers
Burn-rate alerting & anomaly detection
Real-time dashboard, per-key analytics
Multi-tenant support, per-key rate limits
Zero prompt logging (GDPR-ready)
OpenAI-compatible API — works with any SDK

BYOK — you plug in your own Anthropic / OpenAI / Google / Mistral / DeepSeek keys. Inference stays billed by your providers at 0% markup.

How much you'd save

€30 · €100 · €300 · €1,000 · €3,000 · €10,000 · €30,000
€200 /month
Estimated savings (40-60%): €80-€120 /month
Estimated requests: 20,000 /month
Recommended plan: Build (€15/month)
ROI vs subscription: ×6.7
Net monthly gain: +€85
Over 12 months: €1,020

Estimate based on average 40-60% savings on inference, typical mix 65% light / 28% standard / 7% heavy. Varies with your actual workload.
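The calculator's figures can be reproduced in a few lines. Plan price and savings rates come from this page; taking the midpoint of the 40-60% range is our convention for the sketch:

```python
# Reproduce the savings calculator: €200/mo inference spend on Build (€15/mo).
def estimate(monthly_spend: float, sub_price: float = 15.0,
             low: float = 0.40, high: float = 0.60):
    save_lo, save_hi = monthly_spend * low, monthly_spend * high
    midpoint = (save_lo + save_hi) / 2
    net = midpoint - sub_price          # monthly gain after the subscription
    return save_lo, save_hi, round(midpoint / sub_price, 1), net, net * 12

estimate(200)  # → (80.0, 120.0, 6.7, 85.0, 1020.0)
```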

Stop overpaying for
"hello"

Your users send simple messages 70% of the time. Why pay Opus prices for a greeting?

Start free

Compared to OpenRouter, Portkey, LiteLLM

Honest side-by-side. Updated 2026-04-22 against each vendor's public docs.

Feature · HiWay2LLM · OpenRouter · Portkey · LiteLLM · Requesty
Bring your own keys (BYOK)
Smart routing by request complexity
OpenAI-compatible API
Automatic fallback across providers
Prompt caching (Anthropic / OpenAI)
Per-workspace analytics + audit log
Burn-rate alerts (budget spikes)
EU hosting by default (GDPR)
self-host
Zero prompt logging
Pricing model: flat €/mo · % markup · flat + % markup · self-host / SaaS · % markup

native · partial / plugin · not offered. We check these claims against each vendor's public docs — if you spot an inaccuracy, tell us.

Frequently Asked Questions

How does HiWay2LLM reduce my costs?
Most LLM requests don't need the most powerful (and expensive) model. A simple "hello" doesn't need Claude Opus 4.7 at $25/M output tokens — Haiku 4.5 at $5/M handles it perfectly. HiWay2LLM analyzes every request in under 1 millisecond and routes it to the cheapest model in your BYOK roster that can handle it. On typical mixes, customers save 40-60% without changing their code or prompts.
Will the quality of responses decrease?
No. HiWay2LLM only routes simple requests (greetings, short questions, confirmations) to cheaper models. Complex tasks — code generation, multi-step reasoning, agentic tool use — still go to the most powerful models. You can also override routing at any time with the X-Force-Model header if you need a specific model for a request.
How long does it take to integrate?
About 2 minutes. You change one line of code — your base_url. That's it. HiWay2LLM is compatible with any LLM SDK: OpenAI, Anthropic, LangChain, Vercel AI SDK, n8n, curl, and anything that speaks the standard API format. No SDK to install, no config file to maintain.
What LLM providers are supported?
Anthropic (Haiku 4.5, Sonnet 4.6, Opus 4.7), OpenAI (GPT-4o-mini, GPT-4o, GPT-5), Google (Gemini 2.5 Flash Lite, Flash, Pro), Mistral (Small, Large), and DeepSeek (V3, R1). You plug in your own keys for the providers you want to use — HiWay2LLM automatically picks the best price/quality for each request across your enabled set.
Do you store my prompts or responses?
No. Zero prompt logging is a core architectural principle, not just a policy. Your prompts pass through our routing proxy in memory only, are forwarded to the LLM provider, and immediately discarded. No prompt data is ever written to disk. We only store metadata: token counts, model selected, cost, and routing latency.
How does pricing work?
Flat monthly (or annual) subscription for routing intelligence — Free (2.5K req/mo), Build ($15/mo, 100K), Scale ($39/mo, 500K), Business ($249/mo, 5M), Enterprise on request. Inference is billed separately by your LLM providers on your own accounts — HiWay2LLM applies zero markup. You can upgrade, downgrade or cancel any time from the dashboard.
What happens when my costs spike?
HiWay2LLM watches your spend in real time and fires burn-rate alerts when a key, agent or workspace drifts above baseline. You get email + Slack notifications the moment something looks off — before the monthly bill does. You set the thresholds; we surface the signal.
What if HiWay2LLM goes down?
We target 99.9% uptime. If our routing proxy is unavailable, your requests will fail with a clear error (502). We recommend implementing a simple fallback in your code that routes directly to your provider if HiWay2LLM is unreachable. This takes 3 lines of code.
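That fallback can look like this. The direct-provider URL is just an example; any OpenAI-compatible base URL you already have keys for works, and the exact error set you catch is up to you:

```python
# Fallback sketch: try the router first, go direct to a provider on transport
# errors. Endpoint choice and caught exceptions are illustrative assumptions.
PRIMARY = "https://app.hiway2llm.com/v1"
FALLBACK = "https://api.openai.com/v1"  # example direct-provider endpoint

def with_fallback(call):
    """Run `call(base_url)` against the router; retry direct if unreachable."""
    try:
        return call(PRIMARY)
    except (ConnectionError, TimeoutError, OSError):
        return call(FALLBACK)
```

Wrap your SDK call in `with_fallback`, constructing the client with whichever `base_url` it receives.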
Can I force a specific model for certain requests?
Yes. Add the X-Force-Model header to any request to bypass smart routing. For example: X-Force-Model: anthropic/claude-opus-4-7 will always use Opus 4.7 regardless of the complexity score. Useful for critical requests where you always want the best model.
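A forced-model request can be sketched with the standard library alone. The `X-Force-Model` header name comes from this FAQ; the endpoint path and payload shape assume the usual OpenAI-compatible chat format:

```python
# Forced-model request sketch, stdlib only. Header name from the FAQ above;
# payload shape assumes the OpenAI-compatible chat completions format.
import json
import urllib.request

req = urllib.request.Request(
    "https://app.hiway2llm.com/v1/chat/completions",
    data=json.dumps({
        "model": "auto",
        "messages": [{"role": "user", "content": "Review this clause."}],
    }).encode(),
    headers={
        "Authorization": "Bearer hw_live_...",
        "Content-Type": "application/json",
        "X-Force-Model": "anthropic/claude-opus-4-7",  # bypass smart routing
    },
)
# urllib.request.urlopen(req)  # send it once you've set a real key
```

With an SDK, the equivalent is passing the same header through whatever per-request header mechanism your client exposes.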
Is this GDPR compliant?
Yes. We're a French company (Mytm-Group SAS) hosted on EU servers (OVH, France). We don't store personal data beyond your email. We don't store prompts. We comply with GDPR and the EU AI Act. A Data Processing Agreement (DPA) is available for enterprise clients.
How does this compare to OpenRouter?
OpenRouter is a multi-provider API gateway — you manually choose which model to use. HiWay2LLM is a smart router — it automatically picks the best model for each request based on complexity analysis. OpenRouter adds cost (their fee + no routing savings). HiWay2LLM saves cost (routing to cheaper models offsets the flat subscription fee).
Can I self-host HiWay2LLM?
We offer a fully managed SaaS — no infrastructure to maintain. For enterprise clients with specific compliance or data residency requirements, we offer private deployment options. Contact us to discuss.