OrangeRouter

LLM-native routing

OrangeRouter — LLM-native routing layer

A unified entry point for multi-model backends, with routing decisions made by LLM inference.

OrangeRouter is a lightweight piece of infrastructure: one external entry point, several model backends behind it, and an LLM in the middle that completes each routing decision before the request reaches a backend cluster — it understands the request first, then decides where it should go.

The aim is to let model capability, request semantics, and business policy jointly drive routing, rather than relying on static rule tables that are hard to maintain. This site covers the problem background, the technical design, and the tradeoffs behind it.

Design principles

03 / principles

The request path

conceptual · not final

// client → router → routing target

Client SDK / app
OrangeRouter LLM decides
Backend A · large model
Backend B · lightweight model
Backend C · self-hosted

Not the same as…

03 / distinctions

Routing should understand the data. OrangeRouter places the routing decision in a single LLM inference, so semantics and policy drive the outcome. The motivation is on the background page; the implementation on the architecture page.