We need a router sidecar that queries telemetry from each vLLM instance, determines which instance has the least load, and sends more prompts there.
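A minimal sketch of the selection step follows, under stated assumptions. It assumes each vLLM instance serves Prometheus-format text at `/metrics`, and that "load" can be approximated by the sum of the `vllm:num_requests_running` and `vllm:num_requests_waiting` gauges. Metric names can differ between vLLM versions, so check them against the actual output. The helper names (`parse_load`, `fetch_metrics`, `pick_least_loaded`) and the backend URLs are hypothetical.

```python
import urllib.request

# Assumed metric names; verify against your vLLM version's /metrics output.
LOAD_METRICS = ("vllm:num_requests_running", "vllm:num_requests_waiting")


def parse_load(metrics_text):
    """Sum the assumed load gauges from Prometheus text-format output."""
    load = 0.0
    for line in metrics_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        if "}" in line:
            # Labelled sample: name{label="value"} 3.0
            head, _, rest = line.rpartition("}")
            name = head.split("{", 1)[0]
        else:
            # Unlabelled sample: name 3.0
            name, _, rest = line.partition(" ")
        fields = rest.split()
        if name in LOAD_METRICS and fields:
            load += float(fields[0])
    return load


def fetch_metrics(base_url, timeout=1.0):
    """Fetch the raw metrics text from one backend."""
    with urllib.request.urlopen(f"{base_url}/metrics", timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def pick_least_loaded(backends, fetch=fetch_metrics):
    """Return the backend URL with the lowest load, or None if none respond."""
    best_url, best_load = None, float("inf")
    for url in backends:
        try:
            load = parse_load(fetch(url))
        except (OSError, ValueError):
            continue  # skip unreachable or unparsable backends
        if load < best_load:
            best_url, best_load = url, load
    return best_url
```

The sidecar would call `pick_least_loaded(["http://vllm-0:8000", "http://vllm-1:8000"])` before forwarding each prompt. In practice it would likely poll on a timer and cache the result, since scraping every backend per request adds latency.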