In some tool-call or complex agent scenarios, the custom generate function is CPU-heavy. It would be better to share that CPU load across nodes.
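As a rough illustration of the idea, here is a minimal sketch in which the CPU-bound part of a custom generate function is handed to a pluggable executor instead of running inline. The names `cpu_heavy_step`, `custom_generate`, and `run_batch` are hypothetical and not taken from any existing API. The sketch uses only the Python standard library and assumes the generate function is async.

```python
import asyncio
from concurrent.futures import Executor


def cpu_heavy_step(response: str) -> int:
    # Hypothetical stand-in for CPU-bound work such as tool execution or parsing.
    return sum(ord(ch) for ch in response)


async def custom_generate(prompt: str, executor: Executor) -> int:
    # Hypothetical custom generate function: the CPU-bound step runs on the
    # supplied executor, so the event loop stays free for other requests.
    loop = asyncio.get_running_loop()
    return await loop.run_in_executor(executor, cpu_heavy_step, prompt)


async def run_batch(prompts: list[str], executor: Executor) -> list[int]:
    # Fan out one task per prompt; gather returns results in input order.
    return await asyncio.gather(*(custom_generate(p, executor) for p in prompts))
```

Passing a `concurrent.futures.ProcessPoolExecutor` would spread the work across the cores of one machine. Spreading it across nodes would need an executor backed by a cluster scheduler, which is assumed here rather than shown.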