-
Notifications
You must be signed in to change notification settings - Fork 40.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whether to support the session persistence function at the pod level #116496
Comments
There are no sig labels on this issue. Please add an appropriate label by using one of the following commands:
Please see the group list for a listing of the SIGs, working groups, and committees available. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This issue is currently awaiting triage. If a SIG or subproject determines this is a relevant issue, they will accept it by applying the The Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
It seems that Istio ConsistentHashLB is just what you are looking for. |
Relevant to kubernetes-sigs/gateway-api#1619 |
The Kubernetes project currently lacks enough contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale |
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues. This bot triages un-triaged issues according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle rotten |
/close please reopen if needed. |
@neolit123: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
What would you like to be added?
We are using k8s for cluster management of multiple GPU servers (multiple GPU cards), and the same AI model will be deployed on each GPU machine and provide services. However, a complete use of some AI services requires multiple http requests to complete. We hope that these HTTP requests all run on the same pod, because the data needs to be loaded into the GPU, otherwise other GPU servers or received requests will lack data and the call will fail.
In addition to using ClientIP for session persistence in sessionAffinity, does it support custom session persistence functions, such as defining a group of HTTP requests that need to be forwarded to the same pod?
Why is this needed?
Customize http request distribution
The text was updated successfully, but these errors were encountered: