-
Describe the bugHi, We were testing out Khepri and noticed that some auto-delete durable queues can become unavailable if there is a network partition. Reproduction steps
Expected behaviorAuto delete queue is deleted on majority partition because it lost the consumer. Additional context |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 8 replies
-
@luos I don't have any logs to work with but very likely this comes down to the fact that
So this is not an issue with the implementation, this is a fundamental incompatibility of non-replicated durable queues and Khepri's fundamental assumptions that stem from Raft (a node in a minority cannot perform any writes). You are welcome to investigate this further to have a more detailed description of what's going on. |
Beta Was this translation helpful? Give feedback.
-
@luos @dumbbell suggests that we have seen this before with exclusive CQs, and it is pending an investigation but currently there are higher priority Khepri improvements. |
Beta Was this translation helpful? Give feedback.
-
I would expect the auto delete queue to stay running until it is able to perform the meta data store update to delete itself. |
Beta Was this translation helpful? Give feedback.
-
After several iterations, I could improve the fix I prepared for a similar issue with exclusive queues. It’s available in #14573. @luos, could you please give it a try and tell me if it is fixed for you too? |
Beta Was this translation helpful? Give feedback.
After several iterations, I could improve the fix I prepared for a similar issue with exclusive queues. It’s available in #14573.
@luos, could you please give it a try and tell me if it is fixed for you too?