You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Someone has reported the following scenario that can be reproduced every few runs and looks very similar to #7743 but with a more recent version (e.g. 3.11.18).
The steps to reproduce are:
Cluster three nodes, add some stream SAC consumers
Restart RabbitMQ using a K8S rollout restart (so pods are restarted one by one)
Observe no active SAC consumer after the cluster restart (evidence is collected using rabbitmqctl list_stream_consumers)
Active consumer is not picked until a restart of the consumer applications
With 3.11.2 (which does not include #7743) it can be reproduced every 2-3 runs.
With 3.11.18 (which does include #7743), it takes 10 to 15 attempts but the issue still
can be reproduced.
Consumer setup code
Using RabbitMQ Java Stream client 0.10.0:
ConsumerBuildercreateConsumerBuilder(StringsteadyStream) {
returnrabbitMQStreamEnvironment.consumerBuilder()
.name(SERVICE_NAME)
.stream(steadyStream)
.singleActiveConsumer()
// use OffsetSpecification.next() for all cases to start consuming where it left
.offset(OffsetSpecification.next())
// use manualTrackingStrategy() because we want to commit offset if certain conditions are met
.manualTrackingStrategy()
.checkInterval(Duration.ofSeconds(rabbitMQProperties.getManualTrackingStrategyInterval()))
.builder();
}
Logged Exception
[warning] <0.1328.0> rabbit_stream_coordinator: failed to stop member [redacted]-publisher-prd_1680603078129697429 'rabbit@[redacted node 1]' Error: {{nodedown,'rabbit@[redacted node 1]'},{gen_server,call,[{osiris_server_sup,'rabbit@[redacted node 1]'},{terminate_child, …
Environment details
I cannot publish a collect-env tarball publicly but it will be available for the core team to inspect.
The text was updated successfully, but these errors were encountered:
It turns out, #7743 introduced a feature flag, stream_sac_coordinator_unblock_group , that is disabled in this environment. Which means the change in #7743 is only partially applied at best.
Once the flag is enabled, the reporter will conduct a new round of tests.
Someone has reported the following scenario that can be reproduced every few runs and looks very similar to #7743 but with a more recent version (e.g.
3.11.18
).The steps to reproduce are:
rabbitmqctl list_stream_consumers
)With
3.11.2
(which does not include #7743) it can be reproduced every 2-3 runs.With
3.11.18
(which does include #7743), it takes 10 to 15 attempts but the issue stillcan be reproduced.
Consumer setup code
Using RabbitMQ Java Stream client 0.10.0:
Logged Exception
Environment details
I cannot publish a
collect-env
tarball publicly but it will be available for the core team to inspect.The text was updated successfully, but these errors were encountered: