minikube flakes #21931

edsantiago · 2024-03-04T12:13:24Z

No useful diagnostics:

# X Exiting due to RUNTIME_ENABLE: Failed to enable container runtime: sudo systemctl restart cri-docker.socket: Process exited with status 1

Smells like the quay flakiness, but there's nothing to go on. Tests should probably be instrumented to run journalctl, minikube logs, and anything else that could give a user some hints.

fedora-39 : minikube podman fedora-39 rootless host sqlite
- PR fix(deps): update module github.com/stretchr/testify to v1.9.0 #21908
  - 03-01 13:52 in [minikube] minikube - check cluster is up
- PR Use machine image as specified in containers.conf #21862
  - 02-29 14:12 in [minikube] [001] minikube - deploy generated container yaml to minikube
- PR Make inspect compatible with docker v1.44 API #21601
  - 02-28 14:12 in [minikube] minikube - check cluster is up

x	x	x	x	x	x
minikube(3)	podman(3)	fedora-39(3)	rootless(3)	host(3)	sqlite(3)

The text was updated successfully, but these errors were encountered:

afbjorklund · 2024-03-04T13:15:09Z

Weird that already the cri-docker.socket fails, you would think it would wait until cri-docker.service

https://github.com/Mirantis/cri-dockerd/tree/master/packaging/systemd

New run_minikube() helper, modeled after run_podman(). Echoes each command being run and its output. On failure, runs minikube logs. Addresses (does not close) containers#21931 which is hitting us hard in CI. Probably quay flakes, but it's impossible to tell without logs. Also: bug fix: one "run podman" fixed to run_podman Signed-off-by: Ed Santiago <[email protected]>

edsantiago · 2024-04-02T19:32:16Z

Caught one:

<+010ms> # $ minikube kubectl -- apply -f /tmp/minikube_deploy_SEeITt.yaml
<+593ms> # pod/test-ctr-pod created
         #
<+023ms> # $ minikube kubectl get pods
<+266ms> # NAME           READY   STATUS              RESTARTS   AGE
         # test-ctr-pod   0/1     ContainerCreating   0          0s
....
<+1.03s> # $ minikube kubectl get pods
<+232ms> # NAME           READY   STATUS         RESTARTS   AGE
         # test-ctr-pod   0/1     ErrImagePull   0          18s        <<<<<<<<<<<<<--------------------------------
....
<+1.03s> # $ minikube kubectl get pods
<+265ms> # NAME           READY   STATUS             RESTARTS   AGE
         # test-ctr-pod   0/1     ImagePullBackOff   0          30s       <<<<<<<<<<<<------------------
....
         # #/vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
         # #| FAIL: Timed out waiting for pod to move to 'Running' state
         # #\^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

"ErrImagePull" smells to me like quay flake. Anyone know for sure?

cevich · 2024-05-06T17:33:58Z

I believe I hit this using a "new" F40 image (still under test) or possibly a new flake?

https://api.cirrus-ci.com/v1/artifact/task/5131719219609600/html/minikube-podman-fedora-40-rootless-host-sqlite.log.html

The output seems similar to a previous (above) hit on:

You are using the QEMU driver without a dedicated network, which doesn't support minikube service&minikube tunnel commands.

I don't think that test should be trying to use QEMU, but maybe that's a red herring? In any case, I re-ran the task and it passed.

edsantiago · 2024-05-15T12:52:07Z

This is starting to compete with #22551 for the Most Annoying Flake award.

fedora-39 : minikube podman fedora-39 rootless host sqlite
- PR [v5.0] fix swagger doc #22389
  - 04-16-2024 06:44 in [minikube] minikube - check cluster is up
- PR Fix some comments #22370
  - 04-15-2024 14:03 in [minikube] [001] minikube - deploy generated container yaml to minikube
- PR [v5.0] libpod: don't warn about cgroupsv1 on FreeBSD #22171
  - 03-26-2024 08:38 in [minikube] minikube - check cluster is up
- PR hyperv: fix machine rm -r #22140
  - 03-22-2024 14:07 in [minikube] [001] minikube - deploy generated container yaml to minikube
  - 03-22-2024 14:05 in [minikube] [001] minikube - deploy generated container yaml to minikube
- PR Bump CI VMs: new pasta, crun, rawhide kernel #22082
  - 03-18-2024 22:04 in [minikube] [001] minikube - deploy generated container yaml to minikube
- PR logformatter: handle Windows logs #22081
  - 03-18-2024 17:57 in [minikube] minikube - check cluster is up
- PR fix invalid HTTP header values when hijacking a connection #21979
  - 03-12-2024 18:35 in [minikube] minikube - check cluster is up
- PR machine: make more use of strongunits #21960
  - 03-06-2024 10:43 in [minikube] [001] minikube - deploy generated container yaml to minikube
- PR fix(deps): update module github.com/stretchr/testify to v1.9.0 #21908
  - 03-01-2024 13:52 in [minikube] minikube - check cluster is up
- PR Use machine image as specified in containers.conf #21862
  - 02-29-2024 14:12 in [minikube] [001] minikube - deploy generated container yaml to minikube
- PR Make inspect compatible with docker v1.44 API #21601
  - 02-28-2024 14:12 in [minikube] minikube - check cluster is up
fedora-40 : minikube podman fedora-40 rootless host sqlite
- PR Return StatusNotFound when multiple volumes matching occurs #22715
  - 05-15 07:03 in [minikube] minikube - check cluster is up
- PR Fix podman-remote support for podman farm build #22673
  - 05-11 14:48 in [minikube] minikube - check cluster is up
- PR Revert "container stop: kill conmon" #22662
  - 05-13 04:19 in [minikube] minikube - check cluster is up
  - 05-10 13:32 in [minikube] minikube - check cluster is up
  - 05-09 17:24 in [minikube] minikube - check cluster is up
- PR libpod: wait for healthy on main thread #22658
  - 05-13 16:43 in [minikube] minikube - check cluster is up
  - 05-13 16:03 in [minikube] minikube - check cluster is up
- PR Update CI VMs to F40, F39, D13 #22549
  - 05-06 16:01 in [minikube] minikube - check cluster is up

x	x	x	x	x	x
minikube(20)	podman(20)	fedora-39(12)	rootless(20)	host(20)	sqlite(20)
		fedora-40(8)

cevich · 2024-05-21T18:19:04Z

Maybe worth asking Urvashi to take a look? IIRC she wrote these tests, and might have a quick/easy answer.

cevich · 2024-07-09T18:35:38Z

FWIW, I attempted to reproduce this in a hack/get_ci_vm.sh environment. Painstakingly copy-pasting commands in the code-path one-by-one. This worked perfectly fine for minikube - check cluster is up and minikube - deploy generated container yaml to minikube. I was hoping to get lucky and it would reproduce for me given how seemingly often it breaks 😢 So I'm giving up.

Ref: #23237

edsantiago added the flakes Flakes from Continuous Integration label Mar 4, 2024

edsantiago mentioned this issue Mar 19, 2024

minikube: instrument tests, to allow debugging failures #22089

Merged

cevich mentioned this issue Jul 9, 2024

Drop minikube CI test #23237

Merged

openshift-merge-bot bot closed this as completed in #23237 Jul 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

minikube flakes #21931

minikube flakes #21931

edsantiago commented Mar 4, 2024

afbjorklund commented Mar 4, 2024 •

edited

Loading

edsantiago commented Apr 2, 2024

cevich commented May 6, 2024

edsantiago commented May 15, 2024

cevich commented May 21, 2024

cevich commented Jul 9, 2024

minikube flakes #21931

minikube flakes #21931

Comments

edsantiago commented Mar 4, 2024

afbjorklund commented Mar 4, 2024 • edited Loading

edsantiago commented Apr 2, 2024

cevich commented May 6, 2024

edsantiago commented May 15, 2024

cevich commented May 21, 2024

cevich commented Jul 9, 2024

afbjorklund commented Mar 4, 2024 •

edited

Loading