Replies: 5 comments 4 replies
-
@jmaerki This is incredible feedback, thanks for the write up!! |
Beta Was this translation helpful? Give feedback.
-
Regarding the fallback to PXE you might need a workaround telling 'config_url: ""' so it doesn't try to look into pxe again ... it's a workaround and a fix is coming |
Beta Was this translation helpful? Give feedback.
-
I agree that the users on different levels of the configuration is confusing. Being able to have them at the different stages is clear, but not needed if you're just getting started or don't need that level of detail. On the other hand, defining at the root of the config is simpler, but as soon as I know about stages makes me wonder at which stage those different root configurations are applied. Maybe we can do a better job at adding documentation about all this, and have a better way to walk the reader through docs, depending on their level of experience. |
Beta Was this translation helpful? Give feedback.
-
|
Beta Was this translation helpful? Give feedback.
-
Thanks for all your feedback so far. I've taken a few next steps. One thing I've abandoned for now is building a Debian Bullseye core image. I had to cut my time when I got to the step where I wanted to build a provider image with k3s based on the new core image. I guess I'd have to dive a lot deeper into that. I wondered whether there were plans to actually replace the provider images by a normal bundle. Some evidence of that can be found in the bundle repo but it looks like this is not actively pursued. |
Beta Was this translation helpful? Give feedback.
-
Hey there,
when I first stumbled over Kairos a short while ago, I thought: exactly what I'm looking for. Since I've hit a few road bumps trying it out (over about 2 days), I thought I'd share that...
Some background on where I come from: I'm running multiple k3s clusters for myself and others and I'm about to help a client bring up a more modern infrastructure but I have to keep things as simple as possible, so Kairos looks like a good candidate to help with just that. I'm also looking for a good way to set up edge clusters "as-a-box" for environments with no k8s knowledge. In my experiments, I have a solid iPXE setup that I use in my network to PXE boot all sorts of bare metal and VMs (on Proxmox and VirtualBox).
First of all, what cost me quite some time was the fact that I managed to overlook the "Note" on the "Getting Started" page. When I had the first installs working, I was wondering where my k3s was. I'm not sure how to better document that for guys like me who lose themselves in details and manage to miss the bigger points. I have no defense for my failure.
I haven't progressed to custom images, yet, so my installs always seem to load the images from the internet. Even though I have a 1 GBit/s internet line, the processes seemed rather slow compared to booting a basic Debian install. The boot log doesn't really inform that much over boot/copy progress. On my slower test machines, it could take over 2 minutes to have something show up indicating that booting/installing is still running properly. Also, machine shutdowns after a fresh install from the k3s provider images seems very slow. I had to learn to trust the process and be patient.
I tried different images. I run my own k3s nodes mostly on Debian Bullseye nowadays, so I wanted to try that. I started out on Kairos 1.5 and had some cases where progress really just stopped (hang) which made me look towards alpine-ubuntu and opensuse-leap. After finding 1.6.1 things got better. But after finding out that the Debian variant seems to be on "testing" repos, I started looking towards ubuntu-lts-22 which I ended up going for with my latest tests. With this I'm now able to reproducibly PXE-install k3s 1.24.10 nodes on VirtualBox and Proxmox. However, a bare-metal install via PXE boot on an HP t520 thin client containing a 256GB NGFF SATA SSD reproducibly yields a non-bootable system (falling back to PXE booting). Just to be sure that the t520 box isn't the problem, I PXE booted Debian Bullseye installer and ran a full manual install just to verify I can boot the machine after the installation. After that I switched to the ubuntu-lts-22 (kairos-ubuntu-22-lts-v1.6.1-k3sv1.24.10+k3s1) again and ended up with an unbootable installation. Not sure how to proceed here, except maybe to try opensuse or another image again to compare.
On the VM side, I ran into some trouble that might have to do with APIC. I had to set up VMs on both VirtualBox and Proxmox with a single core and I/O APIC disabled to install. After installation, I was able to up the core count >1.
In another experiment I tried a manual install like this:
I didn't go further there. In cases like this, it would have been useful to have ping and nslookup available from the core image. I went back to PXE booting after that.
Another slightly confusing thing is finding different syntaxes for setting up users. There's the cloud-init native root-level users tag. And the differently structured users tag under /stages/network also took some trial & error to get working. I still need to clean up there, but I can SSH into the machine with a key.
Next steps for me now: trying AuroraBoot on ESXi in the client's environment where there is no PXE boot infrastructure, yet. I will probably also need to try building my own images to get a non-US keyboard mapping, NTP, some additional packages and stuff. I would have loved to have my...
...in the cloud-init doing something. But I guess I need to solve that differently. Also the P2P feature looks really interesting.
So I guess I'm still in the learning curve and I'm missing some bits and pieces to get everything solved quickly. But Kairos looks promising and I like most of what I'm seeing so far. I guess having so many permutations of potential features and environments makes it very hard to get things flying. The documentation is impressive but with so many little details, one has to be on top of their game to read it carefully. I hope my writing this down was half way useful or at least interesting.
Beta Was this translation helpful? Give feedback.
All reactions