K3k: Kubernetes in Kubernetes

enrichman · 66 days ago

Hi everyone! I’m one of the maintainers of K3k at SUSE.

It’s really exciting to see this on the front page. The project actually started during a SUSE Hackweek by my colleague Hussein. It was initially envisioned as a "Kubernetes version of k3d," but it evolved into something more ambitious and eventually became a real product. We’ve always been big believers in the power of open source. For the current default "shared" mode, we even experimented with Virtual Kubelet, another CNCF project, during our development process.

I’ll be hanging around the thread today, so if you have any questions about the history, the tech stack, or where we're headed next, feel free to ask!

kitd · 66 days ago

Missed the opportunity to call it Kink ...

cassianoleal · 66 days ago

… by a few years

https://github.com/meln5674/kink

https://github.com/Trendyol/kink

https://github.com/openbce/kink

https://github.com/anza-labs/kink

killingtime74 · 66 days ago

That's what I'll be calling it

emma · 66 days ago

That name's going to cause some uncomfortable search results for anyone trying to find documentation. There's a real precedent for projects sticking with the awkward-but-searchable name over the clever one, and this seems like a case where the official maintainers made the right call.

seth · 66 days ago

We actually did call it that internally for a while. The nested RBAC gets confusing fast once you're past one layer deep, whatever you name it.

aldanor · 66 days ago

openkink

bheadmaster · 66 days ago

GNU + kink

matt123456789 · 66 days ago

This is, if I had to guess, a monument to a small team's stubborn insistence that such a thing could be done at all. If I can hope for a reward for them, may it be that they are allowed to hand off maintaining it to another team.

teleforce · 66 days ago

Why stop at K3k? It should be named K3k3k to capture the truly recursive and nested nature of the container-in-container system.

Joking aside, I think this can be a great tool in the Kubernetes and container ecosystem.

One of the sibling comments claims it's a very niche application and that 99.9% of deployments will never use this nested feature. I beg to differ.

Apart from testing with a container-in-container arrangement, it can be a killer application for realistic simulation of network elements, as has been done in many network simulators, including ComNetsEmu and others [1],[2],[3],[4].

[1] Chapter 13 - ComNetsEmu: a lightweight emulator:

https://www.sciencedirect.com/science/chapter/edited-volume/...

[2] ViPMesh: A virtual prototyping framework for IEEE 802.11s wireless mesh networks:

https://ieeexplore.ieee.org/document/7763263

[3] NestedNet: A Container-based Prototyping Tool for Hierarchical Software Defined Networks:

https://ieeexplore.ieee.org/document/9244858

[4] Network Virtualization and Emulation using Docker, OpenvSwitch and Mininet-based Link Emulation:

https://scholarworks.umass.edu/masters_theses_2/985/

redrove · 66 days ago

So this is basically vCluster[0] but Rancher branded?

[0] https://github.com/loft-sh/vcluster

phrotoma · 66 days ago

Thanks. I knew I'd seen this idea before but couldn't remember the project name.

AlfeG · 66 days ago

Closely related in purpose, yes. Branded? No.

fsniper · 65 days ago

also kcp

randomtoast · 66 days ago

This type of approach carries a significantly higher operational risk compared to operating multiple Kubernetes clusters on separate VMs or physical hardware. If you eventually update the main Kubernetes cluster that manages the virtual clusters and something goes wrong, you could potentially bring down your entire fleet of Kubernetes clusters all at once.

lateral_cloud · 66 days ago

I don't think this is intended for production

rootnod3 · 65 days ago

Then why would SuSE spend money on it?

timber92 · 66 days ago

It's definitely used in production — we ran k3k for internal dev namespaces at my last job. Saved us from spinning up separate clusters for each team. Not as a replacement for real isolation, more like "we need 8 clusters but only have budget for 2 nodes."

ssousa666 · 65 days ago

My team runs several HarvesterHCI/RKE2 clusters, edge deployments of our validation, simulation and fleet management tools for autonomous vehicles. The Rancher ecosystem has really been a godsend for us.

Excited to experiment with k3k, but worried that I won't have the language to accurately describe the third layer of kubernetes in the stack. Host cluster -> Guest Host Cluster -> Guest Cluster? Host Cluster -> Guest Cluster -> Guest Guest Cluster?

rjzzleep · 66 days ago

Do Rancher side products generally reach a stable enough state that you would want to run mission-critical systems on them?

sofixa · 66 days ago

RKE (their Kubernetes deployment and management platform, mostly for various flavours of self-managed environments) is pretty popular with the self-managed crowd that needs something to manage their on-prem Kubernetes clusters.

rjzzleep · 66 days ago

That's why I wrote Rancher side products.

V99 · 66 days ago

(Former employee) They tend to either get enough traction very quickly and be supported for years, or not and be abandoned in weeks/months.

enrichman · 65 days ago

This is not a side product: it's currently GA and part of the Rancher Prime offering. :)

weitzj · 66 days ago

I don’t understand how they are separating security in the virtual mode as they only mention pods. It seems every workload still shares the underlying node, even when in virtual mode. Take for example the OCI cache on the nodes. What about cache poisoning?

enrichman · 66 days ago

In virtual mode, the only pods running directly on the host are the K3s servers and agents. All "virtual cluster pods" run within these components, meaning they do not appear as individual pods on the host cluster.

The only trade-off is that K3s currently requires privileged mode to operate. We are actively exploring ways to address this limitation and improve security, such as implementing user namespaces or microVMs.

weitzj · 66 days ago

Thank you for your feedback.

I understand that from the host cluster's perspective you won't see the child cluster pods. But what is the perspective on nodes?

Can you have a host cluster running on host nodes, with the host cluster controlling the spawning of separate physical nodes that contain the child cluster (API server) plus its workload pods?

bryanström · 66 days ago

We ran into this exact concern when evaluating similar setups. The node-level image cache is shared across all virtual clusters - a tenant could theoretically pull a compromised layer that gets cached and served to others. Whether that's actually exploitable depends on your registry setup.

ithkuil · 66 days ago

Aren't OCI caches content addressed?

weitzj · 66 days ago

I was thinking: if people were to use an image…:$my_tag on the host cluster, and some rogue pod on the child cluster (but on the same underlying physical nodes) somehow overwrote the locally cached :my_tag, it could do something on the parent cluster.

But I don't fully understand what you meant by content addressed :)

Maybe one has to ensure in the host cluster that the image pull policy is set to Always, or that all references to images are based on the shasum rather than tags.
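
On the "content addressed" question, here is a minimal Python sketch of the distinction being discussed. It is a toy model, not how any particular container runtime is implemented: the `tags` dictionary, the layer bytes, and the helper names `content_digest` and `is_digest_pinned` are all made up for illustration. The idea it shows is that a blob's digest is derived from its bytes, so a digest reference cannot silently point at different content, while a tag is just a mutable name that can be moved.

```python
import hashlib
import re

# A reference is "pinned" when it ends in @sha256:<64 hex chars>.
DIGEST_RE = re.compile(r"@sha256:[0-9a-f]{64}$")


def content_digest(blob: bytes) -> str:
    """Return the sha256 digest string for a blob of bytes."""
    return "sha256:" + hashlib.sha256(blob).hexdigest()


def is_digest_pinned(image_ref: str) -> bool:
    """True if the image reference names a digest rather than only a tag."""
    return DIGEST_RE.search(image_ref) is not None


# Hypothetical layer contents, for illustration only.
original_layer = b"original layer bytes"
tampered_layer = b"tampered layer bytes"

# Toy tag store: a tag is a mutable pointer to a digest.
tags = {"my_tag": content_digest(original_layer)}
pinned_digest = tags["my_tag"]

# Someone moves the tag to different content.
tags["my_tag"] = content_digest(tampered_layer)

# The tag now resolves to something else; the pinned digest still
# identifies only the original bytes.
tag_was_moved = tags["my_tag"] != pinned_digest
pinned_still_matches = content_digest(original_layer) == pinned_digest

print(is_digest_pinned("registry.example/app:my_tag"))
print(is_digest_pinned("registry.example/app@" + pinned_digest))
print(tag_was_moved, pinned_still_matches)
```

In this model, referencing images by digest sidesteps the moved-tag scenario, which is the same conclusion as the shasum suggestion above. Whether a tag can actually be overwritten across clusters depends on the runtime and registry setup, which this sketch does not model.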

ohnei · 66 days ago

It doesn't seem like it is at a deep layer such that it could be used to test updates to kubernetes and CRDs in a cluster that isn't yet updated?

BobbyTables2 · 64 days ago

Multi-tenant hosting using containers? Thanks, I really needed a good laugh today…

nonameiguess · 66 days ago

Hacker News sure does love posting links to random GitHub repos with no context for why they were posted, then a bunch of comments come along and basically ask why.

Since I do have context, the original Rancher Labs CTO created k3s, one of the earliest severely stripped down versions of Kubernetes, which bundles all of the required executables into a single multi-call binary, in order to be able to run Kubernetes on a Raspberry Pi. Along the lines of kind, k3d was released to be able to run k3s in Docker containers instead of full Linux hosts. The main use case is testing. We used it extensively in the early days of Air Force and IC cloud migrations that insisted we needed to rehost all systems in Kubernetes, so developers could have local targets to work with. Rancher eventually rebuilt its Kubernetes engine when Docker fell out of favor and based rke2 on k3s, but with the Kubernetes components as static pods instead of embedded multi-call binaries, and with kubelet and containerd extracted from an embedded virtual filesystem to the host when rke2 is first run.

When KubeVirt came out, Rancher also released an HCI product that uses it, Harvester, running on top of rke2 and Rancher's storage project Longhorn. This runs a full virtual machine manager with virtualized networking and storage, a la something like ESXi, vSAN, and vSphere, with Multus and the bridge CNI plugin providing the networking (it now has KubeOVN as well).

Harvester relies on being imported to and managed by Rancher to have things like SSO, Rancher's multi-cluster RBAC, and node provisioners for Harvester to run guest clusters. A whole lot of customers migrating off of VMware since the Broadcom acquisition want all of that, but without necessarily having an external Rancher. Early on, Harvester offered an experimental vCluster addon that created a guest cluster with Rancher installed on it and that automatically managed Harvester.

This had a lot of problems. I'm not going to rehash them because I don't want to come across as bashing vCluster, but it was not a tenable long-term option and it crashed hard on most who tried to use it. Since Rancher already had k3d, it was a pretty natural step to just create their own virtualized Kubernetes that runs in Kubernetes by adapting k3d to become k3k, which runs k3s in Kubernetes rather than in Docker. Now you can get a guest cluster to install Rancher onto and get the full suite of Rancher features and a much better experience than the bare Harvester UI without needing to run full VMs.

Why not just install Rancher directly onto the same rke2 cluster that is running Harvester itself? Because it already has one. That embedded Rancher was considered an implementation detail: developers used it to bootstrap and to avoid duplicating work that was already done, but it was not meant to be exposed to users. If you try to install a second Rancher to actually use, you'll conflict with a whole bunch of resources that already exist and it won't work.

It's a tangled mess of confusing layers, but that's the world we live in. It's why we still have IPv4, VLAN, VXLAN, virtual terminals, discretionary access control for Linux. We build on top of what is already there instead of rebuilding from scratch in a saner way. This isn't just how software works. It's why city designs rarely make sense. It's why life itself has vestigial anti-features. Cruft rarely disappears. It just gets buried underneath whatever comes next.

2ndorderthought · 66 days ago

Can someone explain what this even means? Explain it like I am a software engineer with 20 years of experience who has not yet found a strong use case for running kubernetes outside of hand-holding cloud provider options.

phrotoma · 66 days ago

K8s encourages thinking about workloads as "cattle not pets". App running in K8s falls over? Blow it away and let K8s recreate it, etc.

However, clusters themselves often become the new pets. Many orgs do not reach a level of operational maturity where they can blow away and recreate whole clusters without downtime and toil.

A meta-pattern has emerged where higher-order tooling manages a whole fleet of clusters. This is an implementation of that meta-pattern which uses K8s itself as the higher-order tool to manage other clusters.

It's not a new idea, just a new implementation of the pattern.

2ndorderthought · 66 days ago

Thank you. Wow I had no idea this was a problem. Seems kind of nightmare territory. In a weird way it makes me respect elixir/erlang even more. It's not the exact same problem obviously but really had me thinking about beam etc

mystifyingpoi · 66 days ago

This is extremely niche. 99.9% of Kubernetes deployments will never need such nesting. It could be useful for testing tooling (I guess maybe operators?) without recreating the "top-level" cluster all the time.

Also it's a fun idea. Sandbox in a sandbox.

dboreham · 66 days ago

I've seen many bugs get to production for the lack of such testing.

geoffbp · 66 days ago

Send the link to AI and ask :)

2ndorderthought · 66 days ago

I have found I learn more when I talk to people who are really interested in a topic.

bloppe · 66 days ago

What does k3k stand for? Can we just put whatever number we want between 2 letters now?

olblak · 66 days ago

Disclosure: I work for SUSE on Rancher.

It's Kubernetes in Kubernetes and a reference to k3s, which is also a project we contribute to heavily at SUSE.

nextaccountic · 65 days ago

https://github.com/k3s-io/k3s#whats-with-the-name

> What's with the name?

> We wanted an installation of Kubernetes that was half the size in terms of memory footprint. Kubernetes is a 10 letter word stylized as k8s. So something half as big as Kubernetes would be a 5 letter word stylized as K3s. A '3' is also an '8' cut in half vertically. There is neither a long-form of K3s nor official pronunciation.

k3k is a play on k3s

but k3s is itself a play on words (k3s is supposed to be half the size of k8s, which stands for kubernetes)
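
The "k8s" arithmetic quoted above is the usual numeronym rule: keep the first and last letters and replace everything in between with a count. A minimal Python sketch of that rule follows; the function name `numeronym` is made up for illustration. Note that, per the README quote, K3s has no long form, so the five-letter input below is only a placeholder.

```python
def numeronym(word: str) -> str:
    """Abbreviate a word as first letter + count of middle letters + last letter."""
    if len(word) < 3:
        # Nothing in the middle to count, so leave short words alone.
        return word
    return f"{word[0]}{len(word) - 2}{word[-1]}"


# "kubernetes" has 10 letters: k + 8 middle letters + s.
print(numeronym("kubernetes"))  # k8s

# Any 5-letter word starting with k and ending with s gives k3s.
# "kxxxs" is a placeholder, not a real long form.
print(numeronym("kxxxs"))  # k3s
```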

wlonkly · 64 days ago

I've always liked to think that "k3s" is the numeric-abbreviation of "kates".[1]

[1] https://xcancel.com/PHP_CEO/status/823620960960053248

BurpyDave · 66 days ago

I suspect it’s ‘kubernetes in kubernetes’

wlonkly · 64 days ago

that's k2k!

stingraycharles · 66 days ago

I suspect it's a play on another kubernetes variant, `k3s` ?

pcald · 66 days ago

k in k

rootnod3 · 65 days ago

Cool, one more layer of indirection and abstraction. May I ask why? I fail to see the point, but I might just be grumpy.

madduci · 66 days ago

Nice, now we need K3Kind

freakynit · 66 days ago

Can we go deeper than two levels? (inception vibes..)

quartz56 · 65 days ago

Kubernetes all the way down, and yet somehow still less complex than a production Helm chart.