question

Kernel updates

To pull this conversation public, I thought I would throw a question out to the senior staff at Cycle in regards to Kernel/OS updates. Lets kick this thread off with a few questions around server security

  1. How are the kernel and service/library updates performed?
  2. Do these require downtime? Are we required to run 2N infra or evacuate our nodes to perform kernel upgrades?
  3. How often are servers patched?
  4. What root OS variant is used to build the OS from track CVSS scorings and potential vulnerabilities and determine compatibility with various workloads?

I think these type of questions serve as a baseline for determining how folks can address security updates and ensure their servers are kept up to date.

avatar
3
  • Hi Jeff!

    1. CycleOS is an atomically built OS where everything is statically compiled before ever distributed. The only updates that occur on these nodes are to the Cycle agent, and other services. Those are also statically compiled binaries.
    2. Kernel updates only occur at server reboot at this time. Historically, we've updated the kernel about twice a year. Since kernel updates occur at reboot, it's entirely up to you when you'd like to do them.
    3. Outside of the kernel, the Cycle services will get auto-updated whenever we push an update live. Usually, this occurs every 2-3 weeks -- though may occasionally vary depending on features we're building. These updates don't typically introduce any downtime, unless we mark the update as 'major' update -- which issues staggered restarts. We haven't published an update that has required downtime since 2022 -- and that downtime was ~12 seconds.
    4. CycleOS is based on Alpine

    Hope this helps!

    avatar
    platform
  • That's great Jake. So I assume for #2, the go forward process for updating their kernels is to 'evac nodes; restart ; sanity and reintroduce into the pool'.

    avatar
  • You can do that. Or, if your containers are running in HA mode, and you have more than 1 LB, you should be able to just restart the node without even evacuating. Some of your instances will be inaccessible, but Cycle is built to work with that. Only if you were to have services hosted exclusively on that node would it become an issue of downtime.

    avatar
    platform
v2024.12.08.01 © 2024 Petrichor Holdings, Inc.