Backward compatibility in Kubernetes operators can be an afterthought

I'm an engineer at MayaData, a company dedicated to enabling data agility through the use of Kubernetes as a data layer and the sponsor of the CNCF project OpenEBS and other open source projects. Take it for a test drive via free and easy to use management software by registering here.

Overview - Backward compatibility in Kubernetes operators

There are scenarios when a Kubernetes operators can be used to manage multiple Kubernetes clusters. OpenEBS Director is one such multitenant visualization and management software for OpenEBS storage. OpenEBS Director manages multiple OpenEBS clusters from a single instance. It comes with two flavors SaaS and On-Premise. In the SaaS version, Kubernetes clusters can be connected to Director online. However, if SaaS is not an option, these clusters can be connected to an on-premise Director installation.

OpenEBS Director being a multitenant software the connected clusters can be expected to have different versions of OpenEBS. Managing multiple versions of OpenEBS is challenging due to the differences in the Kubernetes resources that are supported.  Once  the Director is upgraded, its agents get upgraded automatically in all its connected clusters. These upgraded agents are now supposed to act based on the version of OpenEBS that the Kubernetes cluster is using.

We shall dig deep on this problem statement from the perspective of CStor. CStor is one of the OpenEBS data engines. CStor is popular amongst workloads that need efficient capacity management, replicas, point in time snapshots, incremental backups, etc. Its pool management is operated via an operator that works based on a custom resource named CStorPoolCluster (CSPC). The CSPC API version was v1alpha1 in OpenEBS version 1.9. However, with OpenEBS 1.10, the CSPC API version is now v1.

Kubernetes Operator - Fig 1

CStorClusterConfig is another custom resource that operates on CSPC and manages the advanced life cycles (i.e., day-0, day-1, day-2 operations) of CStor pools. Director in turn relies on CStorClusterConfig to manage CStor pools. Note that the Kubernetes controllers that watch for CStorClusterConfig are available in CstorPoolAuto repo.

Since CStorClusterConfig is used by Director it needs to maintain backward compatibility with multiple versions of OpenEBS to manage CStor pools. CstorPoolAuto uses the Metacontroller SDK under the hood to implement Kubernetes reconciliation. One of the advantages of using Metacontroller is its ability to handle backward compatibility. Metacontroller allows you to write the Kubernetes controller(s) in a declarative way. We need to write controller specification(s) declaratively, and corresponding instances will be created & run by Metacontroller using these specifications. Hence, the term “meta,” 

We used labelSelector which is one of the declarative controller features from Metacontroller to achieve backward compatibility. This feature is a simple way to let a controller manage specific instances of a custom resource. In our case, we have CStorClusterConfig custom resource which manages the life cycle of CStor pools. We had to create two declarative controllers, one to manage v1 CStorPoolCluster and another to manage v1alpha1 CStorPoolCluster via CStorClusterConfig.

For older custom resources we do not have any label with respect to CStorPoolCluster version so we use below configuration to manage v1alpha1 CStorPoolCluster.

labelSelector:
  matchExpressions:
    - key: cspc.openebs.io/version
      operator: DoesNotExist

To manage v1 CStorPoolCluster we introduced cspc.openebs.io/version  as a new label  in CStorClusterConfig that was set with value v1. In other words to manage v1 CStorPoolCluster, CStorClusterConfig should contain cspc.openebs.io/version=v1 label. 

Following is the controller configuration to manage v1 CStorPoolCluster:

labelSelector:
  matchLabels:
    cspc.openebs.io/version: v1

With these controller configurations if CStorClusterConfig contains cspc.openebs.io/version=v1 label then it is handled by the controller which is assigned to manage v1 CStorPoolClutser. If CStorClusterConfig does not contain cspc.openebs.io/version label then it goes to the controller which should manage v1alpha1 CStorPoolClutser.

This was the sample Metacontroller config used earlier

apiVersion: metac.openebs.io/v1alpha1
kind: GenericController
spec:
  watch:
    apiVersion: dao.mayadata.io/v1alpha1
    resource: cstorclusterconfigs
Kubernetes Operator - Fig 2

Following is the updated Metacontroller specification that supports both v1alpha1 as well as v1 versions of CStorPoolCluster.

Controller to support v1alpha1

apiVersion: metac.openebs.io/v1alpha1
kind: GenericController
spec:
  watch:
    apiVersion: dao.mayadata.io/v1alpha1
    resource: cstorclusterconfigs
    labelSelector:
      matchExpressions:
        - key: cspc.openebs.io/version
          operator: DoesNotExist

Controller to support v1

apiVersion: metac.openebs.io/v1alpha1
kind: GenericController
spec:
  watch:
    apiVersion: dao.mayadata.io/v1alpha1
    resource: cstorclusterconfigs
    labelSelector:
      matchLabels:
        cspc.openebs.io/version: v1
Kubernetes Operator - Fig 3

 At MayaData we are using a fork of metacontroller. To know more about metacontroller join here.

References

Access Director online - https://director.mayadata.io
For help - https://help.mayadata.io
OpenEBS community — https://slack.openebs.io
Metacontroller community — link

Don Williams
Don is the CEO of MayaData and leading the company for last one year. He has an exceptional record of accomplishments leading technology teams for organizations ranging from private equity-backed start-ups to large, global corporations. He has deep experience in engineering, operations, and product development in highly technical and competitive marketplaces. His extensive professional network in several industries, large corporations and government agencies is a significant asset to early stage businesses, often essential to achieve product placement, growth and position for potential exit strategies.
Kiran Mova
Kiran evangelizes open culture and open-source execution models and is a lead maintainer and contributor to the OpenEBS project. Passionate about Kubernetes and Storage Orchestration. Contributor and Maintainer OpenEBS projects. Co-founder and Chief Architect at MayaData Inc.
Murat Karslioglu
VP @OpenEBS & @MayaData_Inc. Murat Karslioglu is a serial entrepreneur, technologist, and startup advisor with over 15 years of experience in storage, distributed systems, and enterprise hardware development. Prior to joining MayaData, Murat worked at Hewlett Packard Enterprise / 3PAR Storage in various advanced development projects including storage file stack performance optimization and the storage management stack for HPE’s Hyper-converged solution. Before joining HPE, Murat led virtualization and OpenStack integration projects within the Nexenta CTO Office. Murat holds a Bachelor’s Degree in Industrial Engineering from the Sakarya University, Turkey, as well as a number of IT certifications. When he is not in his lab, he loves to travel, advise startups, and spend time with his family. Lives to innovate! Opinions my own!