Creating an Argo Rollouts Plugin¶
High Level Overview¶
Argo Rollouts plugins depend on HashiCorp's go-plugin library. This library lets a plugin be compiled as a standalone executable and then loaded by the rollouts controller at runtime. The plugin executable acts as an RPC server and the rollouts controller acts as the client. The controller starts the plugin executable as a long-lived process and connects to it over a Unix socket.
Here is an overview of how plugins are loaded:
The communication protocol uses Go's built-in net/rpc library, so plugins have to be written in Go.
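To make the server/client split concrete, here is a minimal sketch that uses only the standard library's net/rpc over a Unix socket. It does not use go-plugin itself, and the DemoPlugin type, its Type method signature, and the socket file name are illustrative assumptions rather than anything the rollouts controller defines.

```go
package main

import (
	"fmt"
	"net"
	"net/rpc"
	"os"
	"path/filepath"
)

// DemoPlugin is a hypothetical RPC receiver standing in for a plugin implementation.
type DemoPlugin struct{}

// Type follows the net/rpc method shape: exported, two arguments
// (the second a pointer used for the reply), and an error return.
func (p *DemoPlugin) Type(_ int, reply *string) error {
	*reply = "demo-plugin"
	return nil
}

// callTypeOverUnixSocket serves DemoPlugin on a Unix socket (the "plugin" side)
// and then calls it the way a client (the "controller" side) would.
func callTypeOverUnixSocket() (string, error) {
	dir, err := os.MkdirTemp("", "plugin-demo")
	if err != nil {
		return "", err
	}
	defer os.RemoveAll(dir)

	ln, err := net.Listen("unix", filepath.Join(dir, "plugin.sock"))
	if err != nil {
		return "", err
	}
	defer ln.Close()

	srv := rpc.NewServer()
	if err := srv.Register(&DemoPlugin{}); err != nil {
		return "", err
	}
	go srv.Accept(ln) // serve connections until the listener is closed

	client, err := rpc.Dial("unix", ln.Addr().String())
	if err != nil {
		return "", err
	}
	defer client.Close()

	var reply string
	err = client.Call("DemoPlugin.Type", 0, &reply)
	return reply, err
}

func main() {
	name, err := callTypeOverUnixSocket()
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(name)
}
```

In the real setup the two halves live in separate processes and go-plugin takes care of starting the executable and agreeing on the socket; the sketch keeps both in one process only so it can run on its own.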
Plugin Repository¶
In order to get plugins listed in the main Argo Rollouts documentation, we ask that the plugin repository be created under the argoproj-labs organization. Please open an issue under argo-rollouts requesting a repo, on which you will be granted admin access.
There is also a standard naming convention for plugin names, used both for ConfigMap registration and for how the plugin locates its specific configuration on rollout or analysis resources. The name needs to be in the form <namespace>/<name>, where the two parts follow the rules for a username/org and a repository name. This requirement allows multiple creators of the same plugin type to coexist, such as <org1>/nginx and <org2>/nginx. These names could be based off the repo name, such as argoproj-labs/rollouts-plugin-metric-sample-prometheus, but it is not a requirement.
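As a rough illustration of the <namespace>/<name> shape, the following sketch splits and validates a plugin name. The character pattern is an assumption chosen for the example; it is not the check the rollouts controller actually applies, and splitPluginName is a hypothetical helper.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// partPattern is an illustrative assumption, not the controller's actual rule:
// letters, digits, '.', '_' and '-', starting with a letter or digit.
var partPattern = regexp.MustCompile(`^[A-Za-z0-9][A-Za-z0-9._-]*$`)

// splitPluginName splits "<namespace>/<name>" and validates both halves.
func splitPluginName(full string) (namespace, name string, err error) {
	parts := strings.Split(full, "/")
	if len(parts) != 2 {
		return "", "", fmt.Errorf("plugin name %q must have the form <namespace>/<name>", full)
	}
	for _, p := range parts {
		if !partPattern.MatchString(p) {
			return "", "", fmt.Errorf("plugin name %q has an invalid part %q", full, p)
		}
	}
	return parts[0], parts[1], nil
}

func main() {
	for _, n := range []string{"argoproj-labs/nginx", "nginx", "org1//nginx"} {
		ns, name, err := splitPluginName(n)
		fmt.Println(ns, name, err)
	}
}
```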
There will also be a standard for naming repositories under argoproj-labs, in the form rollouts-plugin-<type>-<tool>, where <type> is one of metric, step, or trafficrouter and <tool> is the software the plugin is for, say nginx.
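The repository convention is simple enough to express as a small helper. This is only a sketch: repoName is a hypothetical function, and the empty-tool check is an added assumption.

```go
package main

import (
	"fmt"
)

// repoName builds "rollouts-plugin-<type>-<tool>" and rejects unknown plugin types.
func repoName(pluginType, tool string) (string, error) {
	switch pluginType {
	case "metric", "step", "trafficrouter":
		// one of the three documented types
	default:
		return "", fmt.Errorf("unknown plugin type %q", pluginType)
	}
	if tool == "" {
		return "", fmt.Errorf("tool must not be empty")
	}
	return "rollouts-plugin-" + pluginType + "-" + tool, nil
}

func main() {
	fmt.Println(repoName("trafficrouter", "nginx"))
}
```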
Plugin Name¶
Now that we have an idea of the plugin naming and repository standards, let's pick a name to use for the rest of this documentation and call our plugin argoproj-labs/nginx.
This name is used in a few different places. The first is the ConfigMap that your plugin users will need to configure, which looks like this:
apiVersion: v1
kind: ConfigMap
metadata:
  name: argo-rollouts-config
data:
  metricProviderPlugins: |-
    - name: "argoproj-labs/metrics"
      location: "file:///tmp/argo-rollouts/metric-plugin"
      args:
        - "--log-level"
        - "debug"
  trafficRouterPlugins: |-
    - name: "argoproj-labs/nginx"
      location: "file:///tmp/argo-rollouts/traffic-plugin"
      args:
        - "--log-level"
        - "debug"
  stepPlugins: |-
    - name: "argoproj-labs/canary-step"
      location: "file:///tmp/argo-rollouts/canary-step-plugin"
      disabled: false
      args:
        - "--log-level"
        - "debug"
As you can see, there is a name field under each plugin type. This is the first place where your end users will need to configure the name of the plugin. The second is either the Rollout object or the AnalysisTemplate, shown in the examples below. The args field holds the command-line arguments passed to the plugin.
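On the plugin side, those arguments can be read like any other command-line flags. The sketch below uses the standard flag package; the --log-level name comes from the ConfigMap example, while the "info" default and the parseLogLevel helper are assumptions made for illustration.

```go
package main

import (
	"flag"
	"fmt"
)

// parseLogLevel parses the args a user listed for the plugin in the ConfigMap.
// The "info" default is an assumption, not something the controller mandates.
func parseLogLevel(args []string) (string, error) {
	fs := flag.NewFlagSet("plugin", flag.ContinueOnError)
	level := fs.String("log-level", "info", "log verbosity for the plugin")
	if err := fs.Parse(args); err != nil {
		return "", err
	}
	return *level, nil
}

func main() {
	// A plugin binary would normally pass os.Args[1:] here; the slice below
	// mirrors the args list from the ConfigMap example.
	level, err := parseLogLevel([]string{"--log-level", "debug"})
	fmt.Println(level, err)
}
```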
Configuration Examples¶
AnalysisTemplate¶
You can see that we use the plugin name under spec.metrics[].provider.plugin for the analysis template.
You, as a plugin author, can then put any configuration you need under argoproj-labs/metrics and look up that config in your plugin via the plugin name key. You will also want to document which configuration options your plugin supports.
apiVersion: argoproj.io/v1alpha1
kind: AnalysisTemplate
metadata:
  name: success-rate
spec:
  metrics:
    - name: success-rate
      ...
      provider:
        plugin:
          argoproj-labs/metrics:
            address: http://prometheus.local
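Looking up configuration by the plugin name key might look like the sketch below. It assumes the per-plugin configuration reaches the plugin as raw JSON keyed by plugin name; pluginConfigs is a local stand-in for that field, and the Config struct is a hypothetical schema with only the address option from the example.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// pluginName is the key users put under provider.plugin.
const pluginName = "argoproj-labs/metrics"

// pluginConfigs is a stand-in for the plugin section of a metric provider:
// raw JSON per plugin name (an assumption for this sketch).
type pluginConfigs map[string]json.RawMessage

// Config is a hypothetical schema for this plugin's settings.
type Config struct {
	Address string `json:"address"`
}

// lookupConfig finds this plugin's raw config by name and decodes it.
func lookupConfig(plugins pluginConfigs) (Config, error) {
	raw, ok := plugins[pluginName]
	if !ok {
		return Config{}, fmt.Errorf("no configuration found under %q", pluginName)
	}
	var cfg Config
	if err := json.Unmarshal(raw, &cfg); err != nil {
		return Config{}, fmt.Errorf("decoding %q config: %w", pluginName, err)
	}
	return cfg, nil
}

func main() {
	plugins := pluginConfigs{
		pluginName: json.RawMessage(`{"address": "http://prometheus.local"}`),
	}
	cfg, err := lookupConfig(plugins)
	fmt.Println(cfg.Address, err)
}
```

Returning an explicit error when the key is missing gives users a clear message if the name in the template does not match the name the plugin expects.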
Traffic Router¶
You can see that we use the plugin name under spec.strategy.canary.trafficRouting.plugins for traffic routers.
You, as a plugin author, can then put any configuration you need under argoproj-labs/nginx and look up that config in your plugin via the plugin name key. You will also want to document which configuration options your plugin supports.
apiVersion: argoproj.io/v1alpha1
kind: Rollout
metadata:
  name: example-plugin-ro
spec:
  strategy:
    canary:
      canaryService: example-plugin-ro-canary-analysis
      stableService: example-plugin-ro-stable-analysis
      trafficRouting:
        plugins:
          argoproj-labs/nginx:
            stableIngress: canary-demo
Step Plugin¶
You can see that we use the plugin name under spec.strategy.canary.steps[].plugin.name for canary steps.
You, as a plugin author, can then put any configuration you need in the plugin object under the config property, and you will receive it as an argument when your plugin is called. You will also want to document which configuration options your plugin supports.
apiVersion: argoproj.io/v1alpha1
kind: Rollout
metadata:
  name: example-plugin-ro
spec:
  strategy:
    canary:
      steps:
        - plugin:
            name: argoproj-labs/step-exec
            config:
              command: echo "hello world"
Plugin Interfaces¶
Argo Rollouts currently supports three plugin systems. As a plugin author, your end goal is to implement at least one of these interfaces as a HashiCorp go-plugin. The interfaces are MetricProviderPlugin, TrafficRouterPlugin and StepPlugin, one for each of the respective plugin types:
type MetricProviderPlugin interface {
	// InitPlugin initializes the metric provider plugin. This gets called once when the plugin is loaded.
	InitPlugin() RpcError
	// Run starts a new external system call for a measurement.
	// Should be idempotent and do nothing if a call has already been started.
	Run(*v1alpha1.AnalysisRun, v1alpha1.Metric) v1alpha1.Measurement
	// Resume checks if the external system call is finished and returns the current measurement.
	Resume(*v1alpha1.AnalysisRun, v1alpha1.Metric, v1alpha1.Measurement) v1alpha1.Measurement
	// Terminate will terminate an in-progress measurement.
	Terminate(*v1alpha1.AnalysisRun, v1alpha1.Metric, v1alpha1.Measurement) v1alpha1.Measurement
	// GarbageCollect is used to garbage collect completed measurements to the specified limit.
	GarbageCollect(*v1alpha1.AnalysisRun, v1alpha1.Metric, int) RpcError
	// Type gets the provider type.
	Type() string
	// GetMetadata returns any additional metadata which providers need to store/display as part
	// of the metric result. For example, Prometheus uses it to store the final resolved queries.
	GetMetadata(metric v1alpha1.Metric) map[string]string
}
type TrafficRouterPlugin interface {
	// InitPlugin initializes the traffic router plugin. This gets called once when the plugin is loaded.
	InitPlugin() RpcError
	// UpdateHash informs a traffic routing reconciler about new canary, stable, and additionalDestination(s) pod hashes
	UpdateHash(rollout *v1alpha1.Rollout, canaryHash, stableHash string, additionalDestinations []v1alpha1.WeightDestination) RpcError
	// SetWeight sets the canary weight to the desired weight
	SetWeight(rollout *v1alpha1.Rollout, desiredWeight int32, additionalDestinations []v1alpha1.WeightDestination) RpcError
	// SetHeaderRoute sets the header routing step
	SetHeaderRoute(rollout *v1alpha1.Rollout, setHeaderRoute *v1alpha1.SetHeaderRoute) RpcError
	// SetMirrorRoute sets up the traffic router to mirror traffic to a service
	SetMirrorRoute(rollout *v1alpha1.Rollout, setMirrorRoute *v1alpha1.SetMirrorRoute) RpcError
	// VerifyWeight returns true if the canary is at the desired weight and additionalDestinations are at the weights specified
	// Returns nil if weight verification is not supported or not applicable
	VerifyWeight(rollout *v1alpha1.Rollout, desiredWeight int32, additionalDestinations []v1alpha1.WeightDestination) (RpcVerified, RpcError)
	// RemoveManagedRoutes removes all routes that are managed by rollouts by looking at spec.strategy.canary.trafficRouting.managedRoutes
	RemoveManagedRoutes(ro *v1alpha1.Rollout) RpcError
	// Type returns the type of the traffic routing reconciler
	Type() string
}
type StepPlugin interface {
	// InitPlugin initializes the canary step plugin. This gets called once when the plugin is loaded.
	InitPlugin() RpcError
	// Run executes a step plugin for the RpcStepContext and returns the result to the controller or an RpcError for unexpected failures
	Run(*v1alpha1.Rollout, *RpcStepContext) (RpcStepResult, RpcError)
	// Terminate stops an uncompleted operation started by the Run operation
	Terminate(*v1alpha1.Rollout, *RpcStepContext) (RpcStepResult, RpcError)
	// Abort reverts the actions performed during the Run operation if necessary
	Abort(*v1alpha1.Rollout, *RpcStepContext) (RpcStepResult, RpcError)
	// Type returns the type of the step plugin
	Type() string
}
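To show what implementing one of these interfaces involves, here is a skeleton step plugin. So that it compiles on its own, it declares local stand-ins for Rollout, RpcError, RpcStepContext and RpcStepResult; the fields on those stand-ins, the phase string, and the echoStep type are assumptions for illustration, and a real plugin would use the types shipped with Argo Rollouts instead.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Local stand-ins for the real types (assumptions, trimmed to what the sketch needs).
type Rollout struct{ Name string }
type RpcError struct{ ErrorString string }
type RpcStepContext struct {
	PluginName string
	Config     json.RawMessage
}
type RpcStepResult struct {
	Phase   string
	Message string
}

// StepPlugin mirrors the interface above, expressed with the stand-in types.
type StepPlugin interface {
	InitPlugin() RpcError
	Run(*Rollout, *RpcStepContext) (RpcStepResult, RpcError)
	Terminate(*Rollout, *RpcStepContext) (RpcStepResult, RpcError)
	Abort(*Rollout, *RpcStepContext) (RpcStepResult, RpcError)
	Type() string
}

// echoStep is a hypothetical plugin that only reports the configured command.
type echoStep struct{}

// Compile-time check that echoStep satisfies the interface.
var _ StepPlugin = (*echoStep)(nil)

func (p *echoStep) InitPlugin() RpcError { return RpcError{} }

// Run decodes the step's config and reports what it would execute.
func (p *echoStep) Run(ro *Rollout, ctx *RpcStepContext) (RpcStepResult, RpcError) {
	var cfg struct {
		Command string `json:"command"`
	}
	if err := json.Unmarshal(ctx.Config, &cfg); err != nil {
		return RpcStepResult{}, RpcError{ErrorString: err.Error()}
	}
	// "Successful" is a placeholder phase value for this sketch.
	return RpcStepResult{Phase: "Successful", Message: "would run: " + cfg.Command}, RpcError{}
}

func (p *echoStep) Terminate(*Rollout, *RpcStepContext) (RpcStepResult, RpcError) {
	return RpcStepResult{}, RpcError{} // nothing in flight to stop
}

func (p *echoStep) Abort(*Rollout, *RpcStepContext) (RpcStepResult, RpcError) {
	return RpcStepResult{}, RpcError{} // nothing to revert
}

func (p *echoStep) Type() string { return "echo-step" }

func main() {
	ctx := &RpcStepContext{
		PluginName: "argoproj-labs/step-exec",
		Config:     json.RawMessage(`{"command": "echo \"hello world\""}`),
	}
	res, rpcErr := (&echoStep{}).Run(&Rollout{Name: "example-plugin-ro"}, ctx)
	fmt.Println(res.Phase, res.Message, rpcErr.ErrorString)
}
```

The `var _ StepPlugin = (*echoStep)(nil)` line is a common Go idiom: it makes the build fail early if a method is missing or has the wrong signature.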
Plugin Init Function¶
Each plugin interface has an InitPlugin function. It is called when the plugin is first started, and only once per startup. InitPlugin gives you, the plugin author, a place to initialize the plugin: for example, to set up a client for a specific metrics provider or, in the case of a traffic router, to construct a client or informer for the Kubernetes API. One thing to note is that because these calls happen over RPC, the plugin author should not depend on state being stored in the plugin struct, as it will not be persisted between calls.
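One way to follow that advice in a metric plugin is to carry per-measurement state on the value that is handed back on the next call, rather than on the plugin struct. The sketch below illustrates the idea with a local stand-in for the measurement type; the Metadata field, the jobID key, the phase strings, and the simplified Run and Resume signatures are all assumptions made for the example.

```go
package main

import "fmt"

// Measurement is a stand-in for the real measurement type; only the
// fields this sketch needs are modeled (assumption).
type Measurement struct {
	Phase    string
	Metadata map[string]string
}

// provider deliberately has no fields: nothing is kept between calls.
type provider struct{}

// Run starts a hypothetical external job and records its ID on the measurement.
func (p *provider) Run() Measurement {
	return Measurement{
		Phase:    "Running",
		Metadata: map[string]string{"jobID": "job-123"}, // placeholder ID
	}
}

// Resume recovers the job ID from the measurement it is handed back.
func (p *provider) Resume(m Measurement) Measurement {
	jobID, ok := m.Metadata["jobID"]
	if !ok {
		m.Phase = "Error" // no state to resume from
		return m
	}
	m.Phase = "Successful"
	m.Metadata["checked"] = jobID
	return m
}

func main() {
	started := (&provider{}).Run()
	// A fresh instance stands in for "no struct state survived the call".
	finished := (&provider{}).Resume(started)
	fmt.Println(finished.Phase, finished.Metadata["checked"])
}
```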
Kubernetes RBAC¶
The plugin runs as a child process of the rollouts controller and as such it will inherit the same RBAC permissions as the controller. This means that the service account for the rollouts controller will need the correct permissions for the plugin to function. This might mean instructing users to create a role and role binding to the standard rollouts service account for the plugin to use. This will probably affect traffic router plugins more than metrics plugins.
Sample Plugins¶
There are sample plugins within the argo-rollouts repo that you can use as a reference for creating your own plugin.