2019-04-24 11:33:34 +02:00

4.2 KiB

CAAS KUBERNETES POD DataDog monitors

How to use this module

module "datadog-monitors-caas-kubernetes-pod" {
  source = "git::ssh://git@git.fr.clara.net/claranet/pt-monitoring/projects/datadog/terraform/monitors.git//caas/kubernetes/pod?ref={revision}"

  environment = "${var.environment}"
  message     = "${module.datadog-message-alerting.alerting-message}"
}

Purpose

Creates DataDog monitors with the following checks:

  • Kubernetes Pod CrashLoopBackOff
  • Kubernetes Pod errors
  • Kubernetes Pod phase status failed

Inputs

Name Description Type Default Required
crashloopbackoff_enabled Flag to enable Pod crashloopbackoff monitor string "true" no
crashloopbackoff_extra_tags Extra tags for Pod crashloopbackoff monitor list [] no
crashloopbackoff_message Custom message for Pod crashloopbackoff monitor string "" no
crashloopbackoff_silenced Groups to mute for Pod crashloopbackoff monitor map {} no
crashloopbackoff_threshold_critical crashloopbackoff critical threshold string "5" no
crashloopbackoff_threshold_warning crashloopbackoff warning threshold string "4" no
crashloopbackoff_time_aggregator Monitor aggregator for Pod crashloopbackoff [available values: min, max or avg] string "sum" no
crashloopbackoff_timeframe Monitor timeframe for Pod crashloopbackoff [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_10m" no
environment Architecture environment string n/a yes
error_enabled Flag to enable Pod errors monitor string "true" no
error_extra_tags Extra tags for Pod errors monitor list [] no
error_message Custom message for Pod errors monitor string "" no
error_silenced Groups to mute for Pod errors monitor map {} no
error_threshold_critical error critical threshold string "1" no
error_threshold_warning error warning threshold string "0.5" no
error_time_aggregator Monitor aggregator for Pod errors [available values: min, max or avg] string "sum" no
error_timeframe Monitor timeframe for Pod errors [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_15m" no
evaluation_delay Delay in seconds for the metric evaluation string "15" no
filter_tags_custom Tags used for custom filtering when filter_tags_use_defaults is false string "*" no
filter_tags_custom_excluded Tags excluded for custom filtering when filter_tags_use_defaults is false string "" no
filter_tags_use_defaults Use default filter tags convention string "true" no
message Message sent when a monitor is triggered string n/a yes
new_host_delay Delay in seconds before monitor new resource string "300" no
pod_phase_status_enabled Flag to enable Pod phase status monitor string "true" no
pod_phase_status_extra_tags Extra tags for Pod phase status monitor list [] no
pod_phase_status_message Custom message for Pod phase status monitor string "" no
pod_phase_status_silenced Groups to mute for Pod phase status monitor map {} no
pod_phase_status_time_aggregator Monitor aggregator for Pod phase status [available values: min, max or avg] string "max" no
pod_phase_status_timeframe Monitor timeframe for Pod phase status [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
prefix_slug Prefix string to prepend between brackets on every monitors names string "" no

Outputs

Name Description
crashloopbackoff_id id for monitor crashloopbackoff
error_id id for monitor error
pod_phase_status_id id for monitor pod_phase_status