2019-07-19 11:40:41 +02:00

17 KiB

CLOUD AZURE STORAGE DataDog monitors

How to use this module

module "datadog-monitors-cloud-azure-storage" {
  source = "git::ssh://git@git.fr.clara.net/claranet/pt-monitoring/projects/datadog/terraform/monitors.git//cloud/azure/storage?ref={revision}"

  environment = "${var.environment}"
  message     = "${module.datadog-message-alerting.alerting-message}"
}

Purpose

Creates DataDog monitors with the following checks:

  • Azure Blob Storage too many authorization errors
  • Azure Blob Storage too many client_other errors
  • Azure Blob Storage too many network errors
  • Azure Blob Storage too many server_other errors
  • Azure Blob Storage too many throttling errors
  • Azure Blob Storage too many timeout errors
  • Azure File Storage too many authorization errors
  • Azure File Storage too many client_other errors
  • Azure File Storage too many network errors
  • Azure File Storage too many server_other errors
  • Azure File Storage too many timeout errors
  • Azure Queue Storage too many authorization errors
  • Azure Queue Storage too many client_other errors
  • Azure Queue Storage too many network errors
  • Azure Queue Storage too many server_other errors
  • Azure Queue Storage too many timeout errors
  • Azure Storage Blob service too few successful requests
  • Azure Storage Blob service too high end to end latency
  • Azure Storage Blob services is down
  • Azure Storage File service too few successful requests
  • Azure Storage File service too high end to end latency
  • Azure Storage File services is down
  • Azure Storage is down
  • Azure Storage Queue service is down
  • Azure Storage Queue service too few successful requests
  • Azure Storage Queue service too high end to end latency
  • Azure Storage Table service is down
  • Azure Storage Table service too few successful requests
  • Azure Storage Table service too high end to end latency
  • Azure Storage too many throttling errors
  • Azure Storage too many throttling errors
  • Azure Storage too many throttling errors
  • Azure Table Storage too many authorization errors
  • Azure Table Storage too many client_other errors
  • Azure Table Storage too many network errors
  • Azure Table Storage too many server_other errors
  • Azure Table Storage too many timeout errors

Inputs

Name Description Type Default Required
authorization_error_requests_enabled Flag to enable Storage authorization errors monitor string "true" no
authorization_error_requests_extra_tags Extra tags for Storage authorization errors monitor list [] no
authorization_error_requests_message Custom message for Storage authorization errors monitor string "" no
authorization_error_requests_silenced Groups to mute for Storage authorization errors monitor map {} no
authorization_error_requests_threshold_critical Maximum acceptable percent of authorization error requests for a storage string "90" no
authorization_error_requests_threshold_warning Warning regarding acceptable percent of authorization error requests for a storage string "50" no
authorization_error_requests_time_aggregator Monitor aggregator for Storage authorization errors [available values: min, max or avg] string "min" no
authorization_error_requests_timeframe Monitor timeframe for Storage authorization errors [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
availability_enabled Flag to enable Storage availability monitor string "true" no
availability_extra_tags Extra tags for Storage availability monitor list [] no
availability_message Custom message for Storage availability monitor string "" no
availability_silenced Groups to mute for Storage availability monitor map {} no
availability_threshold_critical Minimum acceptable percent of availability for a storage string "50" no
availability_threshold_warning Warning regarding acceptable percent of availability for a storage string "90" no
availability_time_aggregator Monitor aggregator for Storage availability [available values: min, max or avg] string "max" no
availability_timeframe Monitor timeframe for Storage availability [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
client_other_error_requests_enabled Flag to enable Storage other errors monitor string "true" no
client_other_error_requests_extra_tags Extra tags for Storage other errors monitor list [] no
client_other_error_requests_message Custom message for Storage other errors monitor string "" no
client_other_error_requests_silenced Groups to mute for Storage other errors monitor map {} no
client_other_error_requests_threshold_critical Maximum acceptable percent of client other error requests for a storage string "90" no
client_other_error_requests_threshold_warning Warning regarding acceptable percent of client other error requests for a storage string "50" no
client_other_error_requests_time_aggregator Monitor aggregator for Storage other errors [available values: min, max or avg] string "min" no
client_other_error_requests_timeframe Monitor timeframe for Storage other errors [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
environment Architecture environment string n/a yes
evaluation_delay Delay in seconds for the metric evaluation string "900" no
filter_tags_custom Tags used for custom filtering when filter_tags_use_defaults is false string "*" no
filter_tags_custom_excluded Tags excluded for custom filtering when filter_tags_use_defaults is false string "" no
filter_tags_use_defaults Use default filter tags convention string "true" no
latency_enabled Flag to enable Storage latency monitor string "true" no
latency_extra_tags Extra tags for Storage latency monitor list [] no
latency_message Custom message for Storage latency monitor string "" no
latency_silenced Groups to mute for Storage latency monitor map {} no
latency_threshold_critical Maximum acceptable end to end latency (ms) for a storage string "2000" no
latency_threshold_warning Warning regarding acceptable end to end latency (ms) for a storage string "1000" no
latency_time_aggregator Monitor aggregator for Storage latency [available values: min, max or avg] string "min" no
latency_timeframe Monitor timeframe for Storage latency [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
message Message sent when a Redis monitor is triggered string n/a yes
network_error_requests_enabled Flag to enable Storage network errors monitor string "true" no
network_error_requests_extra_tags Extra tags for Storage network errors monitor list [] no
network_error_requests_message Custom message for Storage network errors monitor string "" no
network_error_requests_silenced Groups to mute for Storage network errors monitor map {} no
network_error_requests_threshold_critical Maximum acceptable percent of network error requests for a storage string "90" no
network_error_requests_threshold_warning Warning regarding acceptable percent of network error requests for a storage string "50" no
network_error_requests_time_aggregator Monitor aggregator for Storage network errors [available values: min, max or avg] string "min" no
network_error_requests_timeframe Monitor timeframe for Storage network errors [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
new_host_delay Delay in seconds before monitor new resource string "300" no
prefix_slug Prefix string to prepend between brackets on every monitors names string "" no
server_other_error_requests_enabled Flag to enable Storage server other errors monitor string "true" no
server_other_error_requests_extra_tags Extra tags for Storage server other errors monitor list [] no
server_other_error_requests_message Custom message for Storage server other errors monitor string "" no
server_other_error_requests_silenced Groups to mute for Storage server other errors monitor map {} no
server_other_error_requests_threshold_critical Maximum acceptable percent of server other error requests for a storage string "90" no
server_other_error_requests_threshold_warning Warning regarding acceptable percent of server other error requests for a storage string "50" no
server_other_error_requests_time_aggregator Monitor aggregator for Storage other errors [available values: min, max or avg] string "min" no
server_other_error_requests_timeframe Monitor timeframe for Storage server other errors [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
status_enabled Flag to enable App Services status monitor string "true" no
status_extra_tags Extra tags for App Services status monitor list [] no
status_message Custom message for storage Services status monitor string "" no
status_silenced Groups to mute for App Services status monitor map {} no
status_time_aggregator Monitor aggregator for Storage Services status [available values: min, max or avg] string "max" no
status_timeframe Monitor timeframe for Storage Services status [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
successful_requests_enabled Flag to enable Storage sucessful requests monitor string "true" no
successful_requests_extra_tags Extra tags for Storage sucessful requests monitor list [] no
successful_requests_message Custom message for Storage sucessful requests monitor string "" no
successful_requests_silenced Groups to mute for Storage sucessful requests monitor map {} no
successful_requests_time_aggregator Monitor aggregator for Storage sucessful requests [available values: min, max or avg] string "max" no
successful_requests_timeframe Monitor timeframe for Storage sucessful requests [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
successful_storage_requests_threshold_critical Minimum acceptable percent of successful requests for a storage string "90" no
successful_storage_requests_threshold_warning Warning regarding acceptable percent of successful requests for a storage string "70" no
throttling_error_requests_enabled Flag to enable Storage throttling error monitor string "true" no
throttling_error_requests_extra_tags Extra tags for Storage throttling error monitor list [] no
throttling_error_requests_message Custom message for Storage throttling error monitor string "" no
throttling_error_requests_silenced Groups to mute for Storage throttling error monitor map {} no
throttling_error_requests_threshold_critical Maximum acceptable percent of throttling error requests for a storage string "90" no
throttling_error_requests_threshold_warning Warning regarding acceptable percent of throttling error requests for a storage string "50" no
throttling_error_requests_time_aggregator Monitor aggregator for Storage throttling errors [available values: min, max or avg] string "min" no
throttling_error_requests_timeframe Monitor timeframe for Storage throttling errors [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no
timeout_error_requests_enabled Flag to enable Storage timeout monitor string "true" no
timeout_error_requests_extra_tags Extra tags for Storage timeout monitor list [] no
timeout_error_requests_message Custom message for Storage timeout monitor string "" no
timeout_error_requests_silenced Groups to mute for Storage timeout monitor map {} no
timeout_error_requests_threshold_critical Maximum acceptable percent of timeout error requests for a storage string "90" no
timeout_error_requests_threshold_warning Warning regarding acceptable percent of timeout error requests for a storage string "50" no
timeout_error_requests_time_aggregator Monitor aggregator for Storage timeout [available values: min, max or avg] string "min" no
timeout_error_requests_timeframe Monitor timeframe for Storage timeout [available values: last_#m (1, 5, 10, 15, or 30), last_#h (1, 2, or 4), or last_1d] string "last_5m" no

Outputs

Name Description
blob_authorization_error_requests_id id for monitor blob_authorization_error_requests
blob_client_other_error_requests_id id for monitor blob_client_other_error_requests
blob_network_error_requests_id id for monitor blob_network_error_requests
blob_server_other_error_requests_id id for monitor blob_server_other_error_requests
blob_throttling_error_requests_id id for monitor blob_throttling_error_requests
blob_timeout_error_requests_id id for monitor blob_timeout_error_requests
blobservices_availability_id id for monitor blobservices_availability
blobservices_latency_id id for monitor blobservices_latency
blobservices_requests_error_id id for monitor blobservices_requests_error
file_authorization_error_requests_id id for monitor file_authorization_error_requests
file_client_other_error_requests_id id for monitor file_client_other_error_requests
file_network_error_requests_id id for monitor file_network_error_requests
file_server_other_error_requests_id id for monitor file_server_other_error_requests
file_throttling_error_requests_id id for monitor file_throttling_error_requests
file_timeout_error_requests_id id for monitor file_timeout_error_requests
fileservices_availability_id id for monitor fileservices_availability
fileservices_latency_id id for monitor fileservices_latency
fileservices_requests_error_id id for monitor fileservices_requests_error
queue_authorization_error_requests_id id for monitor queue_authorization_error_requests
queue_client_other_error_requests_id id for monitor queue_client_other_error_requests
queue_network_error_requests_id id for monitor queue_network_error_requests
queue_server_other_error_requests_id id for monitor queue_server_other_error_requests
queue_throttling_error_requests_id id for monitor queue_throttling_error_requests
queue_timeout_error_requests_id id for monitor queue_timeout_error_requests
queueservices_availability_id id for monitor queueservices_availability
queueservices_latency_id id for monitor queueservices_latency
queueservices_requests_error_id id for monitor queueservices_requests_error
storage_status_id id for monitor storage_status
table_authorization_error_requests_id id for monitor table_authorization_error_requests
table_availability_id id for monitor table_availability
table_client_other_error_requests_id id for monitor table_client_other_error_requests
table_network_error_requests_id id for monitor table_network_error_requests
table_server_other_error_requests_id id for monitor table_server_other_error_requests
table_throttling_error_requests_id id for monitor table_throttling_error_requests
table_timeout_error_requests_id id for monitor table_timeout_error_requests
tableservices_latency_id id for monitor tableservices_latency
tableservices_requests_error_id id for monitor tableservices_requests_error

DataDog documentation: https://docs.datadoghq.com/integrations/azure_storage/

DataDog blog: https://www.datadoghq.com/blog/monitor-azure-storage-datadog/

Azure Storage metrics documentation: https://docs.microsoft.com/en-us/azure/storage/common/storage-monitor-storage-account

Azure Storage metrics detailed documentation https://docs.microsoft.com/en-us/rest/api/storageservices/storage-analytics-metrics-table-schema