Skip to content

Commit 874c24b

Browse files
authored
Merge pull request #1911 from stackhpc/alert-on-radosgw-usage-exporter
Add alerting for RadosGW Usage Exporter metrics
2 parents e46c18c + 82323bb commit 874c24b

File tree

2 files changed

+17
-0
lines changed

2 files changed

+17
-0
lines changed

etc/kayobe/kolla/config/prometheus/ceph.rules

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -205,3 +205,14 @@ groups:
205205
description: "{{ $value }} OSD requests are taking too long to process (osd_op_complaint_time exceeded)"
206206

207207
{% endraw %}
208+
{% if stackhpc_enable_radosgw_usage_exporter | bool %}
209+
- name: rgws
210+
rules:
211+
- alert: RadosGWUsageExporterNotServingMetrics
212+
expr: absent(radosgw_user_metadata)
213+
for: 5m
214+
labels:
215+
severity: critical
216+
annotations:
217+
description: "The RadosGW Usage Exporter has not been serving RGW metrics for more than 5 minutes. The exporter may be unable to reach the RGWs."
218+
{% endif %}
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
---
2+
features:
3+
- |
4+
Added a new alert ``RadosGWUsageExporterNotServingMetrics``, which will
5+
fire when the RadosGW Usage Exporter fails to serve RGW metrics for more
6+
than 5 minutes.

0 commit comments

Comments
 (0)