Alerting   644


PagerDuty Incident Response Documentation
monitoring  alerting 
8 minutes ago by summerwind
messagebird/sachet: SMS alerts for Prometheus' Alertmanager
An HTTP API that accepts Alertmanager webhook calls and allows an end-user to configure it for the SMS provider of their dreams.
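As a rough illustration of the webhook-to-SMS idea described above, the sketch below condenses an Alertmanager webhook payload into one short message. It is not sachet's code: the function name `webhook_to_sms`, the 160-character limit, and the example payload are assumptions made for this example. Only the `status`, `alerts`, `labels`, and `annotations` fields follow Alertmanager's webhook payload format.

```python
def webhook_to_sms(payload: dict, max_len: int = 160) -> str:
    """Condense an Alertmanager webhook payload into one SMS-sized string.

    Hypothetical helper for illustration; max_len=160 is an assumed limit.
    """
    status = payload.get("status", "unknown").upper()
    parts = []
    for alert in payload.get("alerts", []):
        labels = alert.get("labels", {})
        annotations = alert.get("annotations", {})
        name = labels.get("alertname", "unnamed")
        summary = annotations.get("summary", "")
        # Prefer "name: summary" when a summary annotation is present
        parts.append(f"{name}: {summary}" if summary else name)
    text = f"[{status}] " + "; ".join(parts)
    # Truncate so the message fits the assumed length limit
    return text[:max_len]


# Hand-written example payload (not captured from a real system)
example = {
    "status": "firing",
    "alerts": [
        {
            "labels": {"alertname": "Vault_node_sealed", "severity": "page"},
            "annotations": {"summary": "vault-1 is sealed"},
        }
    ],
}
sms_text = webhook_to_sms(example)
```

A real receiver would still need an HTTP endpoint and a provider-specific send call, both of which are left out here.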
prometheus  alerting  sms 
6 days ago by some_hren
Time Series Anomaly Detection Algorithms – Stats and Bots
First, you can use supervised learning to teach trees to classify anomaly and non-anomaly data points. In order to do that you’d need to have labeled anomaly data points.
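To make the supervised idea concrete, here is a minimal sketch that fits a depth-one decision tree (a "stump") on labeled points. It is a simplification chosen for brevity, not the article's method: the single feature (deviation from an expected value), the function names, and the sample data are all assumptions.

```python
def fit_stump(values, labels):
    """Find threshold t for the rule `value > t => anomaly` with fewest errors.

    Returns None when there are fewer than two distinct values to split on.
    """
    candidates = sorted(set(values))
    best_t, best_errors = None, len(values) + 1
    # Try the midpoint between each pair of consecutive distinct values
    for lo, hi in zip(candidates, candidates[1:]):
        t = (lo + hi) / 2
        errors = sum((v > t) != bool(y) for v, y in zip(values, labels))
        if errors < best_errors:
            best_t, best_errors = t, errors
    return best_t


def is_anomaly(threshold, value):
    """Classify a new point with the fitted stump."""
    return value > threshold


# Made-up training data: deviation from the expected value, 1 = labeled anomaly
deviations = [0.1, 0.3, 0.2, 4.0, 0.4, 5.5, 0.2]
labels = [0, 0, 0, 1, 0, 1, 0]
threshold = fit_stump(deviations, labels)
```

Real tree learners split on many features and to greater depth, but they depend on labeled anomalies in the same way this stump does.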
ml  algorithms  alerting  anomaly  stats  time  series 
18 days ago by dano
Telemetry: add prometheus endpoint option · Issue #2937 · hashicorp/vault
You can use the blackbox exporter for that. For example, in blackbox.yml you can have:
vault_unseal:
  prober: http
  timeout: 5s
  http:
    valid_status_codes: [200, 429]
    method: GET
    no_follow_redirects: true
    fail_if_ssl: false
    fail_if_not_ssl: false
    fail_if_matches_regexp:
      - 'sealed":true'

The valid status codes are 200 and 429, because the standby node replies with a 429 (which is expected) and the active node with a 200.
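The pass/fail decision this probe module encodes can be sketched in a few lines. This is only an illustration of the logic and not the exporter's implementation: `probe_succeeds` and the two constants are hypothetical names, and the response bodies in the usage lines are invented.

```python
import re

# Values copied from the module above; the names are assumptions for this sketch
VALID_STATUS_CODES = {200, 429}
FAIL_IF_MATCHES = [re.compile(r'sealed":true')]


def probe_succeeds(status_code: int, body: str) -> bool:
    """Return True when the response would count as a successful probe."""
    # Any status outside the allowed set fails the probe
    if status_code not in VALID_STATUS_CODES:
        return False
    # The probe also fails if the body matches any failure pattern
    return not any(pattern.search(body) for pattern in FAIL_IF_MATCHES)


active_ok = probe_succeeds(200, '{"sealed":false}')
standby_ok = probe_succeeds(429, '{"sealed":false,"standby":true}')
sealed_ok = probe_succeeds(200, '{"sealed":true}')
```

Accepting 429 alongside 200 is what keeps a healthy standby node from being reported as down.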

The Prometheus alerting rule to trigger the alerts:
- alert: Vault_node_sealed
  expr: probe_success{job="vault_sealed"} != 1
  for: 1m
  labels:
    severity: xxx
  annotations: xxx

You can also use statsd-exporter to gather more specific stats and build better alerts with expressions like:
expr: sum(increase(vault_core_leadership_lost_count{job="example"}[1h])) > 5
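To show roughly what that expression computes, the sketch below adds up counter growth per series, treats a drop in value as a counter reset, and compares the total with the threshold. It is a simplification: Prometheus' `increase()` also extrapolates to the edges of the time window, which is ignored here. The function names and sample values are assumptions.

```python
def simple_increase(samples):
    """Total growth of a counter over its samples, tolerating resets."""
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        # A drop means the counter restarted from zero, so count the new value
        total += cur - prev if cur >= prev else cur
    return total


def leadership_lost_alert(series_by_instance, threshold=5):
    """Mimic sum(increase(...)) > threshold across all series."""
    return sum(simple_increase(s) for s in series_by_instance.values()) > threshold


# Invented one-hour windows of samples for two instances
window = {"vault-1": [0, 1, 3, 0, 2], "vault-2": [10, 11]}
should_fire = leadership_lost_alert(window)
```

Here `vault-1` contributes 5 (including growth after its reset) and `vault-2` contributes 1, so the total of 6 exceeds the threshold of 5.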

Hope it helps.
vault  prometheus  alerting  snippets 
5 weeks ago by bbrown
Alert on single (and special) log entries in Elasticsearch
elasticsearch  monitoring  alerting 
8 weeks ago by ahus1
Yelp/elastalert: Easy & Flexible Alerting With ElasticSearch
Easy & Flexible Alerting With ElasticSearch. Contribute to Yelp/elastalert development by creating an account on GitHub.
github  python  alerting  elasticsearch 
9 weeks ago by snahor
Dashbird - Full Serverless Visibility & Troubleshooting
Get an instant overview of your whole serverless stack and save money by optimising your Lambda functions. Health metrics on a powerful dashboard, error alerts through Slack and email, tracing with AWS X-Ray, API Gateway support, live tailing and much more. Sign up for free!
alerting  aws  error  lambda  monitoring  serverless  tracking 
11 weeks ago by flipchen


