Quantcast
Channel: THWACK: Popular Discussions - Network Configuration Manager
Viewing all articles
Browse latest Browse all 4266

Methods to reduce the amount of email/alert spam?

$
0
0

I'm getting too many emails from our NCM instance and would like to reduce them somehow

 

The biggest spammer is from backup jobs that have a failure, and while I do want to know if there's failures I don't want to know about it every single time the job runs

I have one job that runs every hour for our core devices so I can be notified as quickly as possible if there's any unexpected changes. Or be able to roll back to a very recent config if a device is being worked on and goes pear shaped

However i'm getting emails often from false alerts of a device being down and unreachable (that is a separate performance issue being looked into) so i'm being bombarded with emails like...

 

  8/05/2019 3:10:19 PM : Started Core Backup - Download 'running' configs hourly

 

  Download Configs from Devices

  Where (Classification = 'Core')

 

 

  Devices: 127

  Errors: 2

 

I do still want to get emails as soon as a change is detected, but I don't want to be getting these emails every hour.

1) Is it possible to have these sorts of emails summarized and sent daily so I can see i.e. 'this is a summary of the devices that were down in the past 24 hours / 24 times the job has run'

2) Or can I simply disable this email entirely, yet still have the change summary emailed whenever a change is detected?

 

 

If I can do the latter and stop those failure emails entirely. Is there another job/schedule I can setup to alert me about devices that have failed to backup for an extended period of time? We do a daily backup all of our managed customer equipment as well, but quite frankly I don't care if that backup job fails for any of those routers once in a while as they very rarely change. And sometimes connections are congested, there's a power outage etc, it's not a priority and it's not important

 

However I do need to eventually get some sort of alert saying "hey router XYZ has failed to backup for 7 days consecutively, now you should look at this and find out whats going wrong" because maybe a tech has configured most of the router fine, it's working the client is online and happy and therefore has no need to contact us. But the installer put the wrong SSH username/password, forgot to add the right ACL etc. I don't want to find out 10 months down the track their router has failed and we don't have any backups of it


Viewing all articles
Browse latest Browse all 4266

Latest Images

Trending Articles



Latest Images

<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>