Hello,
We recently put in place a policy to reset all our DRACs each morning. We did this because we found that errors would sometimes occur on servers without our knowing about them until someone physically noticed while walking around a datacentre. The errors would come through after a DRAC reset.
So we thought: great idea, reset all the DRACs each morning and we will pick up all the problems.
Unfortunately, we are now getting a lot of false positives each day. Things like "The LOM riser FRU for slot 9 FRU ID 10 is not functioning", and other errors which make little to no sense to us. These errors seem to self-resolve if another DRAC reset is done, or sometimes, as is the case with PSUs failing, they come back about 3 minutes after a DRAC reset.
My question is: is there a way, from the DRAC, to delay sending email alerts for a period (say 3 minutes) and only send an alert if the condition is still unresolved at the end of that period?
If not from the DRAC, can this email delay be done from OME?
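To make the behaviour we are after concrete, here is a rough Python sketch of the hold-off logic. It is purely illustrative and does not call any DRAC or OME interface: the class name AlertHoldOff, the method observe and the alert strings are all made up for the example, and the 180-second hold is just the "say 3 minutes" figure from above.

```python
from typing import Dict, Iterable, List, Set

HOLD_SECONDS = 180  # assumed hold-off period ("say 3 minutes")


class AlertHoldOff:
    """Hypothetical helper: release an alert only after it has stayed
    active for hold_seconds without clearing in between."""

    def __init__(self, hold_seconds: float = HOLD_SECONDS) -> None:
        self.hold_seconds = hold_seconds
        self.first_seen: Dict[str, float] = {}  # alert text -> time first seen
        self.sent: Set[str] = set()             # alerts already released

    def observe(self, active: Iterable[str], now: float) -> List[str]:
        """Record the alerts active at time `now` (in seconds) and return
        the ones that have just become due for emailing."""
        active_set = set(active)

        # Forget anything that has cleared, so a self-resolving error
        # never reaches the mailbox and a later recurrence starts afresh.
        for alert in list(self.first_seen):
            if alert not in active_set:
                del self.first_seen[alert]
                self.sent.discard(alert)

        due: List[str] = []
        for alert in sorted(active_set):
            started = self.first_seen.setdefault(alert, now)
            if alert not in self.sent and now - started >= self.hold_seconds:
                self.sent.add(alert)
                due.append(alert)
        return due


if __name__ == "__main__":
    hold = AlertHoldOff()
    # Two made-up alerts appear straight after a reset; nothing is sent yet.
    print(hold.observe(["riser FRU not functioning", "PSU 1 failed"], now=0))
    # The riser alert has cleared by itself; the PSU alert is still waiting.
    print(hold.observe(["PSU 1 failed"], now=60))
    # The PSU alert has now persisted for the full hold period.
    print(hold.observe(["PSU 1 failed"], now=180))
```

In other words, an error that clears within the hold period is dropped silently, and only one that is still present at the end of it would generate an email.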
Thx,
John Bradshaw