I am trying to implement code that runs a check on multiple systems every minute, indefinitely. If a check comes back true (meaning a failure was detected), other code takes corrective action. The problem is that the corrective action takes longer than one minute, and I don’t want clearing the “failure” to delay the other checks or lock up the code.

What would be the best and most efficient way to keep checking all the other systems, skipping only the one with the failure until it is fixed, and then add that system back in once it’s online again?
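To make the behavior I’m after concrete, here is a rough Python sketch of one possible shape for it, not my real code. Everything in it is a placeholder I made up for illustration: the `Monitor` class, the `check` and `fix` callables, and the `in_repair` set. It assumes the checks themselves are quick, that `check(system)` returns `True` when a system has failed, and that the slow corrective action can safely run in a worker thread.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class Monitor:
    """Hypothetical sketch: check systems on a schedule, repair in the background."""

    def __init__(self, systems, check, fix, max_workers=4):
        self.systems = list(systems)
        self.check = check          # placeholder: returns True if the system has failed
        self.fix = fix              # placeholder: slow corrective action
        self.in_repair = set()      # systems currently being fixed (skipped by checks)
        self.lock = threading.Lock()
        self.pool = ThreadPoolExecutor(max_workers=max_workers)

    def _repair(self, system):
        try:
            self.fix(system)
        finally:
            # Put the system back into rotation, even if the fix raised.
            with self.lock:
                self.in_repair.discard(system)

    def run_once(self):
        """Check every system that is not under repair; return the ones checked."""
        with self.lock:
            active = [s for s in self.systems if s not in self.in_repair]
        for system in active:
            if self.check(system):
                with self.lock:
                    self.in_repair.add(system)
                # Hand the slow fix to a worker thread so the loop keeps going.
                self.pool.submit(self._repair, system)
        return active

    def run_forever(self, interval=60.0, stop=None):
        """Run checks every `interval` seconds until `stop` (an Event) is set."""
        stop = stop or threading.Event()
        while True:
            self.run_once()
            if stop.wait(interval):
                break
```

The idea is that a failed system is skipped only while its fix is running and rejoins the checks automatically when the fix finishes. The loop waits `interval` seconds after each pass, so the time spent on the checks themselves adds a little drift to the schedule.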

Please let me know if something isn’t clear. I can’t give away too many technical details! Thanks!!!

