monit-general
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Avoid alert on log rotation


From: Paul Theodoropoulos
Subject: Re: Avoid alert on log rotation
Date: Thu, 5 Nov 2015 10:34:12 -0800
User-agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:42.0) Gecko/20100101 Thunderbird/42.0

Another option might be to use the 'cron' format to tell it not to monitor during just that window. We do this during the final part of our db backup process when it requires a lock for a minute or so to finalize.

check program db-status with path /usr/local/bin/check_status with timeout 15 seconds
not every "50-55 0 * * *"
if status != 0 then alert


On 11/5/2015 8:41 AM, Russell Simpkins wrote:
Have you considered adjusting how often monit checks and set your cron job for log rotate to run in between? E.g. run monit every 5 seconds and have your log rotate happen on second 2. The PID would change, but shouldn't throw an alert.

On Thu, Nov 5, 2015 at 11:35 AM, Guillaume François <address@hidden> wrote:
Hello,

So we should also concluded there is currently no way to delayed/avoid the first alerts and that the only possibility is to "unmonitor" which can cause issues and is against goal of Monit ?

Explanation: if we "unmonitor" the check (sometime process, sometime program) and an issue prevent the script in charge of "monitoring" again to successfully finished, the service will be down and we won't get alerted as it is "unmonitored".

In fact "Log Rotation" was only one simple possible restart cause, but we have lots of custom scripts also for our custom tools like "restart if some conditions are met" with help of monit process exit code.

Maybe a workaround would be to cut the action script in several part so that even if the restart is failing, the monitoring is still issued.

Another workaround would be to check the status of the "monit check" and if this status is "unmonitor" for several cycles, force "monitoring" again in order to be notified that something bad happens.

What is your point of view about these ?

Best Regards.


2015-11-05 12:31 GMT+01:00 Martin Pala <address@hidden>:
Hello,

monit tests if the process is running - it cannot differentiate that the process was stopped intentionally by 3rd party action ... if you shutdown the process intentionally during logrotate, you should disable its monitoring first (part of logratotation "prerotate" script):

        monit unmonitor myprocess

and when logratotation finished, enable it again (part of logratotation "postrotate" script):

        monit monitor myprocess



Regards,
Martin



> On 05 Nov 2015, at 11:42, Guillaume François <address@hidden> wrote:
>
> Hello,
>
> Everyday we got an "non-exist" then "exist" alert when the process is shutdown then started for log rotation purpose.
>
> We would like to NOT receive the email linked to this kind of behavior but still get the email when the process is really down for some time.
>
> We would like something like
>
> check process myprocess with pidfile /var/run/myprocesspid.pid
>         start program = "/usr/sbin/service myprocess start"
>         stop program  = "/usr/sbin/service myprocess stop"
>         if failed port 5555 protocol tcp for 3 cycles then restart
>         if 5 restarts within 5 cycles then timeout
> wanted ->        alert on {exist, non-exist} only if 5 event in 5 cycles.
>
> can be interpreted as -> only send alert for event 'exist' and event 'non-exist' if alert is triggerred 5 time in the last 5 cycles
>
> We saw we can change the 'alert/noalert' rule for some 'email/event' but we cannot get a "conditional" rules for theses.
>
> Best Regards,
>
>
> --
> -----BEGIN GEEK CODE BLOCK-----
> Version: 3.1
> GCS/IT d(+) s++:- a C++$ ULC(+)>+++$ !P--- L+>$ !E---? W+++$ !N* !o-- K--? w(+) !O---? !M- !V--? PS+? !PE Y+ PGP++>+++ !t-- !5 !X- R(+)>++* tv-? b(-) DI !D- G(+)>+ e+++ h--() r->$ y?*
> ------END GEEK CODE BLOCK------
> --
> To unsubscribe:
> https://lists.nongnu.org/mailman/listinfo/monit-general


--
To unsubscribe:
https://lists.nongnu.org/mailman/listinfo/monit-general



--
-----BEGIN GEEK CODE BLOCK-----
Version: 3.1
GCS/IT d(+) s++:- a C++$ ULC(+)>+++$ !P--- L+>$ !E---? W+++$ !N* !o-- K--? w(+) !O---? !M- !V--? PS+? !PE Y+ PGP++>+++ !t-- !5 !X- R(+)>++* tv-? b(-) DI !D- G(+)>+ e+++ h--() r->$ y?*
------END GEEK CODE BLOCK------

--
To unsubscribe:
https://lists.nongnu.org/mailman/listinfo/monit-general



--
To unsubscribe:
https://lists.nongnu.org/mailman/listinfo/monit-general

-- 
Paul Theodoropoulos
www.anastrophe.com


reply via email to

[Prev in Thread] Current Thread [Next in Thread]