|
From: | Avi Vigder |
Subject: | RE: problem with dead processes |
Date: | Mon, 20 Aug 2012 03:01:50 -0400 |
I do check with pid file, but this is not the issue. We did encounter what you describe, on server startup, and have an init processes to clean the pid files for that. The situation here is that the pid file contains a pid number that does not exist when checking with ‘ps –ef’ but the getpgid returns a positive number on that pid. From: address@hidden [mailto:address@hidden On Behalf Of Martin Pala Hello, what process check type do you use? The pidfile based or pattern based? If pidfile, monit depends on the pidfile content => it is possible that the pidfile contains valid PID, which was assigned to different process after the original one died => monit thinks that the process is running (which in fact is true, as monit is set to watch the given PID and has no other information about how the process should look like). The solution could be to use the pattern based process check. Regards, Martin On Aug 19, 2012, at 5:08 PM, Avi Vigder <address@hidden> wrote: We run monit 5.4 and monitor large number of processes (1K-2K). When I kill all the processes. Most are restarted by monit. but some are marked as running although the processes is not alive. I’ve notices that monit uses getpgid() to determine if the processes is alive, and it seems that the system call returns a positive value although the pid does not exist (checking with ps –ef). Any ideas? -- |
[Prev in Thread] | Current Thread | [Next in Thread] |