[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Hosts on M/Monit keep going offline and online at regular intervals

From: Abdul Munim Kazia
Subject: Re: Hosts on M/Monit keep going offline and online at regular intervals
Date: Thu, 23 Jan 2014 16:04:07 +0530

I just checked a few of my servers which have had this issue. 
We are using Monit 5.5 and m/monit 2.4. 
The monit poll time is 15 seconds. 
I have changed the Acceptable Report Skew from 3 to 10 for these servers. I will monitor it for a day to see if the issue persists for these servers. I will probably send the logs across if this doesn't fix it. 

Also, I ran monit -v on one of the hosts, and I saw the following:
 M/Monit(s)         = with timeout 5 seconds using credentials
Where is this 5 second timeout set? Will increasing this timeout fix the issue?

Abdul Munim Kazia

On 23 January 2014 15:53, Martin Pala <address@hidden> wrote:
I think this isn't scalability issue - we test with far higher load then 100 hosts (although in the long term we may add load balancing support).


On 23 Jan 2014, at 11:04, Roose, Marco <address@hidden> wrote:


I think this is a really interesting topic. One should think about the implementation of load balancing (and with that High Availability) into m/monit. Is there anything focusing into that in the development queue?


Best regards,

Marco Roose


From: address@hidden [mailto:address@hidden] On Behalf Of Abdul Munim Kazia
Sent: Thursday, January 23, 2014 10:38 AM
To: address@hidden
Subject: Hosts on M/Monit keep going offline and online at regular intervals




I have been using monit and m/monit at my organization for the past year or so. What started out as a simple setup with a handful of servers has now become a full monitoring dashboard with more than 100 hosts listed on m/monit. 


While monit itself works perfectly, at every regular interval of a few minutes, some of the hosts get reported as "no report from monit" in the m/monit console. The report comes in a few seconds and the host becomes green again. This happens fairly regularly, and it happens to a different set of hosts every time. All the hosts and the m/monit host itself are hosted in the same data center, so I don't think that network latency is an issue. 


This could be occurring because m/monit doesn't receive the data for all hosts in time, and it wouldn't be an issue for me, if it didn't crowd the events list with 20-30 events every half an hour. 


I have looked at the server.conf file, but I don't think any of those configuration settings will help me out, if I am not wrong. Has anyone faced this issue before? Is there any way to fix this, either from monit's or m/monit's end? Can I increase this timeout?


Thanks for you help


Abdul Munim Kazia

To unsubscribe:

To unsubscribe:

reply via email to

[Prev in Thread] Current Thread [Next in Thread]