monit-general

Re: monit (latest CVS checkout) crashes


From: Martin Pala
Subject: Re: monit (latest CVS checkout) crashes
Date: Thu, 03 Feb 2005 21:52:39 +0100
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.5) Gecko/20050105 Debian/1.7.5-1

Thanks :) However, it seems that the monit binary is stripped, so the symbols cannot be resolved. You can check this with the 'file' command, for example:

unicorn:~/cvs/monit# file monit
monit: ELF 32-bit LSB executable, Intel 80386, version 1 (SYSV), for GNU/Linux 2.2.0, dynamically linked (uses shared libs), not stripped

The output should end with the string "not stripped", as it does here.
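
The same check can be scripted. The following is only a minimal sketch and not part of monit: the function name symbols_status is made up for illustration, and it looks at nothing more than the text that 'file' prints.

```shell
# Hypothetical helper: classify one line of `file` output.
# "not stripped" must be tested first, because it also contains "stripped".
symbols_status() {
    case "$1" in
        *"not stripped"*) echo "not stripped" ;;
        *stripped*)       echo "stripped" ;;
        *)                echo "unknown" ;;
    esac
}
```

It could be called as symbols_status "$(file /usr/local/bin/monit)". If it reports "stripped", gdb will most likely not be able to name the monit functions in the backtrace.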

Could you please compile monit again and keep the debug symbols? Run the following in the monit source directory (assuming you have the CVS sources):

  ./autogen.sh
  make

This produces a binary with debug symbols. Then please run monit again and send the core backtrace as in your last mail.

Thanks,
Martin


Nowak, Bernhard wrote:
Hi,

here is the backtrace. I hope this helps.

Regards,
Bernhard


************************************
gdb /usr/local/bin/monit /core.3852
GNU gdb Red Hat Linux (6.1post-1.20040607.43rh)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...(no debugging symbols found)...Using host libthread_db library "/lib/tls/libthread_db.so.1".

Core was generated by `/usr/local/bin/monit -c /etc/monitrc'.
Program terminated with signal 6, Aborted.
Reading symbols from /lib/tls/libpthread.so.0...(no debugging symbols found)...done.
Loaded symbols for /lib/tls/libpthread.so.0
Reading symbols from /lib/libcrypt.so.1...(no debugging symbols found)...done.
Loaded symbols for /lib/libcrypt.so.1
Reading symbols from /lib/libresolv.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib/libresolv.so.2
Reading symbols from /lib/libnsl.so.1...(no debugging symbols found)...done.
Loaded symbols for /lib/libnsl.so.1
Reading symbols from /lib/libssl.so.4...(no debugging symbols found)...done.
Loaded symbols for /lib/libssl.so.4
Reading symbols from /lib/libcrypto.so.4...(no debugging symbols found)...done.
Loaded symbols for /lib/libcrypto.so.4
Reading symbols from /lib/tls/libc.so.6...(no debugging symbols found)...done.
Loaded symbols for /lib/tls/libc.so.6
Reading symbols from /lib/ld-linux.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib/ld-linux.so.2
Reading symbols from /usr/lib/libgssapi_krb5.so.2...(no debugging symbols found)...done.
Loaded symbols for /usr/lib/libgssapi_krb5.so.2
Reading symbols from /usr/lib/libkrb5.so.3...(no debugging symbols found)...done.
Loaded symbols for /usr/lib/libkrb5.so.3
Reading symbols from /lib/libcom_err.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib/libcom_err.so.2
Reading symbols from /usr/lib/libk5crypto.so.3...(no debugging symbols found)...done.
Loaded symbols for /usr/lib/libk5crypto.so.3
Reading symbols from /lib/libdl.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib/libdl.so.2
Reading symbols from /usr/lib/libz.so.1...(no debugging symbols found)...done.
Loaded symbols for /usr/lib/libz.so.1
Reading symbols from /lib/libnss_files.so.2...(no debugging symbols found)...done.
Loaded symbols for /lib/libnss_files.so.2
#0  0x0019e7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
(gdb) bt
#0  0x0019e7a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1  0x001de955 in raise () from /lib/tls/libc.so.6
#2  0x001e0319 in abort () from /lib/tls/libc.so.6
#3  0x00211f9a in __libc_message () from /lib/tls/libc.so.6
#4  0x00218528 in _int_free () from /lib/tls/libc.so.6
#5  0x00218afa in free () from /lib/tls/libc.so.6
#6  0x080616e2 in signal ()
#7  0x08058ca8 in signal ()
#8  0x08058ef1 in signal ()
#9  0x0805892b in signal ()
#10 0x080501fe in ?? ()
#11 0x00000000 in ?? ()
(gdb)







-----Original Message-----
From: address@hidden [mailto:address@hidden] On Behalf Of Martin Pala
Sent: Tuesday, February 01, 2005 8:51 PM
To: This is the general mailing list for monit
Subject: Re: monit (latest CVS checkout) crashes


A core file from the crashed process would help (note that you can
run 'ulimit -c unlimited' before the test to lift any core size
limit). Once you have the core, you can use the following sequence to
trace the stack:

1.) run gdb with monit binary and core as arguments:
   gdb /path/to/monit /path/to/core

2.) show backtrace - on gdb prompt:
(gdb) bt

... now you will see what preceded the crash ... this output may help
us to fix the problem quickly.

(If you are running Solaris, you can just use the pstack command on the core.)
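
The gdb steps above can also be run without an interactive session. This is only a sketch: collect_backtrace is a made-up name, and it assumes a gdb that understands the -batch and -ex options (older releases may need a command file passed with -x instead).

```shell
# Hypothetical helper: print the backtrace from a core file and exit.
# Assumes gdb supports -batch (exit after processing) and -ex (run one command).
collect_backtrace() {
    binary=$1
    core=$2
    if [ ! -r "$binary" ] || [ ! -r "$core" ]; then
        echo "usage: collect_backtrace /path/to/monit /path/to/core" >&2
        return 2
    fi
    gdb -batch -ex bt "$binary" "$core"
}
```

For example, run 'ulimit -c unlimited' in the shell that starts monit, wait for the crash, and then run collect_backtrace /path/to/monit /path/to/core > backtrace.txt to get a file that can be attached to a mail.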

You can also look at the short debug howto in monit's FAQ.txt. It
describes how to trace system calls, which is useful information
as well (the Solaris alternative is truss).
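
On Linux, a system call trace is usually taken with strace. The sketch below is an assumption about a reasonable invocation, not a quote from FAQ.txt; trace_syscalls and the output file name are made up for illustration.

```shell
# Hypothetical wrapper: run a command under strace and save the trace to a file.
# -f follows child processes, -o writes the trace to the named file.
trace_syscalls() {
    if [ $# -lt 2 ]; then
        echo "usage: trace_syscalls OUTPUT_FILE COMMAND [ARGS...]" >&2
        return 2
    fi
    out=$1
    shift
    strace -f -o "$out" "$@"
}
```

A possible call is trace_syscalls /tmp/monit.strace /path/to/monit -c /etc/monitrc -v. The -f option matters here because monit goes into the background, and the crash may happen in a child process.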

Thanks,
Martin


Nowak, Bernhard wrote:

Hi all,

I just compiled monit from CVS because I wanted to test the new URL and HTTP
content monitoring feature.

I use monit to monitor two services (aaa and bbb) and restart them at most two
times. If that happens, I want to switch over to the standby system by stopping
heartbeat. (Monit was started by heartbeat. I know the manual tells me to do it
the other way around, but I want it that way.)
As long as the "Event ratio dependant action rules" are not yet supported, I will
use a workaround: monitoring the monit service pages for the words "not monitored"
(ccc_monit and ddd_monit).


Unfortunately monit crashes.

/etc/init.d/monit status gives me a "monit dead but pid file exists".
The last line of monit.log is: "monit HTTP server started"

I also tried to use monit -c /etc/monit.conf -v. This started monit (with
all the debug messages), but it also crashed after a few seconds.

I attached my config as well.

I'm no developer, just a user.

Thanks,

Bernhard

***************************************************

set daemon 30
set logfile /var/log/monit.log
set mailserver localhost
set mail-format { from: address@hidden
set alert address@hidden
set httpd port 2812 and allow localhost allow 192.168.9.41
    allow 192.168.10.171

#------services to watch-----------------------------------------------------------
 check host aaa with address 127.0.0.1
    start program = "/etc/init.d/ABC start"
    stop program  = "/etc/init.d/ABC stop"
    if failed port 20001 with timeout 60 seconds then restart
    if 2 restarts within 20 cycles then timeout


 check host bbb with address 192.168.10.173
    start program = "/etc/init.d/DEF start"
    stop program  = "/etc/init.d/DEF stop"
    if failed host 192.168.10.173 port 8080 protocol http with timeout 60 seconds then restart
    if 2 restarts within 20 cycles then timeout

#-----use monit http content feature to check for "not monitored"  -------

 check host ccc_monit with address 192.168.10.173
 if failed url http://192.168.10.173:2812/aaa
    and content != "not monitored" with timeout 60 seconds
    then exec "/etc/init.d/heartbeat stop"


 check host ddd_monit with address 192.168.10.173
 if failed url http://192.168.10.173:2812/bbb
    and content != "not monitored" with timeout 60 seconds
    then exec "/etc/init.d/heartbeat stop"

***************************************************************************************************
The information in this email is confidential and may be legally privileged.  
Access to this email by anyone other than the intended addressee is 
unauthorized.  If you are not the intended recipient of this message, any 
review, disclosure, copying, distribution, retention, or any action taken or 
omitted to be taken in reliance on it is prohibited and may be unlawful.  If 
you are not the intended recipient, please reply to or forward a copy of this 
message to the sender and delete the message, any attachments, and any copies 
thereof from your system.
***************************************************************************************************


--
To unsubscribe:
http://lists.nongnu.org/mailman/listinfo/monit-general






