freeipmi-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Freeipmi-devel] Re: Software Deliverables


From: robing
Subject: Re: [Freeipmi-devel] Re: Software Deliverables
Date: Thu, 18 Mar 2004 06:10:22 -0800 (PST)

> ,----[ address@hidden ]
> | 2) I'd like the sensors command to exit with a non-zero return code if
> | any of the sensors are in "ALERT" status.  Ideally there should be
> | some non-zero return code for an actual failure to execute the command
> | properly and a different return code for successful execution but one
> | or more sensors with an ALERT status.
> `----
> I initially thought about it. My concern is, few sensors always report
> ALERT. I can add "ignored sensors list" support and let the user
> eliminate unwanted sensors. 
> 
> Like,
> ;; sensors-conf.scm config file
> ; Ignore the following sensors completely
> (sensors-set-ignore-list!  52 100 101 102 103 104 105 106 107)
> 
> OR
> 
> ; Ignore exit status for the following sensors
> (sensors-set-ignore-exit-list!  52 100 101 102 103 104 105 106 107)

An ignore list sounds useful.  I noticed that when I was querying particular
groups like Temperature, there were some I knew I'd never care about
(e.g. HSC SCSI BP Temp) so it would be nice to just eliminate these up front.

> 
> One good example is PCI slots:
> If they are not populated OR if device has failed, sensors reports as ALERT.
> 
> SAMPLE OUTPUT ON TIGER4 (SINGLE CPU):
> -------------------------------------
> debian-ia64:~/ab# sensors | grep ALERT
> 12: Proc Bd +1.2V (Voltage): 1.22 V (low=1.18/nom=1.20/high=1.21) [ALERT]
> 52: HSC SCSI BP Temp (Temperature): 0.00 C (low=12.00/nom=28.00/high=38.00) 
> [ALERT]
> 69: Pwr Supply 2 (Power Supply): [ALERT]
> 93: Proc 2 Status (Processor): [ALERT]
> 94: Proc 3 Status (Processor): [ALERT]
> 95: Proc 4 Status (Processor): [ALERT]
> 100: PCI HP Slot 1 (Slot Connector): [ALERT]
> 101: PCI HP Slot 2 (Slot Connector): [ALERT]
> 102: PCI HP Slot 3 (Slot Connector): [ALERT]
> 103: PCI HP Slot 4 (Slot Connector): [ALERT]
> 104: PCI HP Slot 5 (Slot Connector): [ALERT]
> 105: PCI HP Slot 6 (Slot Connector): [ALERT]
> 106: PCI HP Slot 7 (Slot Connector): [ALERT]
> 107: PCI HP Slot 8 (Slot Connector): [ALERT]
> debian-ia64:~/ab# 
> 

Actually this makes me wonder if there is a better solution namely 
can't you differentiate between sensors that are truly out of range
(ALERT) versus the ones that appear out of range due to a component
not being present?  If you can tell the difference, it would be
nice if you could give these a different tag like maybe "N/A" 
instead of ALERT.  

-Robin




reply via email to

[Prev in Thread] Current Thread [Next in Thread]