From MAILER-DAEMON Mon Nov 01 14:24:39 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PCz3f-0004d0-27 for mharc-freeipmi-users@gnu.org; Mon, 01 Nov 2010 14:24:39 -0400 Received: from [140.186.70.92] (port=43488 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PCz3c-0004Xl-5f for freeipmi-users@gnu.org; Mon, 01 Nov 2010 14:24:37 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PCz3a-0002e1-Na for freeipmi-users@gnu.org; Mon, 01 Nov 2010 14:24:36 -0400 Received: from nspiron-2.llnl.gov ([128.115.41.82]:22712) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PCz3a-0002dR-HT; Mon, 01 Nov 2010 14:24:34 -0400 X-Attachments: None Received: from crazyclimber.llnl.gov (HELO [134.9.93.113]) ([134.9.93.113]) by nspiron-2.llnl.gov with ESMTP; 01 Nov 2010 11:24:31 -0700 From: Al Chu To: freeipmi-users@gnu.org, freeipmi-devel@gnu.org Content-Type: text/plain Date: Mon, 01 Nov 2010 11:24:31 -0700 Message-Id: <1288635871.5443.30.camel@crazyclimber.llnl.gov> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-19.el5) Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. Cc: Subject: [Freeipmi-users] FreeIPMI 0.8.11 Released X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 01 Nov 2010 18:24:37 -0000 Thanks to Holger Liebig, FreeIPMI now has support for all Fujitsu iRMC S1 and S2 systems. http://download.gluster.com/pub/freeipmi/0.8.11/ FreeIPMI 0.8.11 - 10/28/10 -------------------------- o Support Fujitsu iRMC S1 and iRMC S2 OEM extensions in ipmi-sensors and ipmi-sel. o Support --output-oem-event-string in ipmi-sel. o Other minor documentation/bug fixes. Al -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 12:41:08 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFsBw-0000q3-9g for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 12:41:08 -0500 Received: from [140.186.70.92] (port=33057 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFpgj-0001OW-7C for freeipmi-users@gnu.org; Tue, 09 Nov 2010 10:00:48 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFpgg-0002Ni-Pl for freeipmi-users@gnu.org; Tue, 09 Nov 2010 10:00:45 -0500 Received: from snark.apl.washington.edu ([128.208.78.3]:42937) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFpgg-0002Fr-CH for freeipmi-users@gnu.org; Tue, 09 Nov 2010 10:00:42 -0500 Received: from snark.apl.washington.edu (localhost [127.0.0.1]) by localhost (Postfix) with ESMTP id 1BA743A018C; Tue, 9 Nov 2010 07:00:17 -0800 (PST) Received: from smtp.apl.washington.edu (smtp.apl.washington.edu [128.208.78.8]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by snark.apl.washington.edu (Postfix) with ESMTPS id 07FEE3A00D1 for ; Tue, 9 Nov 2010 07:00:16 -0800 (PST) Received: from NEWWORKSTATION (pool-96-255-71-169.washdc.fios.verizon.net [96.255.71.169]) (using TLSv1 with cipher RC4-MD5 (128/128 bits)) (No client certificate requested) by smtp.apl.washington.edu (Postfix) with ESMTP id 5C78BE39E6 for ; Tue, 9 Nov 2010 07:00:15 -0800 (PST) From: "James Aaron Nix-Gomez" To: Date: Tue, 9 Nov 2010 10:00:14 -0500 Message-ID: <026801cb801e$d137f270$73a7d750$@washington.edu> MIME-Version: 1.0 X-Mailer: Microsoft Office Outlook 12.0 Thread-Index: AcuAHtBOGZTmw30WTq6t5WuwtbdUmQ== Content-Language: en-us X-Antivirus: Scanned by F-Prot Antivirus (http://www.f-prot.com) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-Mailman-Approved-At: Tue, 09 Nov 2010 12:41:06 -0500 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Subject: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 15:00:48 -0000 Good morning, I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. I have attached the debug information below. We have other SuperMicro X8DTH-if machines, with slightly different RAID hardware, where ipmi-sensors works flawlessly. I would appreciate any advice on how to troubleshoot this issue. Best regards, Aaron Nix-Gomez Applied Physics Laboratory University of Washington aaron@apl.washington.edu [root@aaronacint2010 Aaron]# uname -a Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 19:46:25 UTC 2010 x86_64 x86_64 x86_64 GNU/Linux [root@aaronacint2010 Aaron]# ipmi-sensors --version ipmi-sensors - 0.7.12 Copyright (C) 2003-2008 FreeIPMI Core Team This program is free software; you may redistribute it under the terms of the GNU General Public License. This program has absolutely no warranty. [root@aaronacint2010 Aaron]# ipmi-sensors --debug Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost ===================================================== Get SDR Repository Info Request ===================================================== [ 20h] = cmd[ 8b] ===================================================== Get SDR Repository Info Response ===================================================== [ 20h] = cmd[ 8b] [ 0h] = comp_code[ 8b] [ 1h] = sdr_version_major[ 4b] [ 5h] = sdr_version_minor[ 4b] [ 0h] = record_count[ 8b] ipmi_sdr_cache_create: internal IPMI error [root@aaronacint2010 Aaron]# ipmi-sensors --debug Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost ===================================================== Get SDR Repository Info Request ===================================================== [ 20h] = cmd[ 8b] ===================================================== Get SDR Repository Info Response ===================================================== ipmi_sdr_cache_create: internal IPMI error [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost ===================================================== Get SDR Repository Info Request ===================================================== [ 20h] = cmd[ 8b] ===================================================== Get SDR Repository Info Response ===================================================== ipmi_sdr_cache_create: internal IPMI error [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost ===================================================== Get SDR Repository Info Request ===================================================== [ 20h] = cmd[ 8b] ===================================================== Get SDR Repository Info Response ===================================================== [ 20h] = cmd[ 8b] [ 0h] = comp_code[ 8b] [ 1h] = sdr_version_major[ 4b] [ 5h] = sdr_version_minor[ 4b] [ 1Ch] = record_count[16b] [ 80h] = free_space[ 8b] ipmi_sdr_cache_create: internal IPMI error [root@aaronacint2010 Aaron]# From MAILER-DAEMON Tue Nov 09 13:21:31 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFsp1-0005Ye-De for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:21:31 -0500 Received: from [140.186.70.92] (port=49146 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFsoy-0005YU-Ue for freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:21:30 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFsox-0007PF-Gd for freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:21:28 -0500 Received: from nspiron-3.llnl.gov ([128.115.41.83]:41697 helo=smtp.llnl.gov) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFsox-0007Oi-44 for freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:21:27 -0500 X-Attachments: None Received: from crazyclimber.llnl.gov (HELO [134.9.93.113]) ([134.9.93.113]) by smtp.llnl.gov with ESMTP; 09 Nov 2010 10:21:23 -0800 Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error From: Al Chu To: James Aaron Nix-Gomez In-Reply-To: <026801cb801e$d137f270$73a7d750$@washington.edu> References: <026801cb801e$d137f270$73a7d750$@washington.edu> Content-Type: text/plain Date: Tue, 09 Nov 2010 10:21:23 -0800 Message-Id: <1289326883.1979.21.camel@crazyclimber.llnl.gov> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-19.el5) Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. Cc: "freeipmi-users@gnu.org" X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 18:21:30 -0000 Hi James, Gut feeling is that something is wrong with the BMC on that motherboard. > [ 0h] = record_count[ 8b] > > ipmi_sdr_cache_create: internal IPMI error > [ 1Ch] = record_count[16b] > > [ 80h] = free_space[ 8b] > > ipmi_sdr_cache_create: internal IPMI error In the first payload above, you receive a 0 record_count, which is of course very bad. Then in the below you get a (seemingly) correct record_count and an extra field. The full payload of this packet is supposed to be something like this: The fact that on 3-4 different rusn in your debug output, the SDR Info command is returning inconsistent data, incorrect data, and multiple sizes of payloads is very bad. Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. Al On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > Good morning, > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. I > have attached the debug information below. We have other SuperMicro > X8DTH-if machines, with slightly different RAID hardware, where ipmi-sensors > works flawlessly. I would appreciate any advice on how to troubleshoot this > issue. > > > > Best regards, > > > > Aaron Nix-Gomez > > Applied Physics Laboratory > > University of Washington > > aaron@apl.washington.edu > > > > > > [root@aaronacint2010 Aaron]# uname -a > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 19:46:25 > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > ipmi-sensors - 0.7.12 > > Copyright (C) 2003-2008 FreeIPMI Core Team > > This program is free software; you may redistribute it under the terms of > > the GNU General Public License. This program has absolutely no warranty. > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > Caching SDR repository information: > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > ===================================================== > > Get SDR Repository Info Request > > ===================================================== > > [ 20h] = cmd[ 8b] > > ===================================================== > > Get SDR Repository Info Response > > ===================================================== > > [ 20h] = cmd[ 8b] > > [ 0h] = comp_code[ 8b] > > [ 1h] = sdr_version_major[ 4b] > > [ 5h] = sdr_version_minor[ 4b] > > [ 0h] = record_count[ 8b] > > ipmi_sdr_cache_create: internal IPMI error > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > Caching SDR repository information: > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > ===================================================== > > Get SDR Repository Info Request > > ===================================================== > > [ 20h] = cmd[ 8b] > > ===================================================== > > Get SDR Repository Info Response > > ===================================================== > > ipmi_sdr_cache_create: internal IPMI error > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > Caching SDR repository information: > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > ===================================================== > > Get SDR Repository Info Request > > ===================================================== > > [ 20h] = cmd[ 8b] > > ===================================================== > > Get SDR Repository Info Response > > ===================================================== > > ipmi_sdr_cache_create: internal IPMI error > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > Caching SDR repository information: > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > ===================================================== > > Get SDR Repository Info Request > > ===================================================== > > [ 20h] = cmd[ 8b] > > ===================================================== > > Get SDR Repository Info Response > > ===================================================== > > [ 20h] = cmd[ 8b] > > [ 0h] = comp_code[ 8b] > > [ 1h] = sdr_version_major[ 4b] > > [ 5h] = sdr_version_minor[ 4b] > > [ 1Ch] = record_count[16b] > > [ 80h] = free_space[ 8b] > > ipmi_sdr_cache_create: internal IPMI error > > [root@aaronacint2010 Aaron]# > > _______________________________________________ > Freeipmi-users mailing list > Freeipmi-users@gnu.org > http://BLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 13:49:40 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFtGG-0003eg-RK for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:49:40 -0500 Received: from [140.186.70.92] (port=60680 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFtGE-0003eR-DI for freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:49:39 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFtGC-00078o-Q1 for freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:49:38 -0500 Received: from nspiron-2.llnl.gov ([128.115.41.82]:13273) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFtGC-00078M-Fu for freeipmi-users@gnu.org; Tue, 09 Nov 2010 13:49:36 -0500 X-Attachments: None Received: from crazyclimber.llnl.gov (HELO [134.9.93.113]) ([134.9.93.113]) by nspiron-2.llnl.gov with ESMTP; 09 Nov 2010 10:49:34 -0800 Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error From: Al Chu To: James Aaron Nix-Gomez In-Reply-To: <1289326883.1979.21.camel@crazyclimber.llnl.gov> References: <026801cb801e$d137f270$73a7d750$@washington.edu> <1289326883.1979.21.camel@crazyclimber.llnl.gov> Content-Type: text/plain Date: Tue, 09 Nov 2010 10:49:34 -0800 Message-Id: <1289328574.1979.23.camel@crazyclimber.llnl.gov> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-19.el5) Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. Cc: "freeipmi-users@gnu.org" X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 18:49:39 -0000 Oops, forgot to cut and paste in what the payload was supposed to look like: [ 20h] = cmd[ 8b] [ 0h] = comp_code[ 8b] [ 1h] = sdr_version_major[ 4b] [ 5h] = sdr_version_minor[ 4b] [ 20h] = record_count[16b] [ FFFEh] = free_space[16b] [ 0h] = most_recent_addition_timestamp[32b] [ 0h] = most_recent_erase_timestamp[32b] [ 0h] = get_sdr_repository_allocation_info_command_supported[ 1b] [ 1h] = reserve_sdr_repository_command_supported[ 1b] [ 0h] = partial_add_sdr_command_supported[ 1b] [ 0h] = delete_sdr_command_supported[ 1b] [ 0h] = reserved[ 1b] [ 2h] = modal_non_modal_sdr_repository_update_operation_supported[ 2b] [ 0h] = overflow_flag[ 1b] Al On Tue, 2010-11-09 at 10:21 -0800, Al Chu wrote: > Hi James, > > Gut feeling is that something is wrong with the BMC on that motherboard. > > > [ 0h] = record_count[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > [ 1Ch] = record_count[16b] > > > > [ 80h] = free_space[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > In the first payload above, you receive a 0 record_count, which is of > course very bad. Then in the below you get a (seemingly) correct > record_count and an extra field. The full payload of this packet is > supposed to be something like this: > > The fact that on 3-4 different rusn in your debug output, the SDR Info > command is returning inconsistent data, incorrect data, and multiple > sizes of payloads is very bad. > > Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. > > Al > > On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > > Good morning, > > > > > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > > consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. I > > have attached the debug information below. We have other SuperMicro > > X8DTH-if machines, with slightly different RAID hardware, where ipmi-sensors > > works flawlessly. I would appreciate any advice on how to troubleshoot this > > issue. > > > > > > > > Best regards, > > > > > > > > Aaron Nix-Gomez > > > > Applied Physics Laboratory > > > > University of Washington > > > > aaron@apl.washington.edu > > > > > > > > > > > > [root@aaronacint2010 Aaron]# uname -a > > > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 19:46:25 > > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > > > ipmi-sensors - 0.7.12 > > > > Copyright (C) 2003-2008 FreeIPMI Core Team > > > > This program is free software; you may redistribute it under the terms of > > > > the GNU General Public License. This program has absolutely no warranty. > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > [ 0h] = comp_code[ 8b] > > > > [ 1h] = sdr_version_major[ 4b] > > > > [ 5h] = sdr_version_minor[ 4b] > > > > [ 0h] = record_count[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > [ 0h] = comp_code[ 8b] > > > > [ 1h] = sdr_version_major[ 4b] > > > > [ 5h] = sdr_version_minor[ 4b] > > > > [ 1Ch] = record_count[16b] > > > > [ 80h] = free_space[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# > > > > _______________________________________________ > > Freeipmi-users mailing list > > Freeipmi-users@gnu.org > > http://BLOCKEDBLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 14:14:27 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFteF-0002KQ-Ob for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:14:27 -0500 Received: from [140.186.70.92] (port=45460 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFteD-0002KA-47 for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:14:26 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFteB-0004ur-HH for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:14:25 -0500 Received: from snark.apl.washington.edu ([128.208.78.3]:40445) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFteB-0004ud-4W for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:14:23 -0500 Received: from snark.apl.washington.edu (localhost [127.0.0.1]) by localhost (Postfix) with ESMTP id 29F4C3A01BC; Tue, 9 Nov 2010 11:14:22 -0800 (PST) Received: from smtp.apl.washington.edu (smtp.apl.washington.edu [128.208.78.8]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by snark.apl.washington.edu (Postfix) with ESMTPS id 146893A01AC; Tue, 9 Nov 2010 11:14:22 -0800 (PST) Received: from NEWWORKSTATION (pool-96-255-71-169.washdc.fios.verizon.net [96.255.71.169]) (using TLSv1 with cipher RC4-MD5 (128/128 bits)) (No client certificate requested) by smtp.apl.washington.edu (Postfix) with ESMTP id 44B3EE3A3C; Tue, 9 Nov 2010 11:14:20 -0800 (PST) From: "James Aaron Nix-Gomez" To: "'Al Chu'" References: <026801cb801e$d137f270$73a7d750$@washington.edu> <1289326883.1979.21.camel@crazyclimber.llnl.gov> <1289328574.1979.23.camel@crazyclimber.llnl.gov> In-Reply-To: <1289328574.1979.23.camel@crazyclimber.llnl.gov> Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Date: Tue, 9 Nov 2010 14:14:19 -0500 Message-ID: <02fe01cb8042$50031fd0$f0095f70$@washington.edu> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 12.0 Thread-Index: AcuAPtpcWh3Uj0BTS8m3duiWH8IeEQAAb9EQ Content-Language: en-us X-Antivirus: Scanned by F-Prot Antivirus (http://www.f-prot.com) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) Cc: freeipmi-users@gnu.org X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 19:14:26 -0000 Al, Thank you for the prompt reply. I tried a few more of the commands, see below, without much success. They too had varying sets of SDR info. We are able to run freeipmi on two other Supermicro X8DTH-iF motherboards without any problems. If you believe there is a BMC issue on this one motherboard then that is all I need to know. I have only just begun to look into sensor monitoring so I was not sure if I had a configuration issue. Best, Aaron Nix-Gomez [root@aaronacint2010 Aaron]# bmc-info --debug ===================================================== Get Device ID Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Device ID Response ===================================================== ipmi_cmd_get_device_id: driver timeout [root@aaronacint2010 Aaron]# ipmi-sel --debug Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost ===================================================== Get SDR Repository Info Request ===================================================== [ 20h] = cmd[ 8b] ===================================================== Get SDR Repository Info Response ===================================================== [ 20h] = cmd[ 8b] [ 0h] = comp_code[ 8b] [ 0h] = sdr_version_major[ 4b] [ 8h] = sdr_version_minor[ 4b] ipmi_sdr_cache_create: internal IPMI error [root@aaronacint2010 Aaron]# bmc-info --debug ===================================================== Get Device ID Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Device ID Response ===================================================== ipmi_cmd_get_device_id: driver timeout [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug ===================================================== Get Chassis Status Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Chassis Status Response ===================================================== ipmi_cmd_get_chassis_status: driver timeout [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug ===================================================== Get Chassis Status Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Chassis Status Response ===================================================== [ 0h] = cmd[ 8b] [ 80h] = comp_code[ 8b] ipmi_cmd_get_chassis_status: bad completion code [root@aaronacint2010 Aaron]# ipmi-locate Probing KCS device using DMIDECODE... done IPMI Version: 2.0 IPMI locate driver: DMIDECODE IPMI interface: KCS BMC driver device: BMC I/O base address: 0xCA2 Register spacing: 1 Probing SMIC device using DMIDECODE... FAILED Probing BT device using DMIDECODE... FAILED Probing SSIF device using DMIDECODE... FAILED Probing KCS device using SMBIOS... done IPMI Version: 2.0 IPMI locate driver: SMBIOS IPMI interface: KCS BMC driver device: BMC I/O base address: 0xCA2 Register spacing: 1 Probing SMIC device using SMBIOS... FAILED Probing BT device using SMBIOS... FAILED Probing SSIF device using SMBIOS... FAILED Probing KCS device using ACPI... FAILED Probing SMIC device using ACPI... FAILED Probing BT device using ACPI... FAILED Probing SSIF device using ACPI... FAILED Probing KCS device using PCI... FAILED Probing SMIC device using PCI... FAILED Probing BT device using PCI... FAILED Probing SSIF device using PCI... FAILED KCS device default values: IPMI Version: 1.5 IPMI locate driver: DEFAULT IPMI interface: KCS BMC driver device: BMC I/O base address: 0xCA2 Register spacing: 1 SMIC device default values: IPMI Version: 1.5 IPMI locate driver: DEFAULT IPMI interface: SMIC BMC driver device: BMC I/O base address: 0xCA9 Register spacing: 1 BT device default values: SSIF device default values: IPMI Version: 1.5 IPMI locate driver: DEFAULT IPMI interface: SSIF BMC driver device: /dev/i2c-0 BMC SMBUS slave address: 0x42 Register spacing: 1 [root@aaronacint2010 Aaron]# -----Original Message----- From: Al Chu [mailto:chu11@llnl.gov] Sent: Tuesday, November 09, 2010 1:50 PM To: James Aaron Nix-Gomez Cc: freeipmi-users@gnu.org Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Oops, forgot to cut and paste in what the payload was supposed to look like: [ 20h] = cmd[ 8b] [ 0h] = comp_code[ 8b] [ 1h] = sdr_version_major[ 4b] [ 5h] = sdr_version_minor[ 4b] [ 20h] = record_count[16b] [ FFFEh] = free_space[16b] [ 0h] = most_recent_addition_timestamp[32b] [ 0h] = most_recent_erase_timestamp[32b] [ 0h] = get_sdr_repository_allocation_info_command_supported[ 1b] [ 1h] = reserve_sdr_repository_command_supported[ 1b] [ 0h] = partial_add_sdr_command_supported[ 1b] [ 0h] = delete_sdr_command_supported[ 1b] [ 0h] = reserved[ 1b] [ 2h] = modal_non_modal_sdr_repository_update_operation_supported[ 2b] [ 0h] = overflow_flag[ 1b] Al On Tue, 2010-11-09 at 10:21 -0800, Al Chu wrote: > Hi James, > > Gut feeling is that something is wrong with the BMC on that motherboard. > > > [ 0h] = record_count[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > [ 1Ch] = record_count[16b] > > > > [ 80h] = free_space[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > In the first payload above, you receive a 0 record_count, which is of > course very bad. Then in the below you get a (seemingly) correct > record_count and an extra field. The full payload of this packet is > supposed to be something like this: > > The fact that on 3-4 different rusn in your debug output, the SDR Info > command is returning inconsistent data, incorrect data, and multiple > sizes of payloads is very bad. > > Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. > > Al > > On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > > Good morning, > > > > > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > > consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. I > > have attached the debug information below. We have other SuperMicro > > X8DTH-if machines, with slightly different RAID hardware, where ipmi-sensors > > works flawlessly. I would appreciate any advice on how to troubleshoot this > > issue. > > > > > > > > Best regards, > > > > > > > > Aaron Nix-Gomez > > > > Applied Physics Laboratory > > > > University of Washington > > > > aaron@apl.washington.edu > > > > > > > > > > > > [root@aaronacint2010 Aaron]# uname -a > > > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 19:46:25 > > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > > > ipmi-sensors - 0.7.12 > > > > Copyright (C) 2003-2008 FreeIPMI Core Team > > > > This program is free software; you may redistribute it under the terms of > > > > the GNU General Public License. This program has absolutely no warranty. > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > [ 0h] = comp_code[ 8b] > > > > [ 1h] = sdr_version_major[ 4b] > > > > [ 5h] = sdr_version_minor[ 4b] > > > > [ 0h] = record_count[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > [ 0h] = comp_code[ 8b] > > > > [ 1h] = sdr_version_major[ 4b] > > > > [ 5h] = sdr_version_minor[ 4b] > > > > [ 1Ch] = record_count[16b] > > > > [ 80h] = free_space[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# > > > > _______________________________________________ > > Freeipmi-users mailing list > > Freeipmi-users@gnu.org > > http://BLOCKEDBLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 14:20:37 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFtkD-00041w-0V for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:20:37 -0500 Received: from [140.186.70.92] (port=52626 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFtk9-00041m-Mt for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:20:35 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFtk4-0006vN-2g for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:20:33 -0500 Received: from snark.apl.washington.edu ([128.208.78.3]:39155) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFtk3-0006ux-Io for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:20:28 -0500 Received: from snark.apl.washington.edu (localhost [127.0.0.1]) by localhost (Postfix) with ESMTP id 9AB313A01BE; Tue, 9 Nov 2010 11:20:26 -0800 (PST) Received: from smtp.apl.washington.edu (smtp.apl.washington.edu [128.208.78.8]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by snark.apl.washington.edu (Postfix) with ESMTPS id 7D04C3A01BC; Tue, 9 Nov 2010 11:20:26 -0800 (PST) Received: from NEWWORKSTATION (pool-96-255-71-169.washdc.fios.verizon.net [96.255.71.169]) (using TLSv1 with cipher RC4-MD5 (128/128 bits)) (No client certificate requested) by smtp.apl.washington.edu (Postfix) with ESMTP id 37E59E3A3B; Tue, 9 Nov 2010 11:20:25 -0800 (PST) From: "James Aaron Nix-Gomez" To: "'Al Chu'" References: <026801cb801e$d137f270$73a7d750$@washington.edu> <1289326883.1979.21.camel@crazyclimber.llnl.gov> <1289328574.1979.23.camel@crazyclimber.llnl.gov> In-Reply-To: Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Date: Tue, 9 Nov 2010 14:20:23 -0500 Message-ID: <02ff01cb8043$29398230$7bac8690$@washington.edu> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 12.0 Thread-Index: AcuAPtpcWh3Uj0BTS8m3duiWH8IeEQAAb9EQAACJHwA= Content-Language: en-us X-Antivirus: Scanned by F-Prot Antivirus (http://www.f-prot.com) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) Cc: freeipmi-users@gnu.org X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 19:20:35 -0000 Al, I should add that I can remotely view the sensors status using the SuperMicro IPMI View V2.7.17. I realize that uses the out of band path. Aaron -----Original Message----- From: James Aaron Nix-Gomez [mailto:aaron@apl.washington.edu] Sent: Tuesday, November 09, 2010 2:14 PM To: 'Al Chu' Cc: 'freeipmi-users@gnu.org' Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Al, Thank you for the prompt reply. I tried a few more of the commands, see below, without much success. They too had varying sets of SDR info. We are able to run freeipmi on two other Supermicro X8DTH-iF motherboards without any problems. If you believe there is a BMC issue on this one motherboard then that is all I need to know. I have only just begun to look into sensor monitoring so I was not sure if I had a configuration issue. Best, Aaron Nix-Gomez [root@aaronacint2010 Aaron]# bmc-info --debug ===================================================== Get Device ID Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Device ID Response ===================================================== ipmi_cmd_get_device_id: driver timeout [root@aaronacint2010 Aaron]# ipmi-sel --debug Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost ===================================================== Get SDR Repository Info Request ===================================================== [ 20h] = cmd[ 8b] ===================================================== Get SDR Repository Info Response ===================================================== [ 20h] = cmd[ 8b] [ 0h] = comp_code[ 8b] [ 0h] = sdr_version_major[ 4b] [ 8h] = sdr_version_minor[ 4b] ipmi_sdr_cache_create: internal IPMI error [root@aaronacint2010 Aaron]# bmc-info --debug ===================================================== Get Device ID Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Device ID Response ===================================================== ipmi_cmd_get_device_id: driver timeout [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug ===================================================== Get Chassis Status Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Chassis Status Response ===================================================== ipmi_cmd_get_chassis_status: driver timeout [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug ===================================================== Get Chassis Status Request ===================================================== [ 1h] = cmd[ 8b] ===================================================== Get Chassis Status Response ===================================================== [ 0h] = cmd[ 8b] [ 80h] = comp_code[ 8b] ipmi_cmd_get_chassis_status: bad completion code [root@aaronacint2010 Aaron]# ipmi-locate Probing KCS device using DMIDECODE... done IPMI Version: 2.0 IPMI locate driver: DMIDECODE IPMI interface: KCS BMC driver device: BMC I/O base address: 0xCA2 Register spacing: 1 Probing SMIC device using DMIDECODE... FAILED Probing BT device using DMIDECODE... FAILED Probing SSIF device using DMIDECODE... FAILED Probing KCS device using SMBIOS... done IPMI Version: 2.0 IPMI locate driver: SMBIOS IPMI interface: KCS BMC driver device: BMC I/O base address: 0xCA2 Register spacing: 1 Probing SMIC device using SMBIOS... FAILED Probing BT device using SMBIOS... FAILED Probing SSIF device using SMBIOS... FAILED Probing KCS device using ACPI... FAILED Probing SMIC device using ACPI... FAILED Probing BT device using ACPI... FAILED Probing SSIF device using ACPI... FAILED Probing KCS device using PCI... FAILED Probing SMIC device using PCI... FAILED Probing BT device using PCI... FAILED Probing SSIF device using PCI... FAILED KCS device default values: IPMI Version: 1.5 IPMI locate driver: DEFAULT IPMI interface: KCS BMC driver device: BMC I/O base address: 0xCA2 Register spacing: 1 SMIC device default values: IPMI Version: 1.5 IPMI locate driver: DEFAULT IPMI interface: SMIC BMC driver device: BMC I/O base address: 0xCA9 Register spacing: 1 BT device default values: SSIF device default values: IPMI Version: 1.5 IPMI locate driver: DEFAULT IPMI interface: SSIF BMC driver device: /dev/i2c-0 BMC SMBUS slave address: 0x42 Register spacing: 1 [root@aaronacint2010 Aaron]# -----Original Message----- From: Al Chu [mailto:chu11@llnl.gov] Sent: Tuesday, November 09, 2010 1:50 PM To: James Aaron Nix-Gomez Cc: freeipmi-users@gnu.org Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Oops, forgot to cut and paste in what the payload was supposed to look like: [ 20h] = cmd[ 8b] [ 0h] = comp_code[ 8b] [ 1h] = sdr_version_major[ 4b] [ 5h] = sdr_version_minor[ 4b] [ 20h] = record_count[16b] [ FFFEh] = free_space[16b] [ 0h] = most_recent_addition_timestamp[32b] [ 0h] = most_recent_erase_timestamp[32b] [ 0h] = get_sdr_repository_allocation_info_command_supported[ 1b] [ 1h] = reserve_sdr_repository_command_supported[ 1b] [ 0h] = partial_add_sdr_command_supported[ 1b] [ 0h] = delete_sdr_command_supported[ 1b] [ 0h] = reserved[ 1b] [ 2h] = modal_non_modal_sdr_repository_update_operation_supported[ 2b] [ 0h] = overflow_flag[ 1b] Al On Tue, 2010-11-09 at 10:21 -0800, Al Chu wrote: > Hi James, > > Gut feeling is that something is wrong with the BMC on that motherboard. > > > [ 0h] = record_count[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > [ 1Ch] = record_count[16b] > > > > [ 80h] = free_space[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > In the first payload above, you receive a 0 record_count, which is of > course very bad. Then in the below you get a (seemingly) correct > record_count and an extra field. The full payload of this packet is > supposed to be something like this: > > The fact that on 3-4 different rusn in your debug output, the SDR Info > command is returning inconsistent data, incorrect data, and multiple > sizes of payloads is very bad. > > Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. > > Al > > On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > > Good morning, > > > > > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > > consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. I > > have attached the debug information below. We have other SuperMicro > > X8DTH-if machines, with slightly different RAID hardware, where ipmi-sensors > > works flawlessly. I would appreciate any advice on how to troubleshoot this > > issue. > > > > > > > > Best regards, > > > > > > > > Aaron Nix-Gomez > > > > Applied Physics Laboratory > > > > University of Washington > > > > aaron@apl.washington.edu > > > > > > > > > > > > [root@aaronacint2010 Aaron]# uname -a > > > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 19:46:25 > > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > > > ipmi-sensors - 0.7.12 > > > > Copyright (C) 2003-2008 FreeIPMI Core Team > > > > This program is free software; you may redistribute it under the terms of > > > > the GNU General Public License. This program has absolutely no warranty. > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > [ 0h] = comp_code[ 8b] > > > > [ 1h] = sdr_version_major[ 4b] > > > > [ 5h] = sdr_version_minor[ 4b] > > > > [ 0h] = record_count[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > ===================================================== > > > > Get SDR Repository Info Request > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > ===================================================== > > > > Get SDR Repository Info Response > > > > ===================================================== > > > > [ 20h] = cmd[ 8b] > > > > [ 0h] = comp_code[ 8b] > > > > [ 1h] = sdr_version_major[ 4b] > > > > [ 5h] = sdr_version_minor[ 4b] > > > > [ 1Ch] = record_count[16b] > > > > [ 80h] = free_space[ 8b] > > > > ipmi_sdr_cache_create: internal IPMI error > > > > [root@aaronacint2010 Aaron]# > > > > _______________________________________________ > > Freeipmi-users mailing list > > Freeipmi-users@gnu.org > > http://BLOCKEDBLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 14:26:26 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFtpq-00065x-2K for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:26:26 -0500 Received: from [140.186.70.92] (port=35166 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFtpm-00065m-Cq for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:26:23 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFtpk-0008Qj-BS for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:26:22 -0500 Received: from nspiron-3.llnl.gov ([128.115.41.83]:30940 helo=smtp.llnl.gov) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFtpj-0008Qb-PV for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:26:20 -0500 X-Attachments: None Received: from crazyclimber.llnl.gov (HELO [134.9.93.113]) ([134.9.93.113]) by smtp.llnl.gov with ESMTP; 09 Nov 2010 11:26:17 -0800 Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error From: Al Chu To: James Aaron Nix-Gomez In-Reply-To: <02fe01cb8042$50031fd0$f0095f70$@washington.edu> References: <026801cb801e$d137f270$73a7d750$@washington.edu> <1289326883.1979.21.camel@crazyclimber.llnl.gov> <1289328574.1979.23.camel@crazyclimber.llnl.gov> <02fe01cb8042$50031fd0$f0095f70$@washington.edu> Content-Type: text/plain Date: Tue, 09 Nov 2010 11:26:17 -0800 Message-Id: <1289330777.1979.27.camel@crazyclimber.llnl.gov> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-19.el5) Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. Cc: "freeipmi-users@gnu.org" X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 19:26:24 -0000 Hi James, Just as a quick double check, you may want to make sure that no other IPMI stuff is running on the system, possibly other daemons or something like that. Or perhaps the IPMI kernel driver is installed into a non-common path (it's usually /dev/ipmi0 if it's installed) and stuff going on there is conflicting. It's just a thought b/c of the driver-timeout errors you're seeing below. I'm still betting it's a HW problem though. Al On Tue, 2010-11-09 at 11:14 -0800, James Aaron Nix-Gomez wrote: > Al, > > Thank you for the prompt reply. I tried a few more of the commands, see > below, without much success. They too had varying sets of SDR info. > > We are able to run freeipmi on two other Supermicro X8DTH-iF motherboards > without any problems. If you believe there is a BMC issue on this one > motherboard then that is all I need to know. I have only just begun to look > into sensor monitoring so I was not sure if I had a configuration issue. > > Best, > > Aaron Nix-Gomez > > > [root@aaronacint2010 Aaron]# bmc-info --debug > ===================================================== > Get Device ID Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Device ID Response > ===================================================== > ipmi_cmd_get_device_id: driver timeout > [root@aaronacint2010 Aaron]# ipmi-sel --debug > Caching SDR repository information: > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > ===================================================== > Get SDR Repository Info Request > ===================================================== > [ 20h] = cmd[ 8b] > ===================================================== > Get SDR Repository Info Response > ===================================================== > [ 20h] = cmd[ 8b] > [ 0h] = comp_code[ 8b] > [ 0h] = sdr_version_major[ 4b] > [ 8h] = sdr_version_minor[ 4b] > ipmi_sdr_cache_create: internal IPMI error > [root@aaronacint2010 Aaron]# bmc-info --debug > ===================================================== > Get Device ID Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Device ID Response > ===================================================== > ipmi_cmd_get_device_id: driver timeout > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > ===================================================== > Get Chassis Status Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Chassis Status Response > ===================================================== > ipmi_cmd_get_chassis_status: driver timeout > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > ===================================================== > Get Chassis Status Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Chassis Status Response > ===================================================== > [ 0h] = cmd[ 8b] > [ 80h] = comp_code[ 8b] > ipmi_cmd_get_chassis_status: bad completion code > [root@aaronacint2010 Aaron]# ipmi-locate > Probing KCS device using DMIDECODE... done > IPMI Version: 2.0 > IPMI locate driver: DMIDECODE > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > Probing SMIC device using DMIDECODE... FAILED > > Probing BT device using DMIDECODE... FAILED > > Probing SSIF device using DMIDECODE... FAILED > > Probing KCS device using SMBIOS... done > IPMI Version: 2.0 > IPMI locate driver: SMBIOS > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > Probing SMIC device using SMBIOS... FAILED > > Probing BT device using SMBIOS... FAILED > > Probing SSIF device using SMBIOS... FAILED > > Probing KCS device using ACPI... FAILED > > Probing SMIC device using ACPI... FAILED > > Probing BT device using ACPI... FAILED > > Probing SSIF device using ACPI... FAILED > > Probing KCS device using PCI... FAILED > > Probing SMIC device using PCI... FAILED > > Probing BT device using PCI... FAILED > > Probing SSIF device using PCI... FAILED > > KCS device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > SMIC device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: SMIC > BMC driver device: > BMC I/O base address: 0xCA9 > Register spacing: 1 > > BT device default values: > SSIF device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: SSIF > BMC driver device: /dev/i2c-0 > BMC SMBUS slave address: 0x42 > Register spacing: 1 > > [root@aaronacint2010 Aaron]# > > -----Original Message----- > From: Al Chu [mailto:chu11@llnl.gov] > Sent: Tuesday, November 09, 2010 1:50 PM > To: James Aaron Nix-Gomez > Cc: freeipmi-users@gnu.org > Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error > > Oops, forgot to cut and paste in what the payload was supposed to look > like: > > [ 20h] = cmd[ 8b] > [ 0h] = comp_code[ 8b] > [ 1h] = sdr_version_major[ 4b] > [ 5h] = sdr_version_minor[ 4b] > [ 20h] = record_count[16b] > [ FFFEh] = free_space[16b] > [ 0h] = most_recent_addition_timestamp[32b] > [ 0h] = most_recent_erase_timestamp[32b] > [ 0h] = > get_sdr_repository_allocation_info_command_supported[ 1b] > [ 1h] = reserve_sdr_repository_command_supported[ 1b] > [ 0h] = partial_add_sdr_command_supported[ 1b] > [ 0h] = delete_sdr_command_supported[ 1b] > [ 0h] = reserved[ 1b] > [ 2h] = > modal_non_modal_sdr_repository_update_operation_supported[ 2b] > [ 0h] = overflow_flag[ 1b] > > Al > > On Tue, 2010-11-09 at 10:21 -0800, Al Chu wrote: > > Hi James, > > > > Gut feeling is that something is wrong with the BMC on that motherboard. > > > > > [ 0h] = record_count[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > [ 1Ch] = record_count[16b] > > > > > > [ 80h] = free_space[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > In the first payload above, you receive a 0 record_count, which is of > > course very bad. Then in the below you get a (seemingly) correct > > record_count and an extra field. The full payload of this packet is > > supposed to be something like this: > > > > The fact that on 3-4 different rusn in your debug output, the SDR Info > > command is returning inconsistent data, incorrect data, and multiple > > sizes of payloads is very bad. > > > > Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. > > > > Al > > > > On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > > > Good morning, > > > > > > > > > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > > > consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. > I > > > have attached the debug information below. We have other SuperMicro > > > X8DTH-if machines, with slightly different RAID hardware, where > ipmi-sensors > > > works flawlessly. I would appreciate any advice on how to troubleshoot > this > > > issue. > > > > > > > > > > > > Best regards, > > > > > > > > > > > > Aaron Nix-Gomez > > > > > > Applied Physics Laboratory > > > > > > University of Washington > > > > > > aaron@apl.washington.edu > > > > > > > > > > > > > > > > > > [root@aaronacint2010 Aaron]# uname -a > > > > > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 > 19:46:25 > > > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > > > > > ipmi-sensors - 0.7.12 > > > > > > Copyright (C) 2003-2008 FreeIPMI Core Team > > > > > > This program is free software; you may redistribute it under the terms > of > > > > > > the GNU General Public License. This program has absolutely no > warranty. > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > [ 0h] = comp_code[ 8b] > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > [ 0h] = record_count[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > [ 0h] = comp_code[ 8b] > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > [ 1Ch] = record_count[16b] > > > > > > [ 80h] = free_space[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# > > > > > > _______________________________________________ > > > Freeipmi-users mailing list > > > Freeipmi-users@gnu.org > > > http://BLOCKEDBLOCKEDBLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > > > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 14:35:41 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFtyn-0000sG-PZ for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:35:41 -0500 Received: from [140.186.70.92] (port=52318 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFtyk-0000sB-Lj for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:35:40 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFtyi-0001k0-ON for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:35:38 -0500 Received: from snark.apl.washington.edu ([128.208.78.3]:41825) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFtyi-0001jt-Cv for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:35:36 -0500 Received: from snark.apl.washington.edu (localhost [127.0.0.1]) by localhost (Postfix) with ESMTP id 510243A01A0; Tue, 9 Nov 2010 11:35:35 -0800 (PST) Received: from smtp.apl.washington.edu (smtp.apl.washington.edu [128.208.78.8]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by snark.apl.washington.edu (Postfix) with ESMTPS id 3F6153A018B; Tue, 9 Nov 2010 11:35:35 -0800 (PST) Received: from NEWWORKSTATION (pool-96-255-71-169.washdc.fios.verizon.net [96.255.71.169]) (using TLSv1 with cipher RC4-MD5 (128/128 bits)) (No client certificate requested) by smtp.apl.washington.edu (Postfix) with ESMTP id DE384E3A3B; Tue, 9 Nov 2010 11:35:33 -0800 (PST) From: "James Aaron Nix-Gomez" To: "'Al Chu'" References: <026801cb801e$d137f270$73a7d750$@washington.edu> <1289326883.1979.21.camel@crazyclimber.llnl.gov> <1289328574.1979.23.camel@crazyclimber.llnl.gov> <02fe01cb8042$50031fd0$f0095f70$@washington.edu> <1289330777.1979.27.camel@crazyclimber.llnl.gov> In-Reply-To: <1289330777.1979.27.camel@crazyclimber.llnl.gov> Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Date: Tue, 9 Nov 2010 14:35:32 -0500 Message-ID: <030501cb8045$46e1c110$d4a54330$@washington.edu> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 12.0 Thread-Index: AcuAQ/tzGAAI4c/pQy66lskNOCb3YwAALbsQ Content-Language: en-us X-Antivirus: Scanned by F-Prot Antivirus (http://www.f-prot.com) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) Cc: freeipmi-users@gnu.org X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 19:35:40 -0000 Al, Good catch, the lm_sensors service was running. However, the ipmi module is missing from it. I thought I could use lm_sensors for Intel CPU core temperatures and freeipmi for the motherboard sensors. Aaron -----Original Message----- From: Al Chu [mailto:chu11@llnl.gov] Sent: Tuesday, November 09, 2010 2:26 PM To: James Aaron Nix-Gomez Cc: freeipmi-users@gnu.org Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Hi James, Just as a quick double check, you may want to make sure that no other IPMI stuff is running on the system, possibly other daemons or something like that. Or perhaps the IPMI kernel driver is installed into a non-common path (it's usually /dev/ipmi0 if it's installed) and stuff going on there is conflicting. It's just a thought b/c of the driver-timeout errors you're seeing below. I'm still betting it's a HW problem though. Al On Tue, 2010-11-09 at 11:14 -0800, James Aaron Nix-Gomez wrote: > Al, > > Thank you for the prompt reply. I tried a few more of the commands, see > below, without much success. They too had varying sets of SDR info. > > We are able to run freeipmi on two other Supermicro X8DTH-iF motherboards > without any problems. If you believe there is a BMC issue on this one > motherboard then that is all I need to know. I have only just begun to look > into sensor monitoring so I was not sure if I had a configuration issue. > > Best, > > Aaron Nix-Gomez > > > [root@aaronacint2010 Aaron]# bmc-info --debug > ===================================================== > Get Device ID Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Device ID Response > ===================================================== > ipmi_cmd_get_device_id: driver timeout > [root@aaronacint2010 Aaron]# ipmi-sel --debug > Caching SDR repository information: > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > ===================================================== > Get SDR Repository Info Request > ===================================================== > [ 20h] = cmd[ 8b] > ===================================================== > Get SDR Repository Info Response > ===================================================== > [ 20h] = cmd[ 8b] > [ 0h] = comp_code[ 8b] > [ 0h] = sdr_version_major[ 4b] > [ 8h] = sdr_version_minor[ 4b] > ipmi_sdr_cache_create: internal IPMI error > [root@aaronacint2010 Aaron]# bmc-info --debug > ===================================================== > Get Device ID Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Device ID Response > ===================================================== > ipmi_cmd_get_device_id: driver timeout > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > ===================================================== > Get Chassis Status Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Chassis Status Response > ===================================================== > ipmi_cmd_get_chassis_status: driver timeout > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > ===================================================== > Get Chassis Status Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Chassis Status Response > ===================================================== > [ 0h] = cmd[ 8b] > [ 80h] = comp_code[ 8b] > ipmi_cmd_get_chassis_status: bad completion code > [root@aaronacint2010 Aaron]# ipmi-locate > Probing KCS device using DMIDECODE... done > IPMI Version: 2.0 > IPMI locate driver: DMIDECODE > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > Probing SMIC device using DMIDECODE... FAILED > > Probing BT device using DMIDECODE... FAILED > > Probing SSIF device using DMIDECODE... FAILED > > Probing KCS device using SMBIOS... done > IPMI Version: 2.0 > IPMI locate driver: SMBIOS > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > Probing SMIC device using SMBIOS... FAILED > > Probing BT device using SMBIOS... FAILED > > Probing SSIF device using SMBIOS... FAILED > > Probing KCS device using ACPI... FAILED > > Probing SMIC device using ACPI... FAILED > > Probing BT device using ACPI... FAILED > > Probing SSIF device using ACPI... FAILED > > Probing KCS device using PCI... FAILED > > Probing SMIC device using PCI... FAILED > > Probing BT device using PCI... FAILED > > Probing SSIF device using PCI... FAILED > > KCS device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > SMIC device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: SMIC > BMC driver device: > BMC I/O base address: 0xCA9 > Register spacing: 1 > > BT device default values: > SSIF device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: SSIF > BMC driver device: /dev/i2c-0 > BMC SMBUS slave address: 0x42 > Register spacing: 1 > > [root@aaronacint2010 Aaron]# > > -----Original Message----- > From: Al Chu [mailto:chu11@llnl.gov] > Sent: Tuesday, November 09, 2010 1:50 PM > To: James Aaron Nix-Gomez > Cc: freeipmi-users@gnu.org > Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error > > Oops, forgot to cut and paste in what the payload was supposed to look > like: > > [ 20h] = cmd[ 8b] > [ 0h] = comp_code[ 8b] > [ 1h] = sdr_version_major[ 4b] > [ 5h] = sdr_version_minor[ 4b] > [ 20h] = record_count[16b] > [ FFFEh] = free_space[16b] > [ 0h] = most_recent_addition_timestamp[32b] > [ 0h] = most_recent_erase_timestamp[32b] > [ 0h] = > get_sdr_repository_allocation_info_command_supported[ 1b] > [ 1h] = reserve_sdr_repository_command_supported[ 1b] > [ 0h] = partial_add_sdr_command_supported[ 1b] > [ 0h] = delete_sdr_command_supported[ 1b] > [ 0h] = reserved[ 1b] > [ 2h] = > modal_non_modal_sdr_repository_update_operation_supported[ 2b] > [ 0h] = overflow_flag[ 1b] > > Al > > On Tue, 2010-11-09 at 10:21 -0800, Al Chu wrote: > > Hi James, > > > > Gut feeling is that something is wrong with the BMC on that motherboard. > > > > > [ 0h] = record_count[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > [ 1Ch] = record_count[16b] > > > > > > [ 80h] = free_space[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > In the first payload above, you receive a 0 record_count, which is of > > course very bad. Then in the below you get a (seemingly) correct > > record_count and an extra field. The full payload of this packet is > > supposed to be something like this: > > > > The fact that on 3-4 different rusn in your debug output, the SDR Info > > command is returning inconsistent data, incorrect data, and multiple > > sizes of payloads is very bad. > > > > Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. > > > > Al > > > > On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > > > Good morning, > > > > > > > > > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > > > consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. > I > > > have attached the debug information below. We have other SuperMicro > > > X8DTH-if machines, with slightly different RAID hardware, where > ipmi-sensors > > > works flawlessly. I would appreciate any advice on how to troubleshoot > this > > > issue. > > > > > > > > > > > > Best regards, > > > > > > > > > > > > Aaron Nix-Gomez > > > > > > Applied Physics Laboratory > > > > > > University of Washington > > > > > > aaron@apl.washington.edu > > > > > > > > > > > > > > > > > > [root@aaronacint2010 Aaron]# uname -a > > > > > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 > 19:46:25 > > > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > > > > > ipmi-sensors - 0.7.12 > > > > > > Copyright (C) 2003-2008 FreeIPMI Core Team > > > > > > This program is free software; you may redistribute it under the terms > of > > > > > > the GNU General Public License. This program has absolutely no > warranty. > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > [ 0h] = comp_code[ 8b] > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > [ 0h] = record_count[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > [ 0h] = comp_code[ 8b] > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > [ 1Ch] = record_count[16b] > > > > > > [ 80h] = free_space[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# > > > > > > _______________________________________________ > > > Freeipmi-users mailing list > > > Freeipmi-users@gnu.org > > > http://BLOCKEDBLOCKEDBLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > > > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 14:50:06 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFuCk-0005OK-QS for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:50:06 -0500 Received: from [140.186.70.92] (port=36017 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFuCh-0005NM-MJ for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:50:05 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFuCc-0004FK-EL for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:50:03 -0500 Received: from snark.apl.washington.edu ([128.208.78.3]:41104) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFuCc-0004F2-3c for freeipmi-users@gnu.org; Tue, 09 Nov 2010 14:49:58 -0500 Received: from snark.apl.washington.edu (localhost [127.0.0.1]) by localhost (Postfix) with ESMTP id E389D3A01A6; Tue, 9 Nov 2010 11:49:56 -0800 (PST) Received: from smtp.apl.washington.edu (smtp.apl.washington.edu [128.208.78.8]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by snark.apl.washington.edu (Postfix) with ESMTPS id D14303A01A0; Tue, 9 Nov 2010 11:49:56 -0800 (PST) Received: from NEWWORKSTATION (pool-96-255-71-169.washdc.fios.verizon.net [96.255.71.169]) (using TLSv1 with cipher RC4-MD5 (128/128 bits)) (No client certificate requested) by smtp.apl.washington.edu (Postfix) with ESMTP id 76A4FE3A3B; Tue, 9 Nov 2010 11:49:55 -0800 (PST) From: "James Aaron Nix-Gomez" To: "'Al Chu'" References: <026801cb801e$d137f270$73a7d750$@washington.edu> <1289326883.1979.21.camel@crazyclimber.llnl.gov> <1289328574.1979.23.camel@crazyclimber.llnl.gov> <02fe01cb8042$50031fd0$f0095f70$@washington.edu> <1289330777.1979.27.camel@crazyclimber.llnl.gov> In-Reply-To: Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Date: Tue, 9 Nov 2010 14:49:54 -0500 Message-ID: <030a01cb8047$48734ba0$d959e2e0$@washington.edu> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit X-Mailer: Microsoft Office Outlook 12.0 Thread-Index: AcuAQ/tzGAAI4c/pQy66lskNOCb3YwAALbsQAACSS+A= Content-Language: en-us X-Antivirus: Scanned by F-Prot Antivirus (http://www.f-prot.com) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) Cc: freeipmi-users@gnu.org X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 19:50:06 -0000 Al, One final follow up. Freeipmi works great when I disable the lm_sensors service. Freeipmi also works if I enable the lm_sensors service and configure it to just check the Intel core temperatures. Thank you again for all of your help, Aaron -----Original Message----- From: James Aaron Nix-Gomez [mailto:aaron@apl.washington.edu] Sent: Tuesday, November 09, 2010 2:36 PM To: 'Al Chu' Cc: 'freeipmi-users@gnu.org' Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Al, Good catch, the lm_sensors service was running. However, the ipmi module is missing from it. I thought I could use lm_sensors for Intel CPU core temperatures and freeipmi for the motherboard sensors. Aaron -----Original Message----- From: Al Chu [mailto:chu11@llnl.gov] Sent: Tuesday, November 09, 2010 2:26 PM To: James Aaron Nix-Gomez Cc: freeipmi-users@gnu.org Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error Hi James, Just as a quick double check, you may want to make sure that no other IPMI stuff is running on the system, possibly other daemons or something like that. Or perhaps the IPMI kernel driver is installed into a non-common path (it's usually /dev/ipmi0 if it's installed) and stuff going on there is conflicting. It's just a thought b/c of the driver-timeout errors you're seeing below. I'm still betting it's a HW problem though. Al On Tue, 2010-11-09 at 11:14 -0800, James Aaron Nix-Gomez wrote: > Al, > > Thank you for the prompt reply. I tried a few more of the commands, see > below, without much success. They too had varying sets of SDR info. > > We are able to run freeipmi on two other Supermicro X8DTH-iF motherboards > without any problems. If you believe there is a BMC issue on this one > motherboard then that is all I need to know. I have only just begun to look > into sensor monitoring so I was not sure if I had a configuration issue. > > Best, > > Aaron Nix-Gomez > > > [root@aaronacint2010 Aaron]# bmc-info --debug > ===================================================== > Get Device ID Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Device ID Response > ===================================================== > ipmi_cmd_get_device_id: driver timeout > [root@aaronacint2010 Aaron]# ipmi-sel --debug > Caching SDR repository information: > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > ===================================================== > Get SDR Repository Info Request > ===================================================== > [ 20h] = cmd[ 8b] > ===================================================== > Get SDR Repository Info Response > ===================================================== > [ 20h] = cmd[ 8b] > [ 0h] = comp_code[ 8b] > [ 0h] = sdr_version_major[ 4b] > [ 8h] = sdr_version_minor[ 4b] > ipmi_sdr_cache_create: internal IPMI error > [root@aaronacint2010 Aaron]# bmc-info --debug > ===================================================== > Get Device ID Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Device ID Response > ===================================================== > ipmi_cmd_get_device_id: driver timeout > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > ===================================================== > Get Chassis Status Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Chassis Status Response > ===================================================== > ipmi_cmd_get_chassis_status: driver timeout > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > ===================================================== > Get Chassis Status Request > ===================================================== > [ 1h] = cmd[ 8b] > ===================================================== > Get Chassis Status Response > ===================================================== > [ 0h] = cmd[ 8b] > [ 80h] = comp_code[ 8b] > ipmi_cmd_get_chassis_status: bad completion code > [root@aaronacint2010 Aaron]# ipmi-locate > Probing KCS device using DMIDECODE... done > IPMI Version: 2.0 > IPMI locate driver: DMIDECODE > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > Probing SMIC device using DMIDECODE... FAILED > > Probing BT device using DMIDECODE... FAILED > > Probing SSIF device using DMIDECODE... FAILED > > Probing KCS device using SMBIOS... done > IPMI Version: 2.0 > IPMI locate driver: SMBIOS > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > Probing SMIC device using SMBIOS... FAILED > > Probing BT device using SMBIOS... FAILED > > Probing SSIF device using SMBIOS... FAILED > > Probing KCS device using ACPI... FAILED > > Probing SMIC device using ACPI... FAILED > > Probing BT device using ACPI... FAILED > > Probing SSIF device using ACPI... FAILED > > Probing KCS device using PCI... FAILED > > Probing SMIC device using PCI... FAILED > > Probing BT device using PCI... FAILED > > Probing SSIF device using PCI... FAILED > > KCS device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: KCS > BMC driver device: > BMC I/O base address: 0xCA2 > Register spacing: 1 > > SMIC device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: SMIC > BMC driver device: > BMC I/O base address: 0xCA9 > Register spacing: 1 > > BT device default values: > SSIF device default values: > IPMI Version: 1.5 > IPMI locate driver: DEFAULT > IPMI interface: SSIF > BMC driver device: /dev/i2c-0 > BMC SMBUS slave address: 0x42 > Register spacing: 1 > > [root@aaronacint2010 Aaron]# > > -----Original Message----- > From: Al Chu [mailto:chu11@llnl.gov] > Sent: Tuesday, November 09, 2010 1:50 PM > To: James Aaron Nix-Gomez > Cc: freeipmi-users@gnu.org > Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error > > Oops, forgot to cut and paste in what the payload was supposed to look > like: > > [ 20h] = cmd[ 8b] > [ 0h] = comp_code[ 8b] > [ 1h] = sdr_version_major[ 4b] > [ 5h] = sdr_version_minor[ 4b] > [ 20h] = record_count[16b] > [ FFFEh] = free_space[16b] > [ 0h] = most_recent_addition_timestamp[32b] > [ 0h] = most_recent_erase_timestamp[32b] > [ 0h] = > get_sdr_repository_allocation_info_command_supported[ 1b] > [ 1h] = reserve_sdr_repository_command_supported[ 1b] > [ 0h] = partial_add_sdr_command_supported[ 1b] > [ 0h] = delete_sdr_command_supported[ 1b] > [ 0h] = reserved[ 1b] > [ 2h] = > modal_non_modal_sdr_repository_update_operation_supported[ 2b] > [ 0h] = overflow_flag[ 1b] > > Al > > On Tue, 2010-11-09 at 10:21 -0800, Al Chu wrote: > > Hi James, > > > > Gut feeling is that something is wrong with the BMC on that motherboard. > > > > > [ 0h] = record_count[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > [ 1Ch] = record_count[16b] > > > > > > [ 80h] = free_space[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > In the first payload above, you receive a 0 record_count, which is of > > course very bad. Then in the below you get a (seemingly) correct > > record_count and an extra field. The full payload of this packet is > > supposed to be something like this: > > > > The fact that on 3-4 different rusn in your debug output, the SDR Info > > command is returning inconsistent data, incorrect data, and multiple > > sizes of payloads is very bad. > > > > Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. > > > > Al > > > > On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > > > Good morning, > > > > > > > > > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > > > consistently get "ipmi_sdr_cache_create: internal IPMI error" messages. > I > > > have attached the debug information below. We have other SuperMicro > > > X8DTH-if machines, with slightly different RAID hardware, where > ipmi-sensors > > > works flawlessly. I would appreciate any advice on how to troubleshoot > this > > > issue. > > > > > > > > > > > > Best regards, > > > > > > > > > > > > Aaron Nix-Gomez > > > > > > Applied Physics Laboratory > > > > > > University of Washington > > > > > > aaron@apl.washington.edu > > > > > > > > > > > > > > > > > > [root@aaronacint2010 Aaron]# uname -a > > > > > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 > 19:46:25 > > > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > > > > > ipmi-sensors - 0.7.12 > > > > > > Copyright (C) 2003-2008 FreeIPMI Core Team > > > > > > This program is free software; you may redistribute it under the terms > of > > > > > > the GNU General Public License. This program has absolutely no > warranty. > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > [ 0h] = comp_code[ 8b] > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > [ 0h] = record_count[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > Caching SDR repository information: > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > ===================================================== > > > > > > Get SDR Repository Info Request > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > ===================================================== > > > > > > Get SDR Repository Info Response > > > > > > ===================================================== > > > > > > [ 20h] = cmd[ 8b] > > > > > > [ 0h] = comp_code[ 8b] > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > [ 1Ch] = record_count[16b] > > > > > > [ 80h] = free_space[ 8b] > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > [root@aaronacint2010 Aaron]# > > > > > > _______________________________________________ > > > Freeipmi-users mailing list > > > Freeipmi-users@gnu.org > > > http://BLOCKEDBLOCKEDBLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > > > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Tue Nov 09 16:26:43 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PFviF-00078J-P2 for mharc-freeipmi-users@gnu.org; Tue, 09 Nov 2010 16:26:43 -0500 Received: from [140.186.70.92] (port=36988 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PFviC-00076y-P3 for freeipmi-users@gnu.org; Tue, 09 Nov 2010 16:26:42 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PFviA-00037K-Nd for freeipmi-users@gnu.org; Tue, 09 Nov 2010 16:26:40 -0500 Received: from nspiron-3.llnl.gov ([128.115.41.83]:58966 helo=smtp.llnl.gov) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PFviA-00036s-9w for freeipmi-users@gnu.org; Tue, 09 Nov 2010 16:26:38 -0500 X-Attachments: None Received: from crazyclimber.llnl.gov (HELO [134.9.93.113]) ([134.9.93.113]) by smtp.llnl.gov with ESMTP; 09 Nov 2010 13:26:37 -0800 Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error From: Al Chu To: James Aaron Nix-Gomez In-Reply-To: <030a01cb8047$48734ba0$d959e2e0$@washington.edu> References: <026801cb801e$d137f270$73a7d750$@washington.edu> <1289326883.1979.21.camel@crazyclimber.llnl.gov> <1289328574.1979.23.camel@crazyclimber.llnl.gov> <02fe01cb8042$50031fd0$f0095f70$@washington.edu> <1289330777.1979.27.camel@crazyclimber.llnl.gov> <030a01cb8047$48734ba0$d959e2e0$@washington.edu> Content-Type: text/plain Date: Tue, 09 Nov 2010 13:26:36 -0800 Message-Id: <1289337996.1979.29.camel@crazyclimber.llnl.gov> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-19.el5) Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. Cc: "freeipmi-users@gnu.org" X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 09 Nov 2010 21:26:42 -0000 I'm not sure how lm_sensors works under the covers. If they use the /dev/ipmi0 driver, then FreeIPMI & lm_sensors should work together. But I'm guessing lm_sensors does something else and it's mucking up something on the motherboard. Al On Tue, 2010-11-09 at 11:49 -0800, James Aaron Nix-Gomez wrote: > Al, > > One final follow up. Freeipmi works great when I disable the lm_sensors > service. Freeipmi also works if I enable the lm_sensors service and > configure it to just check the Intel core temperatures. > > Thank you again for all of your help, > > Aaron > > -----Original Message----- > From: James Aaron Nix-Gomez [mailto:aaron@apl.washington.edu] > Sent: Tuesday, November 09, 2010 2:36 PM > To: 'Al Chu' > Cc: 'freeipmi-users@gnu.org' > Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error > > Al, > > Good catch, the lm_sensors service was running. However, the ipmi module is > missing from it. I thought I could use lm_sensors for Intel CPU core > temperatures and freeipmi for the motherboard sensors. > > Aaron > > -----Original Message----- > From: Al Chu [mailto:chu11@llnl.gov] > Sent: Tuesday, November 09, 2010 2:26 PM > To: James Aaron Nix-Gomez > Cc: freeipmi-users@gnu.org > Subject: RE: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error > > Hi James, > > Just as a quick double check, you may want to make sure that no other > IPMI stuff is running on the system, possibly other daemons or something > like that. Or perhaps the IPMI kernel driver is installed into a > non-common path (it's usually /dev/ipmi0 if it's installed) and stuff > going on there is conflicting. It's just a thought b/c of the > driver-timeout errors you're seeing below. I'm still betting it's a HW > problem though. > > Al > > On Tue, 2010-11-09 at 11:14 -0800, James Aaron Nix-Gomez wrote: > > Al, > > > > Thank you for the prompt reply. I tried a few more of the commands, see > > below, without much success. They too had varying sets of SDR info. > > > > We are able to run freeipmi on two other Supermicro X8DTH-iF motherboards > > without any problems. If you believe there is a BMC issue on this one > > motherboard then that is all I need to know. I have only just begun to > look > > into sensor monitoring so I was not sure if I had a configuration issue. > > > > Best, > > > > Aaron Nix-Gomez > > > > > > [root@aaronacint2010 Aaron]# bmc-info --debug > > ===================================================== > > Get Device ID Request > > ===================================================== > > [ 1h] = cmd[ 8b] > > ===================================================== > > Get Device ID Response > > ===================================================== > > ipmi_cmd_get_device_id: driver timeout > > [root@aaronacint2010 Aaron]# ipmi-sel --debug > > Caching SDR repository information: > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > ===================================================== > > Get SDR Repository Info Request > > ===================================================== > > [ 20h] = cmd[ 8b] > > ===================================================== > > Get SDR Repository Info Response > > ===================================================== > > [ 20h] = cmd[ 8b] > > [ 0h] = comp_code[ 8b] > > [ 0h] = sdr_version_major[ 4b] > > [ 8h] = sdr_version_minor[ 4b] > > ipmi_sdr_cache_create: internal IPMI error > > [root@aaronacint2010 Aaron]# bmc-info --debug > > ===================================================== > > Get Device ID Request > > ===================================================== > > [ 1h] = cmd[ 8b] > > ===================================================== > > Get Device ID Response > > ===================================================== > > ipmi_cmd_get_device_id: driver timeout > > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > > ===================================================== > > Get Chassis Status Request > > ===================================================== > > [ 1h] = cmd[ 8b] > > ===================================================== > > Get Chassis Status Response > > ===================================================== > > ipmi_cmd_get_chassis_status: driver timeout > > [root@aaronacint2010 Aaron]# ipmi-chassis --get-status --debug > > ===================================================== > > Get Chassis Status Request > > ===================================================== > > [ 1h] = cmd[ 8b] > > ===================================================== > > Get Chassis Status Response > > ===================================================== > > [ 0h] = cmd[ 8b] > > [ 80h] = comp_code[ 8b] > > ipmi_cmd_get_chassis_status: bad completion code > > [root@aaronacint2010 Aaron]# ipmi-locate > > Probing KCS device using DMIDECODE... done > > IPMI Version: 2.0 > > IPMI locate driver: DMIDECODE > > IPMI interface: KCS > > BMC driver device: > > BMC I/O base address: 0xCA2 > > Register spacing: 1 > > > > Probing SMIC device using DMIDECODE... FAILED > > > > Probing BT device using DMIDECODE... FAILED > > > > Probing SSIF device using DMIDECODE... FAILED > > > > Probing KCS device using SMBIOS... done > > IPMI Version: 2.0 > > IPMI locate driver: SMBIOS > > IPMI interface: KCS > > BMC driver device: > > BMC I/O base address: 0xCA2 > > Register spacing: 1 > > > > Probing SMIC device using SMBIOS... FAILED > > > > Probing BT device using SMBIOS... FAILED > > > > Probing SSIF device using SMBIOS... FAILED > > > > Probing KCS device using ACPI... FAILED > > > > Probing SMIC device using ACPI... FAILED > > > > Probing BT device using ACPI... FAILED > > > > Probing SSIF device using ACPI... FAILED > > > > Probing KCS device using PCI... FAILED > > > > Probing SMIC device using PCI... FAILED > > > > Probing BT device using PCI... FAILED > > > > Probing SSIF device using PCI... FAILED > > > > KCS device default values: > > IPMI Version: 1.5 > > IPMI locate driver: DEFAULT > > IPMI interface: KCS > > BMC driver device: > > BMC I/O base address: 0xCA2 > > Register spacing: 1 > > > > SMIC device default values: > > IPMI Version: 1.5 > > IPMI locate driver: DEFAULT > > IPMI interface: SMIC > > BMC driver device: > > BMC I/O base address: 0xCA9 > > Register spacing: 1 > > > > BT device default values: > > SSIF device default values: > > IPMI Version: 1.5 > > IPMI locate driver: DEFAULT > > IPMI interface: SSIF > > BMC driver device: /dev/i2c-0 > > BMC SMBUS slave address: 0x42 > > Register spacing: 1 > > > > [root@aaronacint2010 Aaron]# > > > > -----Original Message----- > > From: Al Chu [mailto:chu11@llnl.gov] > > Sent: Tuesday, November 09, 2010 1:50 PM > > To: James Aaron Nix-Gomez > > Cc: freeipmi-users@gnu.org > > Subject: Re: [Freeipmi-users] ipmi_sdr_cache_create: internal IPMI error > > > > Oops, forgot to cut and paste in what the payload was supposed to look > > like: > > > > [ 20h] = cmd[ 8b] > > [ 0h] = comp_code[ 8b] > > [ 1h] = sdr_version_major[ 4b] > > [ 5h] = sdr_version_minor[ 4b] > > [ 20h] = record_count[16b] > > [ FFFEh] = free_space[16b] > > [ 0h] = most_recent_addition_timestamp[32b] > > [ 0h] = most_recent_erase_timestamp[32b] > > [ 0h] = > > get_sdr_repository_allocation_info_command_supported[ 1b] > > [ 1h] = reserve_sdr_repository_command_supported[ 1b] > > [ 0h] = partial_add_sdr_command_supported[ 1b] > > [ 0h] = delete_sdr_command_supported[ 1b] > > [ 0h] = reserved[ 1b] > > [ 2h] = > > modal_non_modal_sdr_repository_update_operation_supported[ 2b] > > [ 0h] = overflow_flag[ 1b] > > > > Al > > > > On Tue, 2010-11-09 at 10:21 -0800, Al Chu wrote: > > > Hi James, > > > > > > Gut feeling is that something is wrong with the BMC on that motherboard. > > > > > > > [ 0h] = record_count[ 8b] > > > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > > [ 1Ch] = record_count[16b] > > > > > > > > [ 80h] = free_space[ 8b] > > > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > In the first payload above, you receive a 0 record_count, which is of > > > course very bad. Then in the below you get a (seemingly) correct > > > record_count and an extra field. The full payload of this packet is > > > supposed to be something like this: > > > > > > The fact that on 3-4 different rusn in your debug output, the SDR Info > > > command is returning inconsistent data, incorrect data, and multiple > > > sizes of payloads is very bad. > > > > > > Do any other FreeIPMI commands work, like bmc-info, ipmi-sel, etc. > > > > > > Al > > > > > > On Tue, 2010-11-09 at 07:00 -0800, James Aaron Nix-Gomez wrote: > > > > Good morning, > > > > > > > > > > > > > > > > I am having problems using ipmi-sensors on a SuperMicro X8DTH-if. I > > > > consistently get "ipmi_sdr_cache_create: internal IPMI error" > messages. > > I > > > > have attached the debug information below. We have other SuperMicro > > > > X8DTH-if machines, with slightly different RAID hardware, where > > ipmi-sensors > > > > works flawlessly. I would appreciate any advice on how to > troubleshoot > > this > > > > issue. > > > > > > > > > > > > > > > > Best regards, > > > > > > > > > > > > > > > > Aaron Nix-Gomez > > > > > > > > Applied Physics Laboratory > > > > > > > > University of Washington > > > > > > > > aaron@apl.washington.edu > > > > > > > > > > > > > > > > > > > > > > > > [root@aaronacint2010 Aaron]# uname -a > > > > > > > > Linux aaronacint2010 2.6.32.12-115.fc12.x86_64 #1 SMP Fri Apr 30 > > 19:46:25 > > > > UTC 2010 x86_64 x86_64 x86_64 GNU/Linux > > > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --version > > > > > > > > ipmi-sensors - 0.7.12 > > > > > > > > Copyright (C) 2003-2008 FreeIPMI Core Team > > > > > > > > This program is free software; you may redistribute it under the terms > > of > > > > > > > > the GNU General Public License. This program has absolutely no > > warranty. > > > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > > > Caching SDR repository information: > > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Request > > > > > > > > ===================================================== > > > > > > > > [ 20h] = cmd[ 8b] > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Response > > > > > > > > ===================================================== > > > > > > > > [ 20h] = cmd[ 8b] > > > > > > > > [ 0h] = comp_code[ 8b] > > > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > > > [ 0h] = record_count[ 8b] > > > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug > > > > > > > > Caching SDR repository information: > > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Request > > > > > > > > ===================================================== > > > > > > > > [ 20h] = cmd[ 8b] > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Response > > > > > > > > ===================================================== > > > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > > > Caching SDR repository information: > > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Request > > > > > > > > ===================================================== > > > > > > > > [ 20h] = cmd[ 8b] > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Response > > > > > > > > ===================================================== > > > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > > > [root@aaronacint2010 Aaron]# ipmi-sensors --debug --sdr-cache-recreate > > > > > > > > Caching SDR repository information: > > > > /root/.freeipmi/sdr-cache/sdr-cache-aaronacint2010.localhost > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Request > > > > > > > > ===================================================== > > > > > > > > [ 20h] = cmd[ 8b] > > > > > > > > ===================================================== > > > > > > > > Get SDR Repository Info Response > > > > > > > > ===================================================== > > > > > > > > [ 20h] = cmd[ 8b] > > > > > > > > [ 0h] = comp_code[ 8b] > > > > > > > > [ 1h] = sdr_version_major[ 4b] > > > > > > > > [ 5h] = sdr_version_minor[ 4b] > > > > > > > > [ 1Ch] = record_count[16b] > > > > > > > > [ 80h] = free_space[ 8b] > > > > > > > > ipmi_sdr_cache_create: internal IPMI error > > > > > > > > [root@aaronacint2010 Aaron]# > > > > > > > > _______________________________________________ > > > > Freeipmi-users mailing list > > > > Freeipmi-users@gnu.org > > > > > http://BLOCKEDBLOCKEDBLOCKEDBLOCKEDlists.gnu.org/mailman/listinfo/freeipmi-users > > > > > -- > Albert Chu > chu11@llnl.gov > Computer Scientist > High Performance Systems Division > Lawrence Livermore National Laboratory > > -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory From MAILER-DAEMON Mon Nov 22 12:36:29 2010 Received: from mailman by lists.gnu.org with archive (Exim 4.43) id 1PKaJZ-0002GV-J0 for mharc-freeipmi-users@gnu.org; Mon, 22 Nov 2010 12:36:29 -0500 Received: from [140.186.70.92] (port=59494 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PKaJX-0002F4-6G for freeipmi-users@gnu.org; Mon, 22 Nov 2010 12:36:28 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PKaJV-0005h1-Mq for freeipmi-users@gnu.org; Mon, 22 Nov 2010 12:36:27 -0500 Received: from nspiron-2.llnl.gov ([128.115.41.82]:38003) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PKaJV-0005gS-G8 for freeipmi-users@gnu.org; Mon, 22 Nov 2010 12:36:25 -0500 X-Attachments: None Received: from crazyclimber.llnl.gov (HELO [134.9.93.113]) ([134.9.93.113]) by nspiron-2.llnl.gov with ESMTP; 22 Nov 2010 09:36:22 -0800 From: Al Chu To: Xiaomin Dong In-Reply-To: <4CE75D1F.5030405@sac.com> References: <4CE75D1F.5030405@sac.com> Content-Type: text/plain Date: Mon, 22 Nov 2010 09:36:22 -0800 Message-Id: <1290447382.6900.83.camel@crazyclimber.llnl.gov> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-19.el5) Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. Cc: freeipmi-users@gnu.org Subject: [Freeipmi-users] Re: Question about nagios_ipmimonitoring.sh X-BeenThere: freeipmi-users@gnu.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GNU FreeIPMI User's Mailing List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 22 Nov 2010 17:36:28 -0000 Hi Xiaomin, For in-band communication on the motherboard root is typically required for access, but it isn't required for out of band access. So I'd recommend doing out-of-band access. Out of band is also better b/c it won't interrupt the CPU like in-band does. If you really want to do in-band communication as non-root, my recommendation would be to load the IPMI kernel driver (/dev/ipmi0) and use that as the inband device instead of the default FreeIPMI driver (-D openipmi on the command line or edit /etc/freeipmi.conf appropriately). You can change the /dev/ipmi0 permissions to whatever you want, thus allowing non-root access. You will need to compile FreeIPMI with --with-dont-check-for-root too so it won't check for root. Hope that helps, Al On Fri, 2010-11-19 at 21:31 -0800, Xiaomin Dong wrote: > Hi Albert, > > I'd like to use your nagios_ipmimonitoring.sh in my nagios monitoring. > But it seems the no-root users can not run /usr/sbin/ipmimonitoring. > Can you please advise how you solve this problem? > > Thanks for help. > Xiaomin Dong -- Albert Chu chu11@llnl.gov Computer Scientist High Performance Systems Division Lawrence Livermore National Laboratory