[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8 servers
From: |
Albert Chu |
Subject: |
Re: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8 servers |
Date: |
Fri, 12 Apr 2013 10:25:26 -0700 |
Hey Stephen,
That's exactly what I need. I'll get it into a FreeIPMI branch so you
can try it out before I release.
Al
On Fri, 2013-04-12 at 09:57 -0700, Stephen Abbene wrote:
> Al
>
> HP finally got back to me with the information about the error lights. I
> have included the email below. Let me know if there is any other information
> that you need.
>
> -----Original Message-----
>
> Stephen,
>
> You are certainly welcome.
>
> I just now received the data you were asking for. As I previously wrote, the
> UID and HEALTH LED information in the SDR is not currently getting updated.
> The fix requires changes to iLO 4 firmware and SDR tables (ROM update). These
> changes will be included in the next iLO4 release 1.30 (ETA Sept 2013).
>
> The "UID Light" has a sensor type OEM LED (0xC0) and EventReadingTypeCode =
> UID: (0x70) 0x0001 = On. 0x0002 = Off. 0x0004 = Blinking.
>
> The "Sys. Health LED" has a sensor type OEM LED (0xC0) and
> EventReadingTypeCode = HealthLED: (0x71) 0x0001 = Green. 0x0002 = Amber.
> 0x0004 = Red.
>
> I hope that meets your needs and answers your questions.
>
>
> -----Original Message-----
> From: Albert Chu [mailto:address@hidden
> Sent: Friday, March 15, 2013 10:16 AM
> To: Stephen Abbene
> Cc: address@hidden
> Subject: RE: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8 servers
>
> Cool. It is definitely reverse engineer-able, but usually requires some
> vendor provided software to figure out what they think the magic is and doing
> some tricks to try and get the sensor to do what you want it to do. But
> there are gotchas along the way. The best bet is to just get the real info
> from the source.
>
> Al
>
> On Fri, 2013-03-15 at 10:01 -0700, Stephen Abbene wrote:
> > Thanks Albert,
> >
> > I will get a hold of my contacts at HP and see if they can get me that
> > information. I also have a few HP Gen7/Gen8 servers set aside for testing
> > and I could try to recreate the different error light states if you think
> > that would be helpful.
> >
> > -----Original Message-----
> > From: Albert Chu [mailto:address@hidden
> > Sent: Friday, March 15, 2013 9:57 AM
> > To: Stephen Abbene
> > Cc: address@hidden
> > Subject: RE: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8
> > servers
> >
> > Hey Stephen,
> >
> > Those are OEM specific sensors. While I support a number of OEM
> > sensors in FreeIPMI, I don't yet support these from HP. I've asked HP
> > several times for the "magic" to interpret those sensors, but they
> > have been unable or unwilling to provide me the magic.
> >
> > I see that you work at Nvidia. If HP is a partner of yours, perhaps
> > you might have some leverage to get the "magic" out of them?
> > Basically, I need the sensor event table so that I know
> > (hypothetically) 00h = "ok", 01h = "led on", 02h = "led off", 04h =
> > "blinking", or whatever it may be.
> >
> > Al
> >
> > On Thu, 2013-03-14 at 18:21 -0700, Stephen Abbene wrote:
> > > Thanks for replying so Quickly Albert. `ipmi-sensors -W
> > > discretereading` seems to have done the trick thank you. Any tips
> > > on getting the "System Chassis 1 UID Light" and "System Chassis 2
> > > Health LED" fields to populate with meaningful data?
> > >
> > > I have included the output of `ipmi-sensors -W discretereading --debug`
> > > below:
> > >
> > >
> > > -----Original Message-----
> > > From: Albert Chu [mailto:address@hidden
> > > Sent: Thursday, March 14, 2013 5:35 PM
> > > To: Stephen Abbene
> > > Cc: address@hidden
> > > Subject: Re: [Freeipmi-users] ipmi-sensors missing data on HP Gen7/8
> > servers
> > >
> > > Hi Stephen,
> > >
> > > Could you please try the "discretereading" workaround (i.e. -W
> > discretereading). There's a description of the issue in the manpage
> > about the issue and it's only been seen on HP systems. I believe HP
> > has acknowledged the issue, but due to legacy reasons I don't believe
> > they want to change it.
> > >
> > > If that doesn't work, please send the --debug output.
> > >
> > > Al
> > >
> > > On Thu, 2013-03-14 at 16:55 -0700, Stephen Abbene wrote:
> > > > Hello,
> > > >
> > > >
> > > >
> > > > Today I installed freeipmi 1.2.5 on several HP gen7 and Gen8
> > servers and have noticed that all the systems are missing the RPM from
> > the fans and the power meter information. Is there a way I can obtain
> > this information? I can provide the output from --debug or any other
> > information you may need.
> > > >
> > > >
> > > >
> > > > Here are some of the specs of one of the systems I am testing, I
> > can provide more information if it is required.
> > > >
> > > > . HP Proliant DL160 G8
> > > >
> > > > . CentOS 5.7
> > > >
> > > > . 2x Intel(R) Xeon(R) CPU E5-2680 0 @ 2.70GHz
> > > >
> > > > . 1x Power Supply (detected)
> > > >
> > > > . 8x fans (detected)
> > > >
> > > >
> > > >
> > > > I have included the output of ` ipmi-sensors
> > --entity-sensor-names` and `bmc-info` from a HP dl160 Gen8 below.
> > > >
> > > >
> > > >
> > > > # ipmi-sensors --entity-sensor-names
> > > >
> > > > ID | Name | Type
> > | Reading | Units | Event
> > > >
> > > > 0 | System Chassis 1 UID Light | OEM Reserved | N/A
> > | N/A | 'OEM Event = 0000h'
> > > >
> > > > 1 | System Chassis 2 Health LED | OEM Reserved | N/A |
> > N/A | 'OEM Event = 0000h'
> > > >
> > > > 2 | Power Supply 1 Power Supply 1 | Power Supply | N/A
> > | N/A | 'Presence detected'
> > > >
> > > > 3 | System Board 1 Fan 1 | Fan
> > | N/A | N/A | N/A
> > > >
> > > > 4 | System Board 2 Fan 2 | Fan
> > | N/A | N/A | 'transition to Running'
> > > >
> > > > 5 | System Board 3 Fan 3 | Fan
> > | N/A | N/A | 'transition to Running'
> > > >
> > > > 6 | System Board 4 Fan 4 | Fan
> > | N/A | N/A | 'transition to Running'
> > > >
> > > > 7 | System Board 5 Fan 5 | Fan
> > | N/A | N/A | 'transition to Running'
> > > >
> > > > 8 | System Board 6 Fan 6 | Fan
> > | N/A | N/A | 'transition to Running'
> > > >
> > > > 9 | System Board 7 Fan 7 | Fan
> > | N/A | N/A | 'transition to Running'
> > > >
> > > > 10 | System Board 8 Fan 8 | Fan
> > | N/A | N/A | 'transition to Running'
> > > >
> > > > 11 | System Board 9 Fans | Fan
> > | N/A | N/A | 'Fully Redundant'
> > > >
> > > > 13 | Air Inlet 01-Inlet Ambient | Temperature
> > | 25.00 | C | 'OK'
> > > >
> > > > 14 | Processor 1 02-CPU 1 | Temperature
> > | 40.00 | C | 'OK'
> > > >
> > > > 15 | Processor 2 03-CPU 2 | Temperature
> > | 40.00 | C | 'OK'
> > > >
> > > > 16 | Memory Device 1 04-P1 DIMM 1-6 | Temperature
> > | 38.00 | C | 'OK'
> > > >
> > > > 17 | Memory Device 2 05-P1 DIMM 7-12 | Temperature
> > | 39.00 | C | 'OK'
> > > >
> > > > 18 | Memory Device 3 06-P2 DIMM 1-6 | Temperature
> > | 34.00 | C | 'OK'
> > > >
> > > > 19 | Memory Device 4 07-P2 DIMM 7-12 | Temperature
> > | 29.00 | C | 'OK'
> > > >
> > > > 20 | Memory Device 5 08-P1 Mem Zone | Temperature
> > | 38.00 | C | 'OK'
> > > >
> > > > 21 | Memory Device 6 09-P1 Mem Zone | Temperature
> > | 38.00 | C | 'OK'
> > > >
> > > > 22 | Memory Device 7 10-P1 Mem Zone | Temperature
> > | 39.00 | C | 'OK'
> > > >
> > > > 23 | Memory Device 8 11-P1 Mem Zone | Temperature
> > | 42.00 | C | 'OK'
> > > >
> > > > 24 | Memory Device 9 12-P1 Mem Zone | Temperature
> > | 40.00 | C | 'OK'
> > > >
> > > > 25 | Memory Device 10 13-P1 Mem Zone | Temperature
> > | 39.00 | C | 'OK'
> > > >
> > > > 26 | Memory Device 11 14-P2 Mem Zone | Temperature
> > | 36.00 | C | 'OK'
> > > >
> > > > 27 | Memory Device 12 15-P2 Mem Zone | Temperature
> > | 35.00 | C | 'OK'
> > > >
> > > > 28 | Memory Device 13 16-P2 Mem Zone | Temperature
> > | 34.00 | C | 'OK'
> > > >
> > > > 29 | Memory Device 14 17-P2 Mem Zone | Temperature
> > | 33.00 | C | 'OK'
> > > >
> > > > 30 | Memory Device 15 18-P2 Mem Zone | Temperature
> > | 31.00 | C | 'OK'
> > > >
> > > > 31 | Memory Device 16 19-P2 Mem Zone | Temperature
> > | 30.00 | C | 'OK'
> > > >
> > > > 32 | Disk 20-HD Max
> > | Temperature | N/A | C | N/A
> > > >
> > > > 33 | System Board 1 21-Chipset | Temperature
> > | 46.00 | C | 'OK'
> > > >
> > > > 34 | Power Supply 2 22-P/S |
> > Temperature | 32.00 | C | 'OK'
> > > >
> > > > 35 | Power Unit 1 23-VR P1 | Temperature
> > | 43.00 | C | 'OK'
> > > >
> > > > 36 | Power Unit 2 24-VR P2 | Temperature
> > | 28.00 | C | 'OK'
> > > >
> > > > 37 | Power Unit 3 25-VR P1 Zone | Temperature
> > | 45.00 | C | 'OK'
> > > >
> > > > 38 | Power Unit 4 26-VR P1 Mem | Temperature
> > | 40.00 | C | 'OK'
> > > >
> > > > 39 | Power Unit 5 27-VR P1 Mem | Temperature
> > | 38.00 | C | 'OK'
> > > >
> > > > 40 | Power Unit 6 28-VR P2 Mem
> > | Temperature | 30.00 | C | 'OK'
> > > >
> > > > 41 | Power Unit 7 29-VR P2 Mem
> > | Temperature | 28.00 | C | 'OK'
> > > >
> > > > 42 | Battery 30-Supercap Max
> > | Temperature | N/A | C | N/A
> > > >
> > > > 43 | System Management Module 31-iLO Zone | Temperature
> > | 35.00 | C | 'OK'
> > > >
> > > > 44 | System Board 2 32-LOM
> > | Temperature | N/A | C | N/A
> > > >
> > > > 45 | Add-in Card 1 33-PCI 1 | Temperature
> > | N/A | C | N/A
> > > >
> > > > 46 | Add-in Card 2 34-PCI 2 | Temperature
> > | N/A | C | N/A
> > > >
> > > > 47 | System Internal Expansion Board 1 35-PCI 1 Zone | Temperature
> > | 33.00 | C | 'OK'
> > > >
> > > > 48 | System Internal Expansion Board 2 36-PCI 2 Zone | Temperature
> > | 33.00 | C | 'OK'
> > > >
> > > > 49 | Add-in Card 3 37-LOM Card | Temperature
> > | N/A | C | N/A
> > > >
> > > > 50 | System Board 3 38-System Board | Temperature
> > | 26.00 | C | 'OK'
> > > >
> > > > 51 | Back Panel Board 39-Sys Exhaust | Temperature
> > | 40.00 | C | 'OK'
> > > >
> > > > 52 | System Board 10 Power Meter | Current
> > | N/A | N/A | 'Device Enabled'
> > > >
> > > > 53 | System Board 11 Memory | Memory
> > | N/A | N/A | 'Presence detected'
> > > >
> > > >
> > > > # bmc-info
> > > >
> > > > Device ID : 19
> > > >
> > > > Device Revision : 1
> > > >
> > > > Device SDRs : unsupported
> > > >
> > > > Firmware Revision : 1.10
> > > >
> > > > Device Available : yes (normal operation)
> > > >
> > > > IPMI Version : 2.0
> > > >
> > > > Sensor Device : supported
> > > >
> > > > SDR Repository Device : supported
> > > >
> > > > SEL Device : supported
> > > >
> > > > FRU Inventory Device : supported
> > > >
> > > > IPMB Event Receiver : unsupported
> > > >
> > > > IPMB Event Generator : unsupported
> > > >
> > > > Bridge : unsupported
> > > >
> > > > Chassis Device : supported
> > > >
> > > > Manufacturer ID : Hewlett-Packard (11)
> > > >
> > > > Product ID : 8192
> > > >
> > > >
> > > >
> > > > Channel Information
> > > >
> > > >
> > > >
> > > > Channel Number : 0
> > > >
> > > > Medium Type : IPMB (I2C)
> > > >
> > > > Protocol Type : IPMB-1.0
> > > >
> > > > Active Session Count : 0
> > > >
> > > > Session Support : session-less
> > > >
> > > > Vendor ID : Intelligent Platform Management Interface
> > forum (7154)
> > > >
> > > >
> > > >
> > > > Channel Number : 2
> > > >
> > > > Medium Type : 802.3 LAN
> > > >
> > > > Protocol Type : IPMB-1.0
> > > >
> > > > Active Session Count : 0
> > > >
> > > > Session Support : multi-session
> > > >
> > > > Vendor ID : Intelligent Platform Management Interface
> > forum (7154)
> > > >
> > > >
> > > >
> > > > Channel Number : 7
> > > >
> > > > Medium Type : OEM
> > > >
> > > > Protocol Type : KCS
> > > >
> > > > Active Session Count : 0
> > > >
> > > > Session Support : session-less
> > > >
> > > > Vendor ID : Intelligent Platform Management Interface
> > forum (7154)
> > > >
> > > > Thanks,
> > > > --Stephen Abbene
> > > >
> > > > _______________________________________________
> > > > Freeipmi-users mailing list
> > > > address@hidden
> > > > https://lists.gnu.org/mailman/listinfo/freeipmi-users
> > > --
> > > Albert Chu
> > > address@hidden
> > > Computer Scientist
> > > High Performance Systems Division
> > > Lawrence Livermore National Laboratory
> > >
> > >
> > >
> > ----------------------------------------------------------------------
> > -------------
> > > This email message is for the sole use of the intended recipient(s)
> > and may contain
> > > confidential information. Any unauthorized review, use, disclosure
> > or distribution
> > > is prohibited. If you are not the intended recipient, please
> > contact the sender by
> > > reply email and destroy all copies of the original message.
> > >
> > ----------------------------------------------------------------------
> > -------------
> > --
> > Albert Chu
> > address@hidden
> > Computer Scientist
> > High Performance Systems Division
> > Lawrence Livermore National Laboratory
> >
> >
> --
> Albert Chu
> address@hidden
> Computer Scientist
> High Performance Systems Division
> Lawrence Livermore National Laboratory
>
>
--
Albert Chu
address@hidden
Computer Scientist
High Performance Systems Division
Lawrence Livermore National Laboratory