ProLiant Servers (ML,DL,SL)
cancel
Showing results for 
Search instead for 
Did you mean: 

Proliant DL385 & Linux & SNMP traps for SCSI

 
Christian Charette
Occasional Contributor

Proliant DL385 & Linux & SNMP traps for SCSI

Hey all.

I'm trying to get my box to spit out SNMP traps when I have HDD failure, but I'm having difficulties. Anyone have experience with SuSE 9 and the HP hpasm package?

I've installed net-snmp from HP, and install the hpasm package (7.5.1-8 sles9 x86_64). I get power supply traps ok and fan notifications if I pull one out. But the storage agents are dead on arrival.

Running a /etc/init.d/hpasm status, I get the following:

[root@mp-03 /]>> /etc/init.d/hpasm status
hpasmd is running...
Status of Foundation Agents (cmafdtn): cmathreshd cmahostd cmapeerd
cmathreshd is running...
cmahostd is running...
cmapeerd is running...
Status of Server Agents (cmasvr): cmastdeqd cmahealthd cmaperfd
cmastdeqd is running...
cmahealthd is running...
cmaperfd is running...
Status of Storage Agents (cmastor): cmaeventd cmaidad cmafcad cmaided cmascsid c
masasd
cmaeventd is stopped...
cmaidad is stopped...
cmafcad is stopped...
cmaided is running...
cmascsid is stopped...
cmasasd is stopped...


Ok. So looking at this, my storage agents are stopped. When I reboot, I get segment faults from the console:

cmaidad[5605]: segfault at 0000000000000000 rip 0000000000000000 rsp 00000000fff
fdb9c error 14
cmaeventd[5630]: segfault at 0000000000000000 rip 0000000000000000 rsp 00000000f
fffd54c error 14
cmafcad[5632]: segfault at 0000000000000000 rip 0000000000000000 rsp 00000000fff
fdb0c error 14

When I plug in a monitor to the box, I get different (???) segment fault errors:

cmaeventd[5628] segfault at 0000000000000000 rip 0000000000000000 rsp 000000ffffd54c error 14
cmaidad[5635]: segfault at 0000000000000000 rip 0000000000000000 rsp 000000ffffdb9c error 14
cmafcad[5643]: segfault at 0000000000000000 rip 0000000000000000 rsp 000000ffffdb0c error 14


These ones are specifically related to storage agents.

Any ideas?

I noticed that it's not the first time segment faults have been found on this driver:

http://h50146.www5.hp.com/products/software/oe/linux/mainstream/bin/support/doc/general/mgmt/ima/v730c/hpasm-7.3.0c-67.sles9.x86_64.rpm.txt
2 REPLIES
Christian Charette
Occasional Contributor

Re: Proliant DL385 & Linux & SNMP traps for SCSI

...more....

When I try to start the SNMP storage agent seperately, I get the following output:

/opt/compaq/storage/bin/cmascsid -p 15
SCSI agent (cmascsid) does not find any suppo
rted SCSI controller, exits.

?!

I tried installing the latest SCSI driver:
http://h18023.www1.hp.com/support/files/server/us/download/24900.html

But I still get nada:
>> rpm -qa | grep mptlinux
mptlinux-3.02.69.01-1
>> /opt/compaq/storage/bin/cmascsid -p 15 -s OK
>> SCSI agent (cmascsid) does not find any supported SCSI controller, exits.

Christian Charette
Occasional Contributor

Re: Proliant DL385 & Linux & SNMP traps for SCSI

The solution was to install the HP cciss driver. This allows the hpasm to read drive status and raise flags (log, snmp) when a drive is pulled or replaced.