Operating System - OpenVMS
1828625 Members
1840 Online
109983 Solutions
New Discussion

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

 
SOLVED
Go to solution
Peter Quodling
Trusted Contributor

Error Logging on VMS 7.3-2 on an Alpha 4100



Have an Alpha 4100 running VMS 7.3-2, most patches in place.

Seeing disk errors on san attached disk (HDS) , and looking for more detail.

Of course, anal/err returns ....

Error Log Report Formatter (ERF) Version V7.3-2

%ERF-F-CEHFND, new header format found
-ERF-I-CEHCVT, use ANALYZE/ERROR_LOG/ELV CONVERT or DECevent conversion utility
P


anal/error/elv convert - appears to have done something.

anal/error/elv (this elv errorlog viewer sounds interesting) - except none of the commands in it seem to have much to do with viewing error logs. So I go googling, and find that I should try the following.

So I ...
mcr decevent_cvt_cef sys$errorlog:errlog.sys errlog.cvt

and then
anal/err errorlog.cvt

And I get a whole bunch of erf-w-unkpktfmt errors. and imcompatible entry formats errors.

the only other significant reference I find is in the 7.3-1 release notes that infers that you can use decevent, or compaq analyze.

I have seen references to say that DECEVENT needs to be licensed, And that Compaq analyze is in WEBES. Looking at the documentation for webes, it seems bigger than ben hur to install...

Is there a simple way to bring back Errorlog, like I used to have...

Leave the Money on the Fridge.
11 REPLIES 11
Ian Miller.
Honored Contributor
Solution

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

$ ANALYZE/ERROR_LOG/ELV CONVERT
will convert new format to old then ANAL/ERROR should work on the converted file.

I think you can install DECevent on a system without a licence to do bit to text translation i.e interpret the error log.

See http://h18023.www1.hp.com/support/svctools/decevent/index.html

____________________
Purely Personal Opinion
Peter Quodling
Trusted Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

Thanks, Ian that makes a bit more sense. now I am beating my head against a bunch of

%ERF-W-UNKPKTFMT, unknown packet format,

errors and various errors like

HW_MODEL: 00000647 Hardware Model = 1607.

DEVICE ERROR AlphaServer 4100 5/533 4MB

GENERIC DK SUB-SYSTEM, UNIT _$1$DGA121:

** INCOMPATIBLE ENTRY FORMAT **

LONGWORD 1. 30303030
/0000/
LONGWORD 2. 00000305
/..../
LONGWORD 3. 00000000
/..../
LONGWORD 4. 00020000
/..../
LONGWORD 5. 00000000
/..../
LONGWORD 6. 00000100
/..../
LONGWORD 7. 00000A00
/..../
%ERF-W-UNKPKTFMT, unknown packet format, entry 3726 skipped

Clearly, the SAN system. (HDS Thunder 9500) talking via McData Switches, while it claims "OpenVMS" compatability, doesn't extended to debugging errors. I have raised this with HDS, but would appreciate any usefull input. (Apart from "dump it and use Storageworks")

q
Leave the Money on the Fridge.
Ian Miller.
Honored Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

Those entries are too new for ERF. They stopped updating it years ago and new hardware is not known to ERF.
DECevent should know how to decode them.
compaq analyze is needed for DS20 and so on but for the hardware you mentioned try DECevent V3.4 as its a lot easier to use.
____________________
Purely Personal Opinion
Peter Quodling
Trusted Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

DECEVENT I have found a kit for but can't find a compaq analyze kit anywhere - I have a DS25 DR machine, that I will want to set up, once I find that kit...


q
Leave the Money on the Fridge.
Wim Van den Wyngaert
Honored Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

Compaq analyze is party of WEBES.
http://h18023.www1.hp.com/support/svctools/webes/

Just enter x x x as false id and download it.

Wim
Wim
Peter Quodling
Trusted Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

DEC Event asks for a PAK? Poking around the net, I get the impression that it's there for the asking, but can quite work out who/where to ask...

q
Leave the Money on the Fridge.
David B Sneddon
Honored Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

Peter,

DECevent only needs the PAK to do nifty things.
It will still run without the PAK to allow you
to decode the errors.

Dave
Peter Quodling
Trusted Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

Yeahbut, I am a geek, I love doing those "nifty things". (Yes, it is working for me, but I hate "nobbled" software...)

q
Leave the Money on the Fridge.
Ian Miller.
Honored Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

If you have a hardware contract covering this alpahserver then you can ask hp(log a support call) and they will give you a pak for DECevent (it will have a termination date)
____________________
Purely Personal Opinion
Peter Quodling
Trusted Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

I now have decevent tracking this, and we are seeing errors like the following . The storage (HDS and McData folk, know what's happening at these points in time, but haven't told us yet... possibly zone changes...) Can we a) find out more, b) disable these, or c) should we be worried.

q




**** V3.4 ********************* ENTRY 4 ********************************


Logging OS 1. OpenVMS
System Architecture 2. Alpha
OS version V7.3-2
Event sequence number 12717.
Timestamp of occurrence 16-SEP-2005 14:55:15
Time since reboot 40 Day(s) 0:48:50
Host name INFAC1

System Model AlphaServer 4100 5/533 4MB

Entry Type 1. Device Error


---- Device Profile ----
Unit $1$DGA130
Product Name DF600F
Vendor HITACHI


-- Driver Supplied Info -
Device Firmware Revision 0000
VMS SCSI Error Type 5. Extended Sense Data from Device
SCSI ID x0000000000000003
SCSI LUN x0000000000000300
Port Status x00000001 NORMAL - normal successful completion
SCSI Command Opcode x2A Write (10 byte command)
Command Data
x00
x01
x29
xFA
x59
x00
x00
x01
x00

SCSI Status x02 Check Condition
Remaining Byte Length 64.

--- Device Sense Data ---

Error Code x70 Current Error
Segment # x00
Information Byte 3 x00
Byte 2 x00
Byte 1 x00
Byte 0 x00
Sense Key x06 Unit Attention
Additional Sense Length x38
CMD Specific Info Byte 3 x00
Byte 2 x00
Byte 1 x00
Byte 0 x00
ASC & ASCQ x2A00 ASC = x002A
ASCQ = x0000
Parameters Changed
FRU Code x00
Sense Key Specific Byte 0 x00 Sense Key Data NOT Valid
Byte 1 x00
Byte 2 x00

Count of valid bytes: 46.


15--<-12 11--<-08 07--<-04 03--<-00 :Byte Order
0000: 00000000 81110000 00000000 00000040 *@...............*
0010: 00000000 00000000 00000000 00000000 *................*
0020: 00000000 00000000 00000000 00000000 *................*


----- Software Info -----
UCB$x_ERTCNT 16. Retries Remaining
UCB$x_ERTMAX 16. Retries Allowable
IRP$Q_IOSB x0000000000000000
UCB$x_STS x18021810 Online
Software Valid
Unload At Dismount
Volume is Valid on the local node
Unit supports the Extended Function bit
IRP$L_PID x00BD0038 Requestor "PID"
IRP$x_BOFF 6656. Byte Page Offset
IRP$x_BCNT 512. Transfer Size In Byte(s)
UCB$x_ERRCNT 9. Errors This Unit
UCB$L_OPCNT 484020620. QIO's This Unit
ORB$L_OWNER x00010004 Owners UIC
UCB$L_DEVCHAR1 x1C4D5008 Directory Structured
File Oriented
Sharable
Available
Mounted
Error Logging
Capable of Input
Capable of Output
Random Access


**** V3.4 ********************* ENTRY 5 ********************************


Logging OS 1. OpenVMS
System Architecture 2. Alpha
OS version V7.3-2
Event sequence number 12718.
Timestamp of occurrence 16-SEP-2005 14:55:15
Time since reboot 40 Day(s) 0:48:50
Host name INFAC1

System Model AlphaServer 4100 5/533 4MB

Entry Type 1. Device Error


---- Device Profile ----
Unit $1$DGA130
Product Name DF600F
Vendor HITACHI

-- Driver Supplied Info -
Device Firmware Revision 0000
VMS SCSI Error Type 5. Extended Sense Data from Device
SCSI ID x0000000000000003
SCSI LUN x0000000000000300
Port Status x00000001 NORMAL - normal successful completion
SCSI Command Opcode x2A Write (10 byte command)
Command Data
x00
x01
x29
xFA
x59
x00
x00
x01
x00

SCSI Status x02 Check Condition
Remaining Byte Length 64.

--- Device Sense Data ---

Error Code x70 Current Error
Segment # x00
Information Byte 3 x00
Byte 2 x00
Byte 1 x00
Byte 0 x00
Sense Key x04 Hardware Error
Additional Sense Length x38
CMD Specific Info Byte 3 x00
Byte 2 x00
Byte 1 x00
Byte 0 x00
ASC & ASCQ x9599 ASC = x0095
ASCQ = x0099
Device Vendor Specific ASC/ASCQ
unrecognized.
FRU Code x00
Sense Key Specific Byte 0 x00 Sense Key Data NOT Valid
Byte 1 x00
Byte 2 x00

Count of valid bytes: 46.


15--<-12 11--<-08 07--<-04 03--<-00 :Byte Order
0000: 82000000 CE130000 00000000 00000040 *@...............*
0010: FFFF0000 00002200 10000320 03025D00 *.].. ...."......*
0020: 00000000 00000000 00004000 010059FA *.Y...@..........*


----- Software Info -----
UCB$x_ERTCNT 16. Retries Remaining
UCB$x_ERTMAX 16. Retries Allowable
IRP$Q_IOSB x0000000000000000
UCB$x_STS x18021810 Online
Software Valid
Unload At Dismount
Volume is Valid on the local node
Unit supports the Extended Function bit
IRP$L_PID x00BD0038 Requestor "PID"
IRP$x_BOFF 6656. Byte Page Offset
IRP$x_BCNT 512. Transfer Size In Byte(s)
UCB$x_ERRCNT 10. Errors This Unit
UCB$L_OPCNT 484020626. QIO's This Unit
ORB$L_OWNER x00010004 Owners UIC
UCB$L_DEVCHAR1 x1C4D5008 Directory Structured
File Oriented
Sharable
Available
Mounted
Error Logging
Capable of Input
Capable of Output
Random Access


Leave the Money on the Fridge.
Peter Quodling
Trusted Contributor

Re: Error Logging on VMS 7.3-2 on an Alpha 4100

Storage people advise that this coincides with RSCN (Registered State change notifications) on the MCDATA Switches, and is asking switch vendor about it.
Output from switch manager...

INFORMATIONAL Fabric Product Audit Event Zone Set enabled: Mustard_Prodn 131.242.218.195 2005/09/16 14:55:12 1000080088040D1A

INFORMATIONAL Fabric Product Audit Event Zone Set enabled: Mustard_Prodn 10.1.8.11 2005/09/16 14:55:12 100008008803A62B

INFORMATIONAL Fabric Product Audit Event Zone Set enabled: Mustard_Prodn 10.1.8.9 2005/09/16 14:55:12 10000800886042A6

INFORMATIONAL Fabric Product Audit Event Zone Set enabled: Mustard_Prodn 10.1.8.7 2005/09/16 14:55:12 10000800880207D6

INFORMATIONAL Fabric Product Audit Event Zone Set enabled: Mustard_Prodn 10.1.8.5 2005/09/16 14:55:12 10000800880207D8

INFORMATIONAL Fabric Product Audit Event Zone Set enabled: Mustard_Prodn 10.0.41.31 2005/09/16 14:55:12 1000080088602AB9

INFORMATIONAL Application Interface Product Audit Event Zone Set enabled: Mustard_Prodn 10.1.8.3 2005/09/16 14:55:12 100008008860D29E





Leave the Money on the Fridge.