- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- HBA (TL_adapter) problem
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 01:10 AM
тАО11-23-2004 01:10 AM
Nov 22 19:19:30 srs083 vmunix: 0/2/1/0: Unable to access previously accessed device at nport ID 0x
b0100.
Nov 22 19:19:30 srs083 EMS [2456]: ------ EMS Event Notification ------ Value: "CRITICAL (5)" fo
r Resource: "/adapters/events/TL_adapter/0_2_1_0" (Threshold: >= " 3") Execute the followi
ng command to obtain event details: /opt/resmon/bin/resdata -R 160956418 -r /adapters/events/TL_
adapter/0_2_1_0 -n 160956429 -a
# /opt/resmon/bin/resdata -R 160956418 -r /adapters/events/TL_adapter/0_2_1_0 -n 160956429 -a
CURRENT MONITOR DATA:
Event Time..........: Mon Nov 22 19:19:30 2004
Severity............: CRITICAL
Monitor.............: dm_TL_adapter
Event #.............: 40
System..............: srs083
Summary:
Adapter at hardware path 0/2/1/0 : Unable to open previously opened target
Description of Error:
lbolt value: 314842115
Unable to access previously accessed target
nport ID = 0xb0100
Probable Cause / Recommended Action:
An attempt to re-open a device which had been opened earlier
has failed.
There should be additional logging messages which will
allow diagnosis of the problem.
Additional Event Data:
System IP Address...: 10.125.20.83
Event Id............: 0x41a2821200000000
Monitor Version.....: B.01.00
Event Class.........: I/O
Client Configuration File...........:
/var/stm/config/tools/monitor/default_dm_TL_adapter.clcfg
Client Configuration File Version...: A.01.00
Qualification criteria met.
Number of events..: 1
Associated OS error log entry id(s):
0x41a2821200000000
Additional System Data:
System Model Number.............: 9000/800/rp4440
OS Version......................: B.11.11
EMS Version.....................: A.04.00
STM Version.....................: A.45.00
Latest information on this event:
http://docs.hp.com/hpux/content/hardware/ems/dm_TL_adapter.htm#40
v-v-v-v-v-v-v-v-v-v-v-v-v D E T A I L S v-v-v-v-v-v-v-v-v-v-v-v-v
Component Data:
Physical Device Path....: 0/2/1/0
Vendor Id...............: 0x0000103C
Serial Number(WWN)......: 50060B0000255C72
I/O Log Event Data:
Driver Status Code..................: 0x00000028
Length of Logged Hardware Status....: 0 bytes.
Offset to Logged Manager Information: 0 bytes.
Length of Logged Manager Information: 61 bytes.
Manager-Specific Information:
Raw data from FCMS Adapter driver:
00000001 12C41C03 00000001 00000001 000B0100 2F75782F 6B65726E 2F6B6973
752F544C 2F737263 2F636F6D 6D6F6E2F 7773696F 2F74645F 6465762E 63
Now, although I can access to the LUNs in our EVA5000, but I found something abnormal:
# spmgr display
Server: srs083 Report Created: Tue, Nov 23 08:46:24 2004
Command: spmgr display
= = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = =
Storage: 5000-1FE1-5004-5440
Load Balance: On Auto-restore: Off Balance Policy: Round Robin
Path Verify: On Verify Interval: 30
HBAs: td0 td1
Controller: P5849E1AAQE04W, Operational
P5849E1AAQD02S, Operational
Devices: c12t0d0 c12t0d1 c12t0d2 c12t0d3 c12t0d4 c12t0d5 c12t0d6
c12t0d7
TGT/LUN Device WWLUN_ID H/W_Path #_Paths
0/ 0 c12t0d0 6005-08B4-0010-102C-0000-9000-004F-0000 4
255/255/0/0.0
Controller Path_Instance HBA Preferred? Path_Status
P5849E1AAQE04W no
c4t0d1 td0 YES Active
c9t0d1 td1 YES Active
Controller Path_Instance HBA Preferred? Path_Status
P5849E1AAQD02S no
c8t0d1 td0 no Standby
c5t0d1 td1 no Standby
TGT/LUN Device WWLUN_ID H/W_Path #_Paths
0/ 1 c12t0d1 6005-08B4-0010-102C-0000-9000-0052-0000 4
255/255/0/0.1
Controller Path_Instance HBA Preferred? Path_Status
P5849E1AAQE04W no
c4t0d2 td0 no Standby
c9t0d2 td1 no Standby
Controller Path_Instance HBA Preferred? Path_Status
P5849E1AAQD02S no
c8t0d2 td0 YES Active
c5t0d2 td1 YES Active
...
...
* There are totally 4 paths to the LUNs, they are supposed to be one "Active", one "Available" and the other two "Standby", but now, two of them are "Active", there must be something wrong !
So, could you please tell me what happen and what I can do ? Hardware problem with HBA ?
Thanks !
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 01:31 AM
тАО11-23-2004 01:31 AM
Re: HBA (TL_adapter) problem
You can check it out with cstm mstm or X based xstm
You will find it non-functional and will want to arrange replacement. Since you have an alternate path, you can afford to wait a while.
Just to be sure, I'd make sure the fabric network is working and nobody re-zoned your fiber switch.
SEP
Owner of ISN Corporation
http://isnamerica.com
http://hpuxconsulting.com
Sponsor: http://hpux.ws
Twitter: http://twitter.com/hpuxlinux
Founder http://newdatacloud.com
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 01:42 AM
тАО11-23-2004 01:42 AM
Re: HBA (TL_adapter) problem
But why "spmgr" shows wrong now: two "Active" ?
Thanks !
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 02:35 AM
тАО11-23-2004 02:35 AM
Re: HBA (TL_adapter) problem
Have you checked to see whether that nPortID is not a disk?
Run the following:
fcmsutil /dev/tdX devstat 0xb0100
to obtain info & stats on that nPortID
then to check it run:
fcmsutil /dev/tdX test 0xb0100 1024 Y
Replace X with the appropriate td value & the Y with a count vaue - the 1024 is a numeric value indicating a packet size and must be a multiple of four.
Could be you got an intermittent disk error indicating a possible imminant disk failure.
Rgds,
Jeff
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 03:03 AM
тАО11-23-2004 03:03 AM
Re: HBA (TL_adapter) problem
mark
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 03:15 AM
тАО11-23-2004 03:15 AM
Re: HBA (TL_adapter) problem
# fcmsutil /dev/td0 devstat 0xb0100
Tue Nov 23 11:09:42 2004
Device Statistics for Nport_id 0x0b0100
Successful opens of the device 311691
Failed Open of previously opened device 10
PLOGIs sent to the device 311688
PLOGIs Timedout 10
PRLIs sent to the device 311678
PRLIs Timedout 0
Bad PRLI resps 10
PRLOs received 0
ADISCs sent to the device 0
ADISCs Timedout 0
Authentication failures 0
LOGOs sent to the device 311678
LOGOs Timedout 0
LOGOs received 0
Target resets sent 0
Target resets failed 0
Implicit Logouts on the device 0
Bad TPRLO resp 0
PLOGI Resps error statistics ...
LS_RJTs recvd for PLOGI/PDISC sent 0
Short PLOGI Resps recvd 0
Low supported version higher than FC-PH-3 0
High supported version lower than FC-PH 4.3 0
No Class3 support 0
PWWN authentication failure 0
NWWN authentication failure 0
PLOGI retries in state dvs_open_plogi_delay 0
I/O Statistics ...
Assisted I/O requests 997744
Timedout I/Os 0
No CDB available for I/O 0
2nd Level Error Recovery 0
I/O Completion Statistics ...
Good I/O completions 568174
Read underflows 429570
Link Failure During FCP_RSP 0
FCP_RSP Overflow 0
Outbound Error For FCP_CMND 0
No resource For IO 0
Channel transient conditions 0
Channel/Device not Online 0
Implicit aborts 0
I/Os aborted 0
I/O Inbound Error Statistics ...
PLDA Non-Compliance 0
Unassisted FCP_RSP 0
Unassisted FCP_DATA 0
Bad Unassisted FCP_DATA 0
Unassisted FCP_CMND 0
UA FCP With Bad OX_ID 0
UA FCP With Bad F_CTL 0
Bad FCP_XFER_RDY 0
FCP_XFER_RDY and SEST invalid 0
FCP_XFER_RDY in invalid state 0
Bad Length For FCP_RSP Frame 0
FCP_RSP in invalid state 0
Bad Category For FCP Frame 0
Bad data_ro In FCP_XFER_RDY 0
Late ABTS Responses Received 0
BA_RJT for ABTS received 0
Bad responses to ABTS 0
IO Underruns 0
Other I/O Event Statistics ...
Unassisted FCP_XFER_RDY 0
Retries For Resources 0
Host Programming Errors 0
I/O Overflow Errors 0
LKF On Outbound Sequence 0
ASN On Outbound Sequence 0
Frame TimeOut Errors 0
Unexpected OCMs for I/Os 0
New I/Os on ERQ At LDN 0
I/Os on SLL At LDN 0
LUP Events For I/O 0
ABTS Sent 0
ABTS Resent 0
Unaccepted ABTS 0
LDNs Before Sending ABTS 0
LUPs For Sending ABTS 0
IFCM While Aborting I/O 0
FCP_RSP While Aborting I/O Requests 0
RRQ sent 0
RRQ send failures 0
RRQ replies recvd 0
-------------------------------------------------------------
# fcmsutil /dev/td0 test 0xb0100 1024 3
WARNING: Can send only up to a max of 220 bytes, continuing
Sent a Test frame of size 220 bytes to nport_id 0x0b0100
Sent a Test frame of size 220 bytes to nport_id 0x0b0100
Sent a Test frame of size 220 bytes to nport_id 0x0b0100
# fcmsutil /dev/td0 test 0xb0100 220 3
Sent a Test frame of size 220 bytes to nport_id 0x0b0100
Sent a Test frame of size 220 bytes to nport_id 0x0b0100
Sent a Test frame of size 220 bytes to nport_id 0x0b0100
So, how to solve the two "Active" problem ?
Thanks !
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 03:20 AM
тАО11-23-2004 03:20 AM
Re: HBA (TL_adapter) problem
# ioscan -fnC disk
# insf -e
but still have the same issue: two "Active" in "spmgr display" output. Is that normal ?
I found that all mount points related to the EVA work fine.
So... ?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 03:41 AM
тАО11-23-2004 03:41 AM
Re: HBA (TL_adapter) problem
* There are totally 4 paths to the LUNs, they are supposed to be one "Active", one "Available" and the other two "Standby", but now, two of them are "Active", there must be something wrong !
Whwn you had 1 active, 1 available and 2 standby - that means your SecurePath configuration was NOT load balancing (meaning it does not use the bandwidth of the two HBAs concurrently). Someone probably set it to load balance which is the way to go .. that is why you have your LUNs now having 2 Paths on via 2 HBAs to one of the EVA controllers on which the LUNs are served.
Note that on the EVA.. each LUN can only be served on 1 EVA (HSV100/110) controller AND SecurePath Load Balancing means - you have your total number of HBA's on your server accessing the HSV controller which a LUN is assigned to. On the EVA end.. Controller/Path preferencing should be disabled so SecurePath manages which HSV controller to communicate with in accessing a LIN.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:32 AM
тАО11-23-2004 08:32 AM
Re: HBA (TL_adapter) problem
Yes, maybe the load balance setting made those two "Active", I will confirm this later.
But from the SAN side, we still have the connection problem from OVSAM ( OpenView Storage Area Manager ), the host icon in the diagram is blue instead of green, which indicates bad connection:
Error: Cannot connect to host: srs083
Details: No HostAgent service contacted on host: 10.125.20.83
May not be started (check hosts logs)
So, what should I do ?
Thanks !
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:37 AM
тАО11-23-2004 08:37 AM
Re: HBA (TL_adapter) problem
S076hostwatchdog
S790opendial
S800hostagent
for status and to check if all of the necessary server end processes are up.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:39 AM
тАО11-23-2004 08:39 AM
Re: HBA (TL_adapter) problem
yes, you are right, the two "Active" paths issue is because of the setting of "Load Balance" ! Thanks !
But please help to find out why the OVSAM still shows the connection problem. Or I need to do something to refresh the status of OVSAM diagram ?
Thanks again !
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:43 AM
тАО11-23-2004 08:43 AM
Re: HBA (TL_adapter) problem
You can try re-installing OVSAM or rebooting the client server.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:43 AM
тАО11-23-2004 08:43 AM
Re: HBA (TL_adapter) problem
Thanks !
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:46 AM
тАО11-23-2004 08:46 AM
Re: HBA (TL_adapter) problem
/sbin/rc3.d/S076hostwatchdog start
...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:46 AM
тАО11-23-2004 08:46 AM
Re: HBA (TL_adapter) problem
Thanks !
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2004 08:48 AM
тАО11-23-2004 08:48 AM
Re: HBA (TL_adapter) problem
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-24-2004 12:29 AM
тАО11-24-2004 12:29 AM
Re: HBA (TL_adapter) problem
Restarting the agent did help ! Now, I can reach the server from OVSAM.
But this morning, I found the error continued to add to the syslog.log last night:
Nov 23 17:05:43 srs083 vmunix: 0/2/1/0: Unable to access previously accessed device at nport ID 0xb0100.
However, I can access the data on LUNs without any problem, and OVSAM still can reach the server, everything looks ok.
How can I stop that message ? Thanks again !
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-24-2004 01:14 AM
тАО11-24-2004 01:14 AM
SolutionHowever, you may be able to minimize it. Have you read the release notes of SecurePath that suggests turning off EMS on the hardware Paths relating to EVA devices? It supposedly will reduce the amount of diagnostic and often times erronoeus messages coming out of STM/EMS...
Most "classical admins" who've no experience with StorageWorks SANS will often suggest HBA replacement -- which is wrong. Rememeber SecurePath/STorgaeWorks are to a certain degree not yet totally "friendly" to the HP-Us environment.
HTH.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-24-2004 02:02 AM
тАО11-24-2004 02:02 AM