02-13-2001 07:41 AM
server problems - need help
We first noticed the problems during our regular backup. I was backing up the server using a UNIX agent on our NT server, since it was handy. The backup would skip whole folders at a time and complain about others. Occasionally, /home would shut down completely and nobody could see it (a reboot fixed this every time). There was also a time when the system froze during shutdown or reboot, but that seems to have passed. I went back to the old backup system we used (fbackup, I think) and the server crashed again, so the backup software itself is not causing the problem. I have two ideas.
a) A corrupt file or files that the backup accesses, which brings down /home. I have deleted the files that the backup flagged as problems, but that did not fix it. On a side note, there was a folder that I wanted to delete, but the rm process kept hanging and I could not kill it. After a reboot, I could remove the folder. Maybe related?
b) The RAID drive that /home is mounted on is failing and causing problems. When the server reboots, it runs fsck and fixes the problems it finds. This last time it said everything was clean, but sometimes it finds problems. Does fsck check the mounted drives, and can it handle a RAID drive?
I have looked through the most recent syslog and can't find anything that jumps out at me. I will attach the last few. I know the UPS doesn't work, and that is in the log.
Any ideas on where to go from here would be greatly appreciated. I can give any info anyone wants to see (past logs, etc). Thanks in advance!!
02-13-2001 08:03 AM
Re: server problems - need help
From your syslog, it looks very much like a failing SCSI device, possibly the SCSI disk housing /home or the SCSI controller connected to it. Try to identify the device by looking in /dev/dsk for the file with major number 31 (0x1f) and minor number 0x000000, which is how the dev_t value 1f000000 splits.
If possible, run an mstm exercise or verify on the disk; this does not require downtime but will slow down the system.
If you have predictive support, run psconfig and check your predictive logs for any hardware errors.
Hope this helps. Regards.
Steven Sim Kok Leong
Brainbench MVP for Unix Admin
http://www.brainbench.com
==
Feb 7 08:52:01 munix vmunix: lsp: 5ae7d80
Feb 7 08:52:01 munix vmunix: bp->b_dev: 1f000000
Feb 7 08:52:01 munix vmunix: scb->io_id: 6faf2
Feb 7 08:52:01 munix vmunix: scb->cdb: 28 00 00 9f 25 80 00 00 80 00
Feb 7 08:52:01 munix vmunix: lbolt_at_timeout: 0, lbolt_at_start: 0
Feb 7 08:52:01 munix vmunix: lsp->state: 1
Feb 7 08:52:01 munix vmunix: lbp->owner: 60c5900
Feb 7 08:52:01 munix vmunix: bp->b_dev: 1f000000
Feb 7 08:52:01 munix vmunix: scb->io_id: 6faf4
Feb 7 08:52:01 munix vmunix: scb->cdb: 28 00 00 9f 29 80 00 00 80 00
Feb 7 08:52:01 munix vmunix: lbolt_at_timeout: 0, lbolt_at_start: 0
Feb 7 08:52:01 munix vmunix: lsp->state: 15
Feb 7 08:52:01 munix vmunix: scratch_lsp: 0
Feb 7 08:52:01 munix vmunix:
Feb 7 08:52:01 munix vmunix: SCSI: Ignoring redundant reset request -- lbolt: 7537735, bus: 0
Feb 7 08:52:01 munix vmunix: LVM: vg[1]: pvnum=0 (dev_t=0x1f000000) is POWERFAILED
Feb 7 08:52:01 munix vmunix: LVM: PV 0 has been returned to vg[1].
==
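As an aside on reading the excerpt above: the scb->cdb lines are raw SCSI command blocks. The following is a minimal Python sketch of decoding them, assuming the standard 10-byte READ(10) layout (opcode 0x28, a 4-byte big-endian logical block address in bytes 2-5, and a 2-byte transfer length in bytes 7-8). The helper name decode_read10 is hypothetical and not part of any HP-UX tool.

```python
def decode_read10(cdb_text):
    """Decode a READ(10) CDB given as space-separated hex bytes.

    Returns (logical_block_address, transfer_length_in_blocks).
    Assumes the standard SCSI READ(10) layout; other opcodes are rejected.
    """
    cdb = bytes.fromhex(cdb_text)
    if len(cdb) != 10:
        raise ValueError("expected a 10-byte CDB")
    if cdb[0] != 0x28:
        raise ValueError("not a READ(10) command (opcode 0x28)")
    lba = int.from_bytes(cdb[2:6], "big")      # bytes 2-5: starting block
    length = int.from_bytes(cdb[7:9], "big")   # bytes 7-8: block count
    return lba, length


if __name__ == "__main__":
    # The two CDBs that appear in the syslog excerpt.
    for text in ("28 00 00 9f 25 80 00 00 80 00",
                 "28 00 00 9f 29 80 00 00 80 00"):
        lba, length = decode_read10(text)
        print(f"READ(10) lba={lba} blocks={length}")
```

Both commands in the excerpt are ordinary reads of 128 blocks, which fits the picture of normal disk I/O timing out rather than an unusual command being rejected.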
02-13-2001 09:50 AM
Re: server problems - need help
http://us-support2.external.hp.com/cki/bin/doc.pl/sid=6fd514230202de19ee/screen=ckiDisplayDocument?docId=200000052999502
This may be more helpful: http://us-support2.external.hp.com/cki/bin/doc.pl/sid=6fd514230202de19ee/screen=ckiDisplayDocument?docId=200000053002056
A patch that supposedly contains a fix for this (HP-UX 10.20 - PHKL_22690):
http://us-support2.external.hp.com/cki/bin/doc.pl/sid=a847f5020353437c66/screen=ckiDisplayDocument?docId=200000053104320
A patch that supposedly contains a fix for this (HP-UX 11.0 - PHKL_22941):
http://us-support2.external.hp.com/cki/bin/doc.pl/sid=512d18cd0ad2f0beb0/screen=ckiDisplayDocument?docId=200000054127026
02-13-2001 10:10 AM
Re: server problems - need help
There are several entries that do not look good:
- btlan4 has trouble negotiating autosense
- the / filesystem is full
- PV 0 is losing power
- the rest of the SCSI errors do not look good either
I would recommend:
- fix the free-space problem on / first!
- check disk 0 for loose power cables
- check the overall SCSI cable length
After this, look at btlan4, checking especially for half/full-duplex mismatches, to reestablish your network backup.
Hope this helps
Volker
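To see how often the physical volume is dropping out, the POWERFAILED lines can be tallied from a saved copy of the syslog. This is a minimal Python sketch; the regular expression is modelled only on the single LVM line quoted earlier in this thread, so other HP-UX releases may word the message differently, and count_powerfail is a hypothetical helper name.

```python
import re
from collections import Counter

# Pattern modelled on the line quoted in the thread:
#   LVM: vg[1]: pvnum=0 (dev_t=0x1f000000) is POWERFAILED
POWERFAIL = re.compile(
    r"LVM: vg\[(\d+)\]: pvnum=(\d+) \(dev_t=(0x[0-9a-fA-F]+)\) is POWERFAILED"
)


def count_powerfail(lines):
    """Count POWERFAILED events per (volume group, PV number, dev_t)."""
    counts = Counter()
    for line in lines:
        match = POWERFAIL.search(line)
        if match:
            key = (int(match.group(1)), int(match.group(2)), match.group(3))
            counts[key] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "Feb 7 08:52:01 munix vmunix: LVM: vg[1]: pvnum=0 (dev_t=0x1f000000) is POWERFAILED",
        "Feb 7 08:52:01 munix vmunix: LVM: PV 0 has been returned to vg[1].",
    ]
    for (vg, pv, dev), n in count_powerfail(sample).items():
        print(f"vg[{vg}] pv {pv} dev_t {dev}: {n} POWERFAILED event(s)")
```

In practice the lines would come from reading the syslog file instead of the inline sample. A count that keeps growing for the same dev_t points at that one disk or its cabling.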
02-13-2001 11:53 AM
Re: server problems - need help
The dev value gives you the hex for the actual device:
(1f = 31) and major number 31 = disk
So 1f000000 shows it's c0t0d0 (you drop the last two 0s of the minor number)
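The decoding above can be sketched in Python. This assumes the 32-bit HP-UX dev_t layout of an 8-bit major number followed by a 24-bit minor number, and the legacy disk minor layout of an 8-bit controller instance, a 4-bit target, a 4-bit LUN and 8 option bits. The function name decode_dev_t is illustrative only and is not a system API.

```python
def decode_dev_t(dev_hex):
    """Split a hex dev_t such as '1f000000' into major, minor and cXtYdZ.

    Assumes: major = top 8 bits, minor = low 24 bits, and a minor layout
    of instance (8 bits), target (4 bits), LUN (4 bits), options (8 bits).
    """
    dev = int(dev_hex, 16)               # accepts '1f000000' or '0x1f000000'
    major = (dev >> 24) & 0xFF
    minor = dev & 0xFFFFFF
    instance = (minor >> 16) & 0xFF      # controller instance -> cX
    target = (minor >> 12) & 0xF         # SCSI target ID      -> tY
    lun = (minor >> 8) & 0xF             # LUN                 -> dZ
    return {
        "major": major,
        "minor": minor,
        "name": f"c{instance}t{target}d{lun}",
    }


if __name__ == "__main__":
    info = decode_dev_t("1f000000")
    print(info["major"], hex(info["minor"]), info["name"])
```

The result can be compared with the major and minor numbers that ls -l shows for the files in /dev/dsk.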
Now my question is this: is this disk by any chance in a disk array connected by a fiber adapter?
Your problem could be the actual disk, or it could be caused by a bad fiber adapter card. I received these same timeout messages, and the cause turned out to be bad fiber cards. You may want to check both of these possibilities. The fact that you're having sporadic problems makes me think it's the fiber card; if it were the drive, you would be seeing more than timeouts.
/rcw
02-13-2001 12:26 PM
Re: server problems - need help
02-14-2001 12:17 PM
Re: server problems - need help
I'd check the patch level of the C720 SCSI drivers on your system, and also check the SCSI cabling.
If the errors persist after that, replace the drive.