- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- fbackup kills system
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-06-2007 10:49 PM
02-06-2007 10:49 PM
I have an issue whereby my HPUX11.11 L1000 occasionally enters a state does not respond at the GSP console or to remote sessions. My only recourse is to reset the system from the GSP. It happens about 1 per week, but I haven't established a firm pattern yet. From what I can tell, the last thing the system does is a cron'd fbackup job the does not complete.
There is nothing that provides a hint of what may be happening in /var/adm/syslog, or the backup log, or the chassis error logs. The system is patched to Goldapps/Goldbase June 2006 and hardware enablement to March 2003.
Any pointers to help with diagnosis would be appreciated.
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-06-2007 10:57 PM
02-06-2007 10:57 PM
Re: fbackup kills system
I suggest, check crontab entries by
crontab -l.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-06-2007 10:58 PM
02-06-2007 10:58 PM
Re: fbackup kills system
Waht is the exact command that runs this fbackup.
have you tryed looking in dmesg ?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-06-2007 11:42 PM
02-06-2007 11:42 PM
Re: fbackup kills system
The fbackup is a standard SAM configured unattended backup.
00 2 * * 2-6 /usr/sam/lbin/br_backup DAT FULL Y /dev/rmt/0m /var/sam/graphPCAa23177 root Y 1 N > /tmp/SAM_br_msgs 2>&1 #sambackup
dmesg just give me output since the last boot, with no problems evident.
@remsha - thanks, but I was aware how you list cron jobs (my fbackup is cron'd)
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-06-2007 11:53 PM
02-06-2007 11:53 PM
Re: fbackup kills system
Also, does once a week mean every week at the same day, or could it be for example tuesday on one week and thursday on the next ? Does it happpen outside of peak hours as well ?
Cheers
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-07-2007 12:01 AM
02-07-2007 12:01 AM
Re: fbackup kills system
> Strange.. have you tried leaving a session (remote or console) open at all times ? Any clues there ?
Console shows no hints in console history and won't bring up the login prompt, but a remote session might remain logged in and allow me some view - good idea, i'll try that.
> Also, does once a week mean every week at the same day, or could it be for example tuesday on one week and thursday on the next ? Does it happpen outside of peak hours as well ?
The last two times it happened during the nightly backup (between 2 and 3am), but not on the same day. I'm thinking there's something being "consumed" like file handles and the backup consumes a big chunk of them. I cannot point to anything that has changed over the last couple of months to bring about the more frequent occurance.
I've patched it to HW Enablement Sept 2005 to see if that helps. I was hoping someone else might have experienced similar symptoms.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-07-2007 12:06 AM
02-07-2007 12:06 AM
Re: fbackup kills system
has the nettl traced anything?
e.g.
# nettl -status all
Has the machine dumped anything to /var/adm/crash,
or something in /var/tombstones?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-07-2007 12:15 AM
02-07-2007 12:15 AM
Re: fbackup kills system
nothing in /var/adm/crash since 2003 either
/var/adm/tombstones contains tombstones from each of my resets, but no errors.
I think the system is up and running, just not responding to console or network logins.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-07-2007 12:45 AM
02-07-2007 12:45 AM
SolutionMemory is something that 'fbackup' craves. I'd begin by examining 'swapinfo -tam' and running 'vmstat' looking for page-outs (the 'po' column) that are in double-digits.
If swap utilization is high and you see significant memory pressure you may have found your reason.
'fbackup' uses shared memory segments (see 'shmmax' in your kernel) to buffer the files it is copying. 'fbackup' can have up to six (6) reader processes running at a time. I assume that you have a default 'fbackup' configuration file in place because you are doing a standard SAM backup.
If you have ever killed (-9) any 'fbackup' session, you probably have left orphaned memory segments lying about, tying up part of your memory. A 'ps -ef|grep fbackup' might expose some old processes lingering. If so, a reboot is the easist penalty to pay.
Regards!
...JRF...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-07-2007 01:20 AM
02-07-2007 01:20 AM
Re: fbackup kills system
I agree with James that this may be a memory starvation issue as fbackup can use hundreds of megs of RAM to store all the filenames and this may be a lot more than is available, causing excessive paging as well as a massive slowdown in the backup speed. Depending on memory fragmentation due to orphaned shared memory segments and other process memory pressure, there may be a lot of virtual memory paging. This paging can indeed cause loss of console or session prompts -- well, eventually they will respond but it may be several minutes.
So check ipcs -bmop to see if there are orphaned segments and clear them. Then run fbackup during the day for a few minutes (until the tape starts moving) and check ipcs -bmop to see just how much shared memory is needed for fbackup. Then terminate fbackup (NEVER use kill -9) with a straight kill or kill -15. If root has been using kill -9 a lot, that is a major source of problems. kill -9 is a last resort and you must cleanup all the program's resources by hand (every time).
You might want to bring HWE patches up to June 2006 level too.
Bill Hassell, sysadmin
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-07-2007 07:54 PM
02-07-2007 07:54 PM
Re: fbackup kills system
I had set (a long time ago) the dbc_max_pct down to 10% to avoid fbackup hogging hundreds of MB of RAM, so that should help there.
I've put HWE Sept 2005 on so I'll let it run for a couple of days while monitoring the above and see how the patches help.