- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- FYI: Newly-identified HP-UX 11i v1 kernel bug
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-20-2010 07:09 AM
тАО11-20-2010 07:09 AM
FYI: Newly-identified HP-UX 11i v1 kernel bug
The panics occurred in psl_search with an invalid pointer dereference.
From the initial crash dumps it was possible to determine that the list of memory regions (pregions) in a virtual address space associated with the iomappers table had been corrupted, but it wasn't possible to tell where the corruption had originated- in a third-party driver or in the kernel itself.
Things would hum along fine for a while, minutes, or hours, or possibly even days, but once that free VAS was allocated to something else and reinitialized, the next time the iomappers list was scanned and the pregion skip list for that one VAS was searched, the next link in the list went off to 0x00, resulting in a data page fault panic.
The first round of debug code captured timestamps and the stack trace of the creator of each VAS. All this looked normal and the new VAS was created properly, so it wasn't possible in that crash dump to tell where the corruption originated - only how long ago the VAS which caused the panic had been freed. In the first debugged crash, it was about 3 minutes earlier, so the culprit was long gone.
The next debug code modified the freevas() call and was set up panic the system whenever an attempt was made to free a VAS that was still part of the iomappers list.
Unfortunately the first crash that occurred about a month after this debug code was provided was on a system that I had overlooked modifying the LIF AUTO file, so during a subsequent reboot it had reverted to the old debug kernel. Needless to say I fixed that problem immediately, grumbling the whole time.
One month and two days later, the debug-induced panic finally happened, and it nailed the bug right to the wall.
WTEC was able to identify two missing lines of code in the back-out from an ENOMEM error leg during the duplication of a VAS in a fork() system call. The duplicated iomapper entry was created during the dupvas procedure, but was not removed during the backout from the error condition.
The problem has been referred to the support lab to go through the patch development process.
So, if you have a data page fault panic which tracks back to psl_search, ask about this case.
Another moral of the story - make sure that your system is configured with enough crash dump space, and enough space in /var/adm/crash or wherever you set it in /etc/rc.config.d/savecrash, to store a full crash dump, even if you have to buy more disks. It would have been impossible to find this problem if it weren't for those steps.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-21-2010 12:13 PM
тАО11-21-2010 12:13 PM
Re: FYI: Newly-identified HP-UX 11i v1 kernel bug
I hope the patching won't create such hiccups.
Shibin
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО11-23-2010 05:40 AM
тАО11-23-2010 05:40 AM
Re: FYI: Newly-identified HP-UX 11i v1 kernel bug
Here's the stack trace of a typical crash dump resulting from the iomapper entry which was earlier corrupted:
panic+0xa0
report_trap_or_int_and_panic+0x94
trap+0xef8
thandler+0xd24
psl_search+0x4
findpreg+0x74
iomap_search+0x38
io_map_internal+0x1f8
kernel_iomap+0x38
iomemrw+0x180
iomem_read+0x14
spec_rdwr+0x10c
vno_rw+0x1ac
read+0x184
syscall+0x204
syscallinit+0x55c
The "psl_search" call is searching the "pregion skip list" while searching for an iomapper.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО01-11-2011 02:03 PM
тАО01-11-2011 02:03 PM
Re: FYI: Newly-identified HP-UX 11i v1 kernel bug
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО01-11-2011 02:13 PM
тАО01-11-2011 02:13 PM
Re: FYI: Newly-identified HP-UX 11i v1 kernel bug
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-28-2011 08:33 AM
тАО02-28-2011 08:33 AM
Re: FYI: Newly-identified HP-UX 11i v1 kernel bug
Patch Name: PHKL_41910
Patch Description: s700_800 11.11 IO mapping
Creation Date: 11/02/22
Post Date: 11/02/25
Hardware Platforms - OS Releases:
s700: 11.11
s800: 11.11
Products: N/A
Filesets:
ProgSupport.PAUX-ENG-A-MAN
OS-Core.CORE2-KRN
OS-Core.CORE2-KRN
Automatic Reboot?: Yes
Status: General Release