- Community Home
- >
- Servers and Operating Systems
- >
- HPE ProLiant
- >
- ProLiant Servers (ML,DL,SL)
- >
- Re: proliant DL585 G7 Opteron 6238, vSphere 4.1 U3...
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-25-2013 04:31 AM
тАО02-25-2013 04:31 AM
proliant DL585 G7 Opteron 6238, vSphere 4.1 U3 die with PSOD PF 14
Hello,
we lost 2 servers last week and are waiting for a solution from HP/VMWare, there seem to be a lot of PSODs coming around.
cheers
chr3kdrs
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-25-2013 05:13 AM
тАО02-25-2013 05:13 AM
Re: proliant DL585 G7 Opteron 6238, vSphere 4.1 U3 die with PSOD PF 14
Hi
When new worlds are created, an integer is incremented. When this integer overflows, the kernel panics and fails with a purple diagnostic screen. In ESXi, new worlds are created for all processes because there is no service console operating system. Therefore, this number increments much faster in ESXi than ESX. In ESXi, this issue occurs when the system has been running for a very long time (over a year) without a reboot and has been actively creating processes. In ESX classic, it is almost impossible to hit this threshold.
It seems to be strange. VmWare is aware of this problem and published that it is solved in ESX/ESXi 4.1 U3 (the same as you have).
They also write that the issue does not affect ESXi 5.1. (May be it's to update?)
Anyway you can try this steps:
Rebooting the host restarts the counter and eliminates the risk of any failure.
highWID=$(vsish -e ls world | sed 's!/$!!' | sort -n | tail -n 1) let microFull=highWID/7400 echo ${microFull}
If this script returns a value close to 100,000 it is recommended to schedule a reboot.
It's not a good solution, but it's better than nothing.
More info here.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО02-26-2013 06:31 AM
тАО02-26-2013 06:31 AM
Re: proliant DL585 G7 Opteron 6238, vSphere 4.1 U3 die with PSOD PF 14
Hello xmate,
thank you for your response. I noticed I was not accurate: the vsphere version is ESX.
The other servers with this hardware configuration gave me a zero back from your script.
The two failed servers had their BIOS updated to the newest version from a HP tecnician and
started to crash with the PSOD.
regards
chr3
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО03-15-2013 03:38 AM
тАО03-15-2013 03:38 AM
Re: proliant DL585 G7 Opteron 6238, vSphere 4.1 U3 die with PSOD PF 14
HP released an advisory that is supposed to help:
I must commit that I'm bit sceptical whether this will solve our problems.
regards
chr3