02-24-2011 08:10 AM
RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
The system becomes unresponsive: it remains on the network but is inaccessible, and the console is flooded with messages:
INFO: task processname:5064 blocked for more than 120 seconds.
INFO: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Has anyone experienced this issue? We're clueless and all vendors are engaged.
Red Hat came back with a suggestion to change our I/O scheduler (elevator) from the default CFQ to deadline. Our backend SAN storage is an XP12K array.
I thought the I/O scheduler was just a suggested setting that depends on the array used and the load. Undoubtedly we should be using the deadline scheduler for the DB LUNs, but I don't believe the choice of scheduler should hang a Linux system.
I am still poring through several Bugzillas that seem to match the kernel messages.
TIA for any ideas, comments, leads, etc.
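For anyone who wants to confirm which elevator is actually active per device before and after such a change, here is a minimal shell sketch. It assumes the usual sysfs layout, where /sys/block/<dev>/queue/scheduler lists the available schedulers with the active one in square brackets. The function names and the optional sysfs-root argument are my own, not something from this thread.

```shell
#!/bin/sh
# Print the active elevator from a sysfs scheduler line, e.g.
# "noop anticipatory deadline [cfq]" prints "cfq".
active_scheduler() {
    printf '%s\n' "$1" | sed -n 's/.*\[\([^]]*\)\].*/\1/p'
}

# List "<device> <active elevator>" for every block device.
# The sysfs root defaults to /sys; the argument is hypothetical and
# exists only so the sketch can be tried against a scratch directory.
list_schedulers() {
    sysroot="${1:-/sys}"
    for f in "$sysroot"/block/*/queue/scheduler; do
        [ -r "$f" ] || continue
        dev="${f#"$sysroot"/block/}"
        dev="${dev%%/*}"
        printf '%s %s\n' "$dev" "$(active_scheduler "$(cat "$f")")"
    done
}
```

Calling list_schedulers with no argument on the affected host should show at a glance whether the DB LUNs are on cfq or deadline.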
02-24-2011 11:43 AM
Re: RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
02-24-2011 11:49 AM
Re: RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
Right now, there are several theories:
- Boot disk (SAS 15krpm disk) firmware
- The default I/O elevator, CFQ, caused it, so we changed the DB disks' elevators to deadline
Our DBAs have stopped running RMAN, and we're about to reach the same time window in which the hang hit us on each of the past 2 days.
02-24-2011 06:43 PM
Re: RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
YMMV depending on the intelligence of your SAN controller, but I've never used anything other than "elevator=noop" for anything resident on our SAN.
We're using Oracle 10g and 11g on 32GB and 64GB RHEL5 systems. In our SAN the controller is caching and queuing based on its knowledge of actual data placement, so beyond basic bunching of adjacent requests I believe it's counterproductive to ask Linux to try to optimize I/O for some virtual device that looks nothing like how the data is placed in reality. I want the requests down the channel ASAP so the SAN can get to work on them.
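As a rough illustration of switching one SAN device at runtime, rather than booting everything with elevator=noop, here is a small sketch. It relies on writing the scheduler name into /sys/block/<dev>/queue/scheduler. The function name set_scheduler and the sysfs-root argument are hypothetical, and a change made this way does not survive a reboot.

```shell
#!/bin/sh
# Switch one block device to the requested elevator, but only if the
# kernel offers it for that device.
# Usage (as root): set_scheduler sdb noop
# The third argument (sysfs root, default /sys) is hypothetical and is
# there only so the function can be exercised on a scratch directory.
set_scheduler() {
    dev="$1"
    want="$2"
    sysroot="${3:-/sys}"
    f="$sysroot/block/$dev/queue/scheduler"
    [ -w "$f" ] || { echo "cannot write $f" >&2; return 1; }
    # Drop the brackets around the active entry, split into one name
    # per line, then require an exact match on the requested name.
    if sed 's/[][]//g' "$f" | tr ' ' '\n' | grep -qxF -- "$want"; then
        echo "$want" > "$f"
    else
        echo "$want is not offered for $dev" >&2
        return 1
    fi
}
```

Doing it per device lets the local boot disk keep its default elevator while only the SAN LUNs go to noop or deadline.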
02-27-2011 04:18 PM
Re: RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
I didn't see that specific error, but I experienced similar issues, i.e. one "imp" rendering the server completely useless, and the same with md_resync on a smaller box. I can just say that ionice is your friend, and that not everything that should never happen on a Unix server is ruled out on Linux.
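To make the ionice suggestion concrete, here is a small wrapper sketch. It assumes the util-linux ionice tool, where class 3 is the idle class. Keep in mind that I/O priorities are honored by the CFQ elevator, so on devices switched to deadline or noop this is not expected to change anything. The function name run_low_io is hypothetical.

```shell
#!/bin/sh
# Run a command in the idle I/O class when ionice is available and
# usable; otherwise run the command unchanged.
run_low_io() {
    # Probe first: ionice may be missing, or setting the I/O class may
    # be refused in some environments.
    if command -v ionice >/dev/null 2>&1 && ionice -c 3 true 2>/dev/null; then
        ionice -c 3 "$@"
    else
        "$@"
    fi
}
```

A heavy import or backup job would then be started as run_low_io followed by the usual command line.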
02-28-2011 06:18 AM
Re: RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
I know that this isn't really the problem you're having, it's just a suggestion - but in general, I'd try to stay away from having big I/O problems in the first place...
02-28-2011 06:52 AM
Re: RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
After implementing the I/O scheduler change to "deadline" (which I doubted was really the crux of the matter), we again had a hang on our RHEL 5.6 system (not 5.5 after all; kernel 2.6.18-194.26.1). Unfortunately, we were not able to force a crash to capture a dump image, as we did not have sysrq turned on.
So the issue is puzzling.
We have now been advised to go to the 2.6.18-194.32.1 kernel. We did so, but moved the DB to a different server (same model/specs) with the updated kernel. So far so good.
On the old server, we will try to replicate the issue by running iozone and Swingbench or RHEL's stress suite.
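Since the missing sysrq setting cost a crash dump here, a minimal sketch of turning it on ahead of the next attempt may be useful. It assumes the standard /proc/sys/kernel/sysrq switch. The function names and the proc-root argument are hypothetical, and actually capturing a dump image still requires kdump (or a similar mechanism) to be configured separately.

```shell
#!/bin/sh
# Print the current sysrq setting (0 means disabled).
sysrq_value() {
    cat "${1:-/proc}/sys/kernel/sysrq"
}

# Enable all sysrq functions for the running kernel (needs root).
# To keep the setting across reboots, add "kernel.sysrq = 1" to
# /etc/sysctl.conf as well.
# The proc-root argument is hypothetical and only eases testing.
enable_sysrq() {
    procroot="${1:-/proc}"
    f="$procroot/sys/kernel/sysrq"
    [ -w "$f" ] || { echo "cannot write $f (are you root?)" >&2; return 1; }
    echo 1 > "$f"
}

# With sysrq enabled, Alt+SysRq+c on the console forces a crash.
# Do this only on a host where a panic is acceptable.
```

With that in place on the old server, a reproduced hang can be turned into a forced crash from the console instead of a blind power cycle.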
09-07-2011 02:43 AM
Re: RHEL 5.5 Oracle 11G Host Hangs during Heavy I/O - RMAN
Alzhy
Was this ever resolved? We have a very similar issue, the only difference being HDS storage.
Thanks
Mike.