- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- Re: Compilation Intensive Load rx2660 vs rx5670 pe...
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Discussions
Discussions
Forums
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-30-2008 01:46 PM
тАО10-30-2008 01:46 PM
Are there any recommendations anyone has on optimizing for "aCC" and "ecom" programs when they are apparently not memory bound but bound by cpu performance and perhaps disk io?
I've been unable to build on a dual core 1398MHz rx2660 in less than 6 hours what a slightly beefier server dual-core 1500 MHz rx5670 can build in 1.5 hours. The latter machine actually has less ram installed.
I guess my question is: Is the difference between these two models hardware and io performance that pronounced?
If not should I look to tuning and / or hardware failure as a remedy for the differences.
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-30-2008 02:00 PM
тАО10-30-2008 02:00 PM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
How many sources?
Are your source and object directories on NFS?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-30-2008 08:03 PM
тАО10-30-2008 08:03 PM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
No sir, there are many factors, it requres a lot of deep analysis.
Regards,
Gokul Chandola
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-31-2008 07:04 AM
тАО10-31-2008 07:04 AM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
Number of Source Files: 8218
# find . -type f -name '*.c*' | wc -l
8218
NFS: No nfs is not used for accessing any files used except when a build completes, compilation is performed on a local filesystem.
Building 64 bit code using appropriate flags for that purpose.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-31-2008 07:12 AM
тАО10-31-2008 07:12 AM
Solution- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-31-2008 08:41 AM
тАО10-31-2008 08:41 AM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
I had been tuning the rx2660 dbc_max_pct and dbc_min_pct in the following ranges with no difference in compilation time, it's fixed at 6 hours with each.
* defaults 50/5 max/min
* 20/20 (as configured on the dev machine rs5670)
* 90/5 (most recent setting)
Top down make time was unaffected with each, though with the middle value other tunings were applied, essentially attempting to slavishly match the 5670's values. I suspect that the 5670 had been tuned by HP hosts during a 64 bit porting seminar some years ago.
I'm going to do a complete reset to default kctune'd values to establish a baseline time later today and see what information I can get with those values then apply some different values for whatever anyone suggests here.
Seeing about getting a copy of glance as well, but it's hard to know when I'll be able to run that tool for this.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-31-2008 10:03 AM
тАО10-31-2008 10:03 AM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
Also what version of aC++ do you have?
You didn't mention how many CPUs cores you have and whether you are doing parallel makes.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-31-2008 10:18 AM
тАО10-31-2008 10:18 AM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
Caliper is something that is available, but I honestly don't know what to look for with it.
aCC version: HP C/aC++ Developer's Bundle C.11.23.15.2
The builds are non-parallel makes, gmake is performing them, and enabling extra jobs causes compilation to fail, so It's not running parallel makes on the Dev machine.
Both machines have single cpu's with dual cores. the rx5670 has "beefier" MHz values, but I believe the rx2660 has a family 32 and the 5670 has a family 31 model 1 revision 5 (older cpu, pre 9000 Itanium 2) which makes me wonder if one can downgrade families for better performance. :)
Also, I HyperThreading is enabled on both (not 100% sure on the 5670 though) toggling threading on the 2660 didn't have a noticeable effect. Since the jobs are not parallel though, I don't know that the additional cpu's will help at all, would just make the system more responsive while one cpu is loaded.
While the compiles are rolling 100% utilizations alternate between cpu 0 and 1 with the relaxed cpu hovering at 10-30% values. As top reports things anyway.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-31-2008 10:33 AM
тАО10-31-2008 10:33 AM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
This is not the version I want and requires a secret decoder ring to get A.06.20. What does "aCC -V" show?
And you didn't mention what opt level you were using?
>The builds are non-parallel makes, gmake is performing them and enabling extra jobs causes compilation to fail, so It's not running parallel makes on the Dev machine.
Then you aren't using the resources you have. Why does it fail? (Out of swap or bad makefiles?) You probably need more memory/swap.
>HyperThreading is enabled on both
I didn't think you could do that on 11.23 and you probably don't want it.
>I don't know that the additional CPUs will help at all
You do multiple compiles at once.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
тАО10-31-2008 11:35 AM
тАО10-31-2008 11:35 AM
Re: Compilation Intensive Load rx2660 vs rx5670 performance
aCC: HP C/aC++ B3910B A.06.20 [May 13 2008]
Optimization Level: -fast flag is used for most of the .o files produced and the -D64 and -W*** flags (which aren't optimizations per se) no -O# optimizations are being used.
Memory/Swap:
Physical Memory: 4079.1 MB
Real Memory: Active/Total ~ 300 MB/500MB
Virtual Memory: Active/Total ~ 700MB/800MB
swap is 8 Gb and approximately 500-700 Mb of that is used.
So how do I make the machine forego using swap, if it is thrashing on swap and seems to have enough physical memory to get by just fine?
Threads: initial runs were with threading disabled, threading was enabled later in the game to see if it would affect performance either way, they haven't. I (admittedly ignorantly) suspect that 11.23 kernel might be ignoring the EFI set threading behavior anyway since it doesn't measurably change behavior or introduce breaks.
Multiple Compiles: The product built with it's existing makefiles do not support multiple job makes (apologies, thought I had written that in to an earlier posting) so that's not an option, but it does rule out the other machine being run with multiples to explain the difference.