BladeSystem - General
1748184 Members
4095 Online
108759 Solutions
New Discussion юеВ

Re: Slow BL645c node through Infiniband

 

Slow BL645c node through Infiniband

We have a blade with 16 BL465c nodes connected through a Voltaire switch.
One of them had the motherboard replaced and now we noted it runs slower than the other nodes, around 20-25% slower, but only in multinode jobs.
So we started experimenting:
- With jobs that run inside the node, it behaves exactly as the other nodes.
- iLO & BIOS are the same version.
- We swapped Mezzanine cards between "the node" and a "normal" and the problems stays at the node.
- We swapped bays to see if the problem was at the enclosure, but the problem stays at the node too.

So it looks the problem is in the motherboard, someway, or the connection to the Mezzanine card. We've compared the transmission though Infiniband with collectl and it shows that drop of 20% in the transmission:

#HCA KBIn PktIn SizeIn KBOut PktOut SizeOut Errors
0 23730 15848 1 23689 15835 1 0
0 23487 15701 1 23522 15718 1 0
0 24021 16014 1 23915 15972 1 0
0 24138 16078 1 23911 15976 1 0
0 23455 15674 1 23556 15726 1 0
0 19665 13283 1 19924 13416 1 0
0 13640 9066 1 13605 9052 1 0
0 19898 13396 1 20180 13548 1 0
0 22485 14947 1 22299 14860 1 0
0 23293 15452 1 22870 15271 1 0
0 23500 15704 1 23599 15761 1 0
0 18080 12097 1 18237 12178 1 0
0 23663 15778 1 23426 15685 1 0
0 23695 15811 1 23611 15787 1 0
0 26494 17822 1 27162 18156 1 0
0 26702 17705 1 26042 17408 1 0
0 23845 15907 1 23781 15894 1 0
0 23065 15486 1 23457 15675 1 0
0 22316 14173 1 17785 11985 1 0
0 23777 15917 1 24177 16127 1 0
0 24563 16537 1 24729 16640 1 0
0 22804 15079 1 22585 15004 1 0

yei20
0 29449 19836 1 30230 20180 1 0
0 34207 22994 1 34893 23287 1 0
0 29751 19953 1 30065 20080 1 0
0 26190 17760 1 27606 18423 1 0
0 31813 21245 1 31725 21204 1 0
0 29496 19792 1 30044 20061 1 0
0 34131 22993 1 35194 23510 1 0
0 14868 9954 1 14968 10002 1 0
0 23639 15904 1 24293 16214 1 0
0 34303 23078 1 35023 23427 1 0
0 29756 19949 1 30067 20107 1 0
0 30751 20801 1 31788 21302 1 0
0 32752 21921 1 33397 22228 1 0
0 27596 17866 1 23892 16047 1 0
0 29323 19719 1 30084 20077 1 0
0 34359 23108 1 35005 23418 1 0
0 29810 19971 1 30099 20110 1 0
0 30319 20550 1 31879 21293 1 0
0 27610 18439 1 27461 18351 1 0
0 29517 19839 1 29991 20051 1 0
0 31908 21608 1 33411 22315 1 0
0 31921 21336 1 31776 21253 1 0

Has anyone experienced anything like this? Solved it? Any hint? Any way to check it deeper? Or should I open a case?

Thanks in advance
8 REPLIES 8
rick jones
Honored Contributor

Re: Slow BL645c node through Infiniband

Were there perhaps some BIOS tweaks lost when the motherboard was swapped?
there is no rest for the wicked yet the virtuous have no pillows

Re: Slow BL645c node through Infiniband

Oh, yes! We've already checked it too. As far as we've were able to check both BIOS as equal and have the same setup too.
rick jones
Honored Contributor

Re: Slow BL645c node through Infiniband

Then unless other, more fruitful suggestions are forthcoming here, you should go ahead and excercise your support contract(s) and open a case.
there is no rest for the wicked yet the virtuous have no pillows
rick jones
Honored Contributor

Re: Slow BL645c node through Infiniband

This is IPoIB yes? Are the interrupt assignments the same between a fast system and the now slow one?
there is no rest for the wicked yet the virtuous have no pillows
MarkSeger
Frequent Advisor

Re: Slow BL645c node through Infiniband

Good point about interrupts. Since you're already using collectl it's easy enough to look at the interrupt distribution by cpu by adding 'j' to your -s switch. While you're t it you might also consider using -sC which will break out the CPU load by individual CPU and also show the type of load. Perhaps one system is spending more time processing interrupts than the other? or maybe they're not being distributed across the multiple ones.
-mark

Re: Slow BL645c node through Infiniband

OK! My (our) bad. after checking all what was said here we came back to the BIOS again and we saw the in thw first tab there was a discrepancy.

I can't remember now the exact tab or name, but it was in teh 1st tab and it was about some kind of power management option. I don't know exactly what it did , but we changed it to something like "OS controlled" and now it looks it is running again at the same speed as the other nodes.

Thank you all!

Re: Slow BL645c node through Infiniband

Opps, explained above...
rick jones
Honored Contributor

Re: Slow BL645c node through Infiniband

Making the not-always-true assumption that iLO is iLO, looking at th eiLO on a DL785 suggests that it would be the "Power Management" tab at the top followed by "Settings" on the left. IIRC the default is "HP Dynamic Power Savings Mode" and often when one is in a "Damn the Watts! Full Speed Ahead!" mindset :) it can be set to "HP Static High Performance Mode" or in this case, when the OS is smart enough into "OS Control Mode"

This all controls who/how processor "p-states" (the "Processor States" selection on the left of the "Power Management" page get set. P0 means highest performance and highest power consumption, P3 means lowest performance and lowest power consumption.

My experience as an end-user is that Dynamic Power Savings Mode is when the BIOS takes its best guess as to what mode should be selected for each core, and it will go between P0 and P3 making no stops at either P1 or P2. Here and there under OS Control Mode I've seen cores in all four p-states. In Static High Performance mode they are locked into P0 state.

there is no rest for the wicked yet the virtuous have no pillows