Are you also using the "bcm" cards as cluster inter-connect? Or do you use Memorychannel as an interconnect medium.
If one was to ask me to build a NFS cluster, i would use:
2 Gigabit networkcard (bcm) with netrain, per node.
MemoryChannel interconnect, as thats 70% faster than GB.
Performance would also degrade when the "owner" of the filesystem the NFS share is on, is not the active NFS server. File access would than first pass the cluster interconnect. Which you would notice performance wise.
Watch, Think and Tinker.