1825787 Members
2312 Online
109687 Solutions
New Discussion

bnx2x driver crash

 
jzhu
New Member

bnx2x driver crash

We have 2 interfaces configured with bonding, we are seeing bnx2x crash in dmesg:

 

9450:[21631713.887893] bnx2x: [bnx2x_attn_int_deasserted3:4297(eno49)]MC assert!
9451:[21631713.887954] bnx2x: [bnx2x_mc_assert:716(eno49)]XSTORM_ASSERT_LIST_INDEX 0x2
9452:[21631713.888014] bnx2x: [bnx2x_mc_assert:732(eno49)]XSTORM_ASSERT_INDEX 0x0 = 0x00000000 0xc9e0c9e0 0x2e382e38 0x00010057
9453:[21631713.888107] bnx2x: [bnx2x_mc_assert:746(eno49)]Chip Revision: everest3, FW Version: 7_10_51
9454:[21631713.888171] bnx2x: [bnx2x_attn_int_deasserted3:4303(eno49)]driver assert
9455:[21631713.888224] bnx2x: [bnx2x_panic_dump:914(eno49)]begin crash dump -----------------
9456:[21631713.888284] bnx2x: [bnx2x_panic_dump:924(eno49)]def_idx(0x34c0)  def_att_idx(0x3292)  attn_state(0x1)  spq_prod_idx(0xd8) next_stats_cnt(0x34b1)
9457:[21631713.888382] bnx2x: [bnx2x_panic_dump:929(eno49)]DSB: attn bits(0x0)  ack(0x1)  id(0x0)  idx(0x3292)
9458:[21631713.888452] bnx2x: [bnx2x_panic_dump:930(eno49)]     def (0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x7f3f 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0)  igu_sb_id(0x0)  igu_seg_id(0x1) pf_id(0x0)  vnic_id(0x0)  vf_id(0xff)  vf_valid (0x0) state(0x1)
9459:[21631713.888667] bnx2x: [bnx2x_panic_dump:981(eno49)]fp0: rx_bd_prod(0x829b)  rx_bd_cons(0xd4)  rx_comp_prod(0xd36b)  rx_comp_cons(0xd19f)  *rx_cons_sb(0xd19f)
9460:[21631713.888769] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0x32c0)  last_max_sge(0x2edf)  fp_hc_idx(0xffe9)
9461:[21631713.888847] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp0: tx_pkt_prod(0x244e)  tx_pkt_cons(0x244e)  tx_bd_prod(0xc459)  tx_bd_cons(0xc458)  *tx_cons_sb(0x244e)
9462:[21631713.888950] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp0: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9463:[21631713.889043] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp0: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9464:[21631713.889137] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0xffe9 0x0)
9465:[21631713.889198] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9477:[21631713.889284] bnx2x: [bnx2x_panic_dump:981(eno49)]fp1: rx_bd_prod(0x9a7a)  rx_bd_cons(0x8b3)  rx_comp_prod(0x845c)  rx_comp_cons(0x8290)  *rx_cons_sb(0x8290)
9478:[21631713.889385] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0x7440)  last_max_sge(0x7060)  fp_hc_idx(0x3fc7)
9479:[21631713.889464] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp1: tx_pkt_prod(0xbec5)  tx_pkt_cons(0xbec3)  tx_bd_prod(0x1d85)  tx_bd_cons(0x1d80)  *tx_cons_sb(0xbec3)
9480:[21631713.889579] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp1: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9481:[21631713.889674] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp1: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9482:[21631713.889769] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0x3fc7 0x0)
9483:[21631713.889830] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9495:[21631713.889920] bnx2x: [bnx2x_panic_dump:981(eno49)]fp2: rx_bd_prod(0xd5dc)  rx_bd_cons(0x417)  rx_comp_prod(0xfad5)  rx_comp_cons(0xf909)  *rx_cons_sb(0xf909)
9496:[21631713.890020] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0xb640)  last_max_sge(0xb27c)  fp_hc_idx(0x39b9)
9497:[21631713.890097] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp2: tx_pkt_prod(0xf5eb)  tx_pkt_cons(0xf5eb)  tx_bd_prod(0x1bb4)  tx_bd_cons(0x1bb3)  *tx_cons_sb(0xf5eb)
9498:[21631713.890200] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp2: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9499:[21631713.892868] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp2: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9500:[21631713.898213] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0x39b9 0x0)
9501:[21631713.898270] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9513:[21631713.901029] bnx2x: [bnx2x_panic_dump:981(eno49)]fp3: rx_bd_prod(0xebd0)  rx_bd_cons(0xa0b)  rx_comp_prod(0x8304)  rx_comp_cons(0x8137)  *rx_cons_sb(0x8137)
9514:[21631713.906471] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0x480)  last_max_sge(0xbd)  fp_hc_idx(0x5495)
9515:[21631713.909290] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp3: tx_pkt_prod(0x72fd)  tx_pkt_cons(0x72fd)  tx_bd_prod(0xf310)  tx_bd_cons(0xf30f)  *tx_cons_sb(0x72fd)
9516:[21631713.914996] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp3: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9517:[21631713.920825] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp3: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9518:[21631713.926823] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0x5495 0x0)
9519:[21631713.926882] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9531:[21631713.929931] bnx2x: [bnx2x_panic_dump:981(eno49)]fp4: rx_bd_prod(0xca60)  rx_bd_cons(0x899)  rx_comp_prod(0xc0b8)  rx_comp_cons(0xbeec)  *rx_cons_sb(0xbeec)
9532:[21631713.936006] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0x80)  last_max_sge(0xfc95)  fp_hc_idx(0xc1df)
9533:[21631713.939113] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp4: tx_pkt_prod(0xd736)  tx_pkt_cons(0xd736)  tx_bd_prod(0xe7f5)  tx_bd_cons(0xe7f4)  *tx_cons_sb(0xd736)
9534:[21631713.945239] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp4: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9535:[21631713.951427] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp4: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9536:[21631713.957612] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0xc1df 0x0)
9537:[21631713.957670] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9549:[21631713.960802] bnx2x: [bnx2x_panic_dump:981(eno49)]fp5: rx_bd_prod(0xece6)  rx_bd_cons(0xb1f)  rx_comp_prod(0x7e6d)  rx_comp_cons(0x7ca1)  *rx_cons_sb(0x7ca1)
9550:[21631713.967229] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0x3c80)  last_max_sge(0x38a2)  fp_hc_idx(0xa03f)
9551:[21631713.970327] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp5: tx_pkt_prod(0x2636)  tx_pkt_cons(0x2636)  tx_bd_prod(0xccc5)  tx_bd_cons(0xccc4)  *tx_cons_sb(0x2636)
9552:[21631713.976421] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp5: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9553:[21631713.982533] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp5: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9554:[21631713.988724] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0xa03f 0x0)
9555:[21631713.988782] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9567:[21631713.991897] bnx2x: [bnx2x_panic_dump:981(eno49)]fp6: rx_bd_prod(0x8c6)  rx_bd_cons(0x6ff)  rx_comp_prod(0x747a)  rx_comp_cons(0x72ae)  *rx_cons_sb(0x72ae)
9568:[21631713.998043] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0xb00)  last_max_sge(0x734)  fp_hc_idx(0xce5d)
9569:[21631714.001139] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp6: tx_pkt_prod(0x78bc)  tx_pkt_cons(0x78bc)  tx_bd_prod(0x397e)  tx_bd_cons(0x397d)  *tx_cons_sb(0x78bc)
9570:[21631714.007243] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp6: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9571:[21631714.013360] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp6: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9572:[21631714.019560] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0xce5d 0x0)
9573:[21631714.019622] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9585:[21631714.022739] bnx2x: [bnx2x_panic_dump:981(eno49)]fp7: rx_bd_prod(0x2faa)  rx_bd_cons(0xde3)  rx_comp_prod(0x6ab9)  rx_comp_cons(0x68ed)  *rx_cons_sb(0x68ed)
9586:[21631714.028892] bnx2x: [bnx2x_panic_dump:984(eno49)]     rx_sge_prod(0xf140)  last_max_sge(0xed6b)  fp_hc_idx(0x1085)
9587:[21631714.031994] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp7: tx_pkt_prod(0x2e3d)  tx_pkt_cons(0x2e38)  tx_bd_prod(0xc9ea)  tx_bd_cons(0xc9dd)  *tx_cons_sb(0x2e38)
9588:[21631714.038096] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp7: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9589:[21631714.044216] bnx2x: [bnx2x_panic_dump:1001(eno49)]fp7: tx_pkt_prod(0x0)  tx_pkt_cons(0x0)  tx_bd_prod(0x0)  tx_bd_cons(0x0)  *tx_cons_sb(0x0)
9590:[21631714.050421] bnx2x: [bnx2x_panic_dump:1012(eno49)]     run indexes (0x1085 0x0)
9591:[21631714.050479] bnx2x: [bnx2x_panic_dump:1018(eno49)]     indexes (
9603:[21631714.053632] bnx2x 0000:04:00.0 eno49: bc 7.13.23
9696:[21631714.063136] bnx2x: [bnx2x_mc_assert:716(eno49)]XSTORM_ASSERT_LIST_INDEX 0x2
9697:[21631714.066125] bnx2x: [bnx2x_mc_assert:732(eno49)]XSTORM_ASSERT_INDEX 0x0 = 0x00000000 0xc9e0c9e0 0x2e382e38 0x00010057
9698:[21631714.069110] bnx2x: [bnx2x_mc_assert:746(eno49)]Chip Revision: everest3, FW Version: 7_10_51
9699:[21631714.072039] bnx2x: [bnx2x_panic_dump:1177(eno49)]end crash dump -----------------
9707:[21631736.644206] NETDEV WATCHDOG: eno49 (bnx2x): transmit queue 1 timed out
9741:[21631738.684604] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[0]: txdata->tx_pkt_prod(9299) != txdata->tx_pkt_cons(9294)
9743:[21631740.745792] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[1]: txdata->tx_pkt_prod(48843) != txdata->tx_pkt_cons(48835)
9744:[21631742.815793] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[2]: txdata->tx_pkt_prod(62970) != txdata->tx_pkt_cons(62955)
9745:[21631744.886766] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[3]: txdata->tx_pkt_prod(29448) != txdata->tx_pkt_cons(29437)
9747:[21631746.959711] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[4]: txdata->tx_pkt_prod(55138) != txdata->tx_pkt_cons(55094)
9748:[21631749.031377] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[5]: txdata->tx_pkt_prod(9786) != txdata->tx_pkt_cons(9782)
9749:[21631751.103618] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[7]: txdata->tx_pkt_prod(11860) != txdata->tx_pkt_cons(11832)
9750:[21631753.171240] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[0]: txdata->tx_pkt_prod(9299) != txdata->tx_pkt_cons(9294)
9751:[21631755.246299] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[1]: txdata->tx_pkt_prod(48843) != txdata->tx_pkt_cons(48835)
9752:[21631757.322651] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[2]: txdata->tx_pkt_prod(62970) != txdata->tx_pkt_cons(62955)
9753:[21631759.401861] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[3]: txdata->tx_pkt_prod(29448) != txdata->tx_pkt_cons(29437)
9755:[21631761.476578] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[4]: txdata->tx_pkt_prod(55138) != txdata->tx_pkt_cons(55094)
9756:[21631763.550730] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[5]: txdata->tx_pkt_prod(9786) != txdata->tx_pkt_cons(9782)
9757:[21631765.626399] bnx2x: [bnx2x_clean_tx_queue:1159(eno49)]timeout waiting for queue[7]: txdata->tx_pkt_prod(11860) != txdata->tx_pkt_cons(11832)
9758:[21631765.637799] bnx2x: [bnx2x_del_all_macs:8425(eno49)]Failed to delete MACs: -5
9759:[21631765.640938] bnx2x: [bnx2x_chip_cleanup:9245(eno49)]Failed to schedule DEL commands for UC MACs list: -5
9760:[21631765.658668] bnx2x: [bnx2x_func_stop:9004(eno49)]FUNC_STOP ramrod failed. Running a dry transaction
9761:[21631766.342949] bnx2x 0000:04:00.0 eno49: using MSI-X  IRQs: sp 94  fp[0] 96 ... fp[7] 103
9762:[21631766.449028] bnx2x: [bnx2x_nic_load:2754(eno49)]Function start failed!
9763:[21631766.618178] bond0: link status definitely down for interface eno49, disabling it

 

ethtool shows link not detected:

        Link partner advertised pause frame use: No
        Link partner advertised auto-negotiation: Yes
        Speed: Unknown!
        Duplex: Unknown! (255)
        Port: Twisted Pair
        PHYAD: 17
        Transceiver: internal
        Auto-negotiation: on
        MDI-X: Unknown
        Supports Wake-on: g
        Wake-on: g
        Current message level: 0x00000000 (0)
                               
        Link detected: no

ifdown eno49; ifup eno49, to bounce the interface, then see the following logs shown in dmesg.

[23453271.455597] bond0: Removing slave eno49
[23453271.455606] bond0: option slaves: invalid value (-eno49)
[23453284.807623] bond0: Adding slave eno49
[23453285.504955] bnx2x 0000:04:00.0 eno49: using MSI-X  IRQs: sp 94  fp[0] 96 ... fp[7] 103
[23453285.610512] bnx2x: [bnx2x_nic_load:2754(eno49)]Function start failed!
[23454236.613604] SELinux: initialized (dev tmpfs, type tmpfs), uses transition SIDs
[23454296.711718] SELinux: initialized (dev tmpfs, type tmpfs), uses transition SIDs
[23455696.147966] bond0: Removing slave eno49
[23455696.147974] bond0: option slaves: invalid value (-eno49)
[23455712.060846] bond0: Adding slave eno49
[23455712.588317] bnx2x 0000:04:00.0 eno49: using MSI-X  IRQs: sp 94  fp[0] 96 ... fp[7] 103
[23455712.694346] bnx2x: [bnx2x_nic_load:2754(eno49)]Function start failed!

After rebooting the machine, it returned to normal.

Server info:

ethtool -i eno49
driver: bnx2x
version: 1.712.30-0
firmware-version: bc 7.12.83
expansion-rom-version: 
bus-info: 0000:06:00.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: yes
uname -a
Linux  HOSTNAME.battle.net 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
Product Name: ProLiant DL360 Gen9

Did anyone meet this problem?

 

1 REPLY 1
VNET_SR
Member

Re: bnx2x driver crash

Hi,

the same here.

lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 14.04.5 LTS
Release:        14.04
Codename:       trusty

kernel:

Linux os-node-07 4.4.0-79-generic #100~14.04.1-Ubuntu SMP Fri May 19 18:36:51 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

 

HW:

System Information
        Manufacturer: HP
        Product Name: ProLiant BL460c G7
        Serial Number: GB8733RBWV

09:00.1 Ethernet controller: Broadcom Corporation NetXtreme II BCM57711E 10-Gigabit PCIe
        Subsystem: Hewlett-Packard Company NC532m Dual Port 10GbE Multifunction BL-C Adapter
        Physical Slot: 2
        Flags: bus master, fast devsel, latency 0, IRQ 85
        Memory at fa000000 (64-bit, non-prefetchable) [size=8M]
        Memory at f9800000 (64-bit, non-prefetchable) [size=8M]
        Capabilities: [48] Power Management version 3
        Capabilities: [50] Vital Product Data
        Capabilities: [58] MSI: Enable- Count=1/8 Maskable- 64bit+
        Capabilities: [a0] MSI-X: Enable+ Count=17 Masked-
        Capabilities: [ac] Express Endpoint, MSI 00
        Capabilities: [100] Device Serial Number 00-23-7d-ff-fe-e8-1b-14
        Capabilities: [110] Advanced Error Reporting
        Capabilities: [150] Power Budgeting <?>
        Capabilities: [160] Virtual Channel
        Kernel driver in use: bnx2x


root@os-node-07:~# ethtool p2p1
Settings for p2p1:
        Supported ports: [ FIBRE ]
        Supported link modes:   1000baseT/Full
                                2500baseX/Full
                                10000baseT/Full
        Supported pause frame use: Symmetric Receive-only
        Supports auto-negotiation: Yes
        Advertised link modes:  1000baseT/Full
                                2500baseX/Full
                                10000baseT/Full
        Advertised pause frame use: Transmit-only
        Advertised auto-negotiation: Yes
        Speed: Unknown!
        Duplex: Unknown! (255)
        Port: FIBRE
        PHYAD: 16
        Transceiver: internal
        Auto-negotiation: on
        Supports Wake-on: g
        Wake-on: g
        Current message level: 0x00000000 (0)

        Link detected: no

 

root@os-node-07:~# ethtool -i p2p1
driver: bnx2x
version: 1.712.30-0
firmware-version: bc 6.2.28 phy baa0.105
bus-info: 0000:09:00.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: yes

DMESG:

https://pastebin.com/7apkynwp