DL580G7 - Solaris 10 x86 9/10 (update 9) page fault crash in module ntxn

 
Muhammad Ridho
Occasional Contributor

Has anyone experienced the same problem as the one described below?

 

Server Info:

System        : ProLiant DL580 G7
Serial No.    : CN7*******     
ROM version   : P65 12/01/2010
iLo present   : Yes
Embedded NICs : 4
...

                                     CPU   Bus   Thread Cache Cache
                                     Speed Speed Count  Level Size
Number Socket Cores Step Name        (MHz) (MHz)              (KB)  Status
------ ------ ----- ---- ----------- ----- ----- ------ ----- ----- ------
  0      1      4     6  Intel Xeon  1867   133    8      3   18432 Ok
  1      2      4     6  Intel Xeon  1867   133    8      3   18432 Ok
  2      3      4     6  Intel Xeon  1867   133    8      3   18432 Ok
  3      4      4     6  Intel Xeon  1867   133    8      3   18432 Ok

Processor total  : 4
Memory installed : 32768 MBytes
ECC supported    : Yes

The server has panicked several times within a week, each time with the following messages:

 

Jul 18 09:38:49 ABC1a unix: [ID 836849 kern.notice]
Jul 18 09:38:49 ABC1a ^Mpanic[cpu15]/thread=fffffe80008b1c60:
Jul 18 09:38:49 ABC1a genunix: [ID 335743 kern.notice] BAD TRAP: type=e (#pf Page fault) rp=fffffe80008b1a00 addr=0 occurred in module "ntxn" due to a NULL pointer dereference
Jul 18 09:38:49 ABC1a unix: [ID 100000 kern.notice]
Jul 18 09:38:49 ABC1a unix: [ID 839527 kern.notice] sched:
Jul 18 09:38:49 ABC1a unix: [ID 753105 kern.notice] #pf Page fault
Jul 18 09:38:49 ABC1a unix: [ID 532287 kern.notice] Bad kernel fault at addr=0x0
Jul 18 09:38:49 ABC1a unix: [ID 243837 kern.notice] pid=0, pc=0xffffffffef45b7f3, sp=0xfffffe80008b1af0, eflags=0x10206
Jul 18 09:38:49 ABC1a unix: [ID 211416 kern.notice] cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 6f8<xmme,fxsr,pge,mce,pae,pse,de>
Jul 18 09:38:49 ABC1a unix: [ID 354241 kern.notice] cr2: 0 cr3: 12ac3000 cr8: c
Jul 18 09:38:49 ABC1a unix: [ID 592667 kern.notice]     rdi: ffffffff919e34c0 rsi:                1 rdx:                0
Jul 18 09:38:49 ABC1a unix: [ID 592667 kern.notice]     rcx: fffffe80008b1c60  r8: ffffffff919e34c0  r9:                1
Jul 18 09:38:49 ABC1a unix: [ID 592667 kern.notice]     rax:             11a7 rbx:                0 rbp: fffffe80008b1b20
Jul 18 09:38:49 ABC1a unix: [ID 592667 kern.notice]     r10:             7fff r11: fffffffffbcf55c0 r12:                0
Jul 18 09:38:49 ABC1a unix: [ID 592667 kern.notice]     r13: ffffffff919e34c0 r14: ffffffff919d1000 r15:                0
Jul 18 09:38:50 ABC1a unix: [ID 592667 kern.notice]     fsb:                0 gsb: ffffffff89d63800  ds:               43
Jul 18 09:38:50 ABC1a unix: [ID 592667 kern.notice]      es:               43  fs:                0  gs:              1c3
Jul 18 09:38:50 ABC1a unix: [ID 592667 kern.notice]     trp:                e err:                0 rip: ffffffffef45b7f3
Jul 18 09:38:50 ABC1a unix: [ID 592667 kern.notice]      cs:               28 rfl:            10206 rsp: fffffe80008b1af0
Jul 18 09:38:50 ABC1a unix: [ID 266532 kern.notice]      ss:               30
Jul 18 09:38:50 ABC1a unix: [ID 100000 kern.notice]
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b1910 unix:die+da ()
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b19f0 unix:trap+5e6 ()
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b1a00 unix:cmntrap+140 ()
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b1b20 ntxn:unm_reserve_rx_buffer+23 ()
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b1ba0 ntxn:unm_process_rcv_ring+171 ()
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b1bf0 ntxn:unm_intr+1c3 ()
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b1c40 unix:av_dispatch_autovect+78 ()
Jul 18 09:38:50 ABC1a genunix: [ID 655072 kern.notice] fffffe80008b1c50 unix:intr_thread+5f ()
Jul 18 09:38:50 ABC1a unix: [ID 100000 kern.notice]
Jul 18 09:38:50 ABC1a genunix: [ID 672855 kern.notice] syncing file systems...
Jul 18 09:38:51 ABC1a genunix: [ID 733762 kern.notice]  51
Jul 18 09:38:52 ABC1a genunix: [ID 733762 kern.notice]  48
Jul 18 09:38:54 ABC1a genunix: [ID 733762 kern.notice]  46
Jul 18 09:39:25 ABC1a last message repeated 20 times
Jul 18 09:39:26 ABC1a genunix: [ID 622722 kern.notice]  done (not all i/o completed)
Jul 18 09:39:27 ABC1a genunix: [ID 111219 kern.notice] dumping to /dev/dsk/c0t0d0s1, offset 13423673344, content: kernel
Jul 18 09:40:06 ABC1a genunix: [ID 100000 kern.notice]
Jul 18 09:40:06 ABC1a genunix: [ID 665016 kern.notice] ^M100% done: 804820 pages dumped,
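
For anyone comparing their own panics against this one, here is a minimal Python sketch that pulls the BAD TRAP summary and the stack frames out of syslog lines like those above. The regular expressions are assumptions based only on the line layout shown in this post, and the names `parse_panic` and `first_frame_in` are made up for illustration; this is not an HP or Sun tool.

```python
import re

# Stack-frame lines look like (layout assumed from the log in this post):
#   ... genunix: [ID 655072 kern.notice] fffffe80008b1b20 ntxn:unm_reserve_rx_buffer+23 ()
FRAME_RE = re.compile(
    r"\]\s+(?P<fp>[0-9a-f]{16})\s+(?P<module>\w+):(?P<func>\w+)"
    r"\+(?P<offset>[0-9a-f]+)\s+\(\)"
)

# The BAD TRAP summary line names the trap type, fault address and module.
TRAP_RE = re.compile(
    r'BAD TRAP: type=(?P<type>[0-9a-f]+) \((?P<desc>[^)]*)\).*'
    r'addr=(?P<addr>[0-9a-f]+) occurred in module "(?P<module>[^"]+)"'
)


def parse_panic(lines):
    """Return (trap, frames) for one panic.

    trap is a dict of the BAD TRAP fields, or None if no such line was seen.
    frames is a list of (module, function, offset) tuples, innermost first,
    in the same order as they appear in the log.
    """
    trap = None
    frames = []
    for line in lines:
        m = TRAP_RE.search(line)
        if m and trap is None:
            trap = m.groupdict()
            continue
        m = FRAME_RE.search(line)
        if m:
            frames.append((m["module"], m["func"], int(m["offset"], 16)))
    return trap, frames


def first_frame_in(frames, module):
    """Return the innermost frame belonging to the given module, or None."""
    for frame in frames:
        if frame[0] == module:
            return frame
    return None
```

Fed the lines from this post, `first_frame_in(frames, trap["module"])` should point at `unm_reserve_rx_buffer`, the function where the NULL pointer dereference is reported. Running it over every panic in the messages log would show whether all the crashes hit the same code path.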

This problem is really hurting us right now, and Google has turned up no good pointers on the issue. I hope to get some clues from this forum.

 

 

Thanks and best regards,

1 REPLY
Muhammad Ridho
Occasional Contributor

Re: DL580G7 - Solaris 10 x86 9/10 (update 9) page fault crash in module ntxn

Hi everyone,

 

Regarding the problem above, the closest match I have found so far on Google comes from this search keyword: NTXN_1 bug 7004495.

 

It seems there is a Solaris bug in the NetXen Solaris driver that is not yet publicly documented.

 

BR,