Xsanity Sanity for Apple's Xsan and Final Cut Server.
  
Wednesday, May 22 2013 @ 09:03 AM EDT
Topics
Storage (39)
People (1)
Xsan (103)
How To (26)
User Functions
Username:

Password:

Don't have an account yet? Sign up as a New User
Who's Online
Guest Users: 11
Sponsorship

Xsanity is proudly sponsored by:

Tekserve
The Old Reliable Mac Shop

MDC1 Kernel Panic (hwmond)
Goto page 1, 2  Next
 
Post new topic   Reply to topic    Xsanity Forums Forum Index -> Troubleshooting
View previous topic :: View next topic  
Author Message
d4corp
Could work for Apple
Could work for Apple


Joined: 07 Nov 2008
Posts: 50

PostPosted: Tue Jan 05, 2010 3:30 am    Post subject: MDC1 Kernel Panic (hwmond) Reply with quote

Good morning !

A nice email was waiting for me this morning :
"Le volume SANLaboutique sur le SAN SANXXX est à nouveau en ligne et fonctionne sur mdc2.xxx.pub." which means that the SAN is now available on mdc2.

After investigating a little bit, it turns out that mdc1 had a kernel panic earlier, related to hwmond.

Background :
    - two Xserve1,1 for mdc1 & mdc2
    - 2 Xserve RAID 8x500GB
    - 2 Xserve RAID 8x750GB
    - Mac OS X 10.6.2 (10C540)
    - Xsan 2.2.1
    - 2x Qlogic SANbox 5600
    - LSIlogic


Running apps :
    - Server Monitor.app
    - RAID Admin.app
    - Xsan Admin.app


Kernel panic log :
Code:
Anonymous UUID:                    3FD888E3-2BFC-4D49-A506-4F51EA26E65E

Tue Jan  5 04:12:38 2010
panic(cpu 0 caller 0x2345e7): "zalloc: \"kalloc.8192\" (159 elements) retry fail 3, kfree_nop_count: 0"@/SourceCache/xnu/xnu-1486.2.11/osfmk/kern/zalloc.c:981
Backtrace (CPU 0), Frame : Return Address (4 potential args on stack)
0x36743c78 : 0x21b2bd (0x5cf868 0x36743cac 0x223719 0x0)
0x36743cc8 : 0x2345e7 (0x5886dc 0x587125 0x9f 0x3)
0x36743d68 : 0x2203d8 (0x184c2d8 0x1 0x5484c58 0x0)
0x36743da8 : 0x220418 (0x1698 0x1 0x36743dc8 0x2a0596)
0x36743dc8 : 0x2104ac (0x1698 0x246 0x36743e68 0x20fd38)
0x36743df8 : 0x21d5d7 (0x10f8 0x0 0x5d00 0x2903)
0x36743e38 : 0x2107f7 (0x5912a00 0x0 0x546e2d0 0x546e8f0)
0x36743e98 : 0x216a5a (0x5912a00 0x0 0x0 0x0)
0x36743f18 : 0x2924f1 (0x36743f44 0x0 0x0 0x0)
0x36743fc8 : 0x29dfd8 (0x5212ca0 0x1 0x10 0x6da0ba4)

BSD process name corresponding to current thread: hwmond

Mac OS version:
10C540

Kernel version:
Darwin Kernel Version 10.2.0: Tue Nov  3 10:37:10 PST 2009; root:xnu-1486.2.11~1/RELEASE_I386
System model name: Xserve1,1 (Mac-F4208AC8)

System uptime in nanoseconds: 318659674259306
vm objects:10728520
vm object hash entri:1338240
kernel map entries:2875356
pv_list:1181432
kalloc.16:1851392
kalloc.32:2068480
kalloc.64:19804160
kalloc.128:5009408
kalloc.1024:24092672
kalloc.2048:322654208
kalloc.8192:1302528
vm pages:22617232
ipc ports:1823248
threads:1176000
vnodes:13325180
namecache:4728720
HFS node:19787900
HFS fork:8016512
buf.8192:85794816
ubc_info zone:2676480
vnode pager structur:1338240
Kernel Stacks:2424832
PageTables:16367616
Kalloc.Large:2595280
unloaded kexts:
com.apple.iokit.IOUSBHIDDriver   3.8.4 (addr 0x36275000, size 0x24576) - last unloaded 85722720307455
loaded kexts:
com.apple.filesystems.afpfs   9.6 - last loaded 85628111877553
com.apple.nke.asp_tcp   5.0
com.apple.filesystems.autofs   2.1.0
com.apple.driver.AppleHWSensor   1.9.2d0
com.apple.driver.AppleUpstreamUserClient   3.1.0
com.apple.kext.ATIFramebuffer   6.0.6
com.apple.Dont_Steal_Mac_OS_X   7.0.0
com.apple.ATIRadeonX1000   6.0.6
com.apple.driver.AudioIPCDriver   1.1.2
com.apple.driver.AppleIntel8254XEthernet   2.1.1b7
com.apple.driver.AppleSEP   1.4.2
com.apple.driver.AppleIntelMeromProfile   19
com.apple.driver.AppleMCEDriver   1.1.9
com.apple.driver.AppleBMC   2.0.2
com.apple.driver.Apple16X50ACPI   3.0
com.apple.driver.ACPI_SMC_PlatformPlugin   4.0.1d0
com.apple.driver.AppleLPC   1.4.9
com.apple.driver.AppleRAID   4.0.6
com.apple.iokit.SCSITaskUserClient   2.6.0
com.apple.BootCache   31
com.apple.AppleFSCompression.AppleFSCompressionTypeZlib   1.0.0d1
com.apple.driver.AppleUSBHub   3.8.4
com.apple.driver.AppleLSIFusionMPT   2.5.0
com.apple.driver.AppleFWOHCI   4.4.0
com.apple.driver.AppleIntelPIIXATA   2.5.0
com.apple.driver.AppleEFINVRAM   1.3.0
com.apple.driver.AppleUSBEHCI   3.7.5
com.apple.driver.AppleUSBUHCI   3.7.5
com.apple.driver.AppleACPIButtons   1.3
com.apple.driver.AppleRTC   1.3
com.apple.driver.AppleHPET   1.4
com.apple.driver.AppleSMBIOS   1.4
com.apple.driver.AppleACPIEC   1.3
com.apple.driver.AppleAPIC   1.4
com.apple.driver.AppleIntelCPUPowerManagementClient   96.0.0
com.apple.security.sandbox   0
com.apple.security.quarantine   0
com.apple.nke.applicationfirewall   2.1.11
com.apple.driver.AppleIntelCPUPowerManagement   96.0.0
com.apple.filesystems.acfsctl   412.3
com.apple.filesystems.acfs   412.3
com.apple.driver.AppleProfileReadCounterAction   17
com.apple.driver.AppleProfileTimestampAction   10
com.apple.driver.AppleProfileThreadInfoAction   14
com.apple.driver.AppleProfileRegisterStateAction   10
com.apple.driver.AppleProfileKEventAction   10
com.apple.driver.AppleProfileCallstackAction   20
com.apple.iokit.IOSurface   73.0
com.apple.iokit.IOBluetoothSerialManager   2.2.4f3
com.apple.iokit.IONDRVSupport   2.0
com.apple.iokit.IOAudioFamily   1.7.2fc1
com.apple.kext.OSvKernDSPLib   1.3
com.apple.kext.ATI1300Controller   6.0.6
com.apple.kext.ATISupport   6.0.6
com.apple.iokit.IOFireWireIP   2.0.3
com.apple.iokit.IONetworkingFamily   1.9
com.apple.iokit.AppleProfileFamily   41
com.apple.iokit.IOGraphicsFamily   2.0
com.apple.driver.Apple16X50Serial   3.0
com.apple.iokit.IOSerialFamily   10.0.3
com.apple.driver.AppleSMC   3.0.1d2
com.apple.driver.IOPlatformPluginFamily   4.0.1d0
com.apple.driver.XsanFilter   402.1
com.apple.iokit.IOSCSIBlockCommandsDevice   2.6.0
com.apple.driver.AppleUSBMergeNub   3.8.5
com.apple.driver.AppleUSBComposite   3.7.5
com.apple.iokit.IOSCSIMultimediaCommandsDevice   2.6.0
com.apple.iokit.IOBDStorageFamily   1.6
com.apple.iokit.IODVDStorageFamily   1.6
com.apple.iokit.IOCDStorageFamily   1.6
com.apple.iokit.IOATAPIProtocolTransport   2.5.0
com.apple.iokit.IOSCSIParallelFamily   2.0.0
com.apple.iokit.IOSCSIArchitectureModelFamily   2.6.0
com.apple.iokit.IOFireWireFamily   4.1.7
com.apple.iokit.IOUSBUserClient   3.8.5
com.apple.iokit.IOATAFamily   2.5.0
com.apple.iokit.IOUSBFamily   3.8.5
com.apple.driver.AppleEFIRuntime   1.3.0
com.apple.driver.AppleKeyswitch   1.0.5f4
com.apple.iokit.IOHIDFamily   1.6.1
com.apple.iokit.IOSMBusFamily   1.1
com.apple.security.TMSafetyNet   6
com.apple.kext.AppleMatch   1.0.0d1
com.apple.driver.DiskImages   281
com.apple.iokit.IOStorageFamily   1.6
com.apple.driver.AppleACPIPlatform   1.3
com.apple.iokit.IOPCIFamily   2.6
com.apple.iokit.IOACPIFamily   1.3.0
Model: Xserve1,1, BootROM XS11.0080.B01, 4 processors, Dual-Core Intel Xeon, 2 GHz, 2 GB, SMC 1.11f5
Graphics: ATI Radeon X1300, ATY,RadeonX1300, PCIe, 64 MB
Memory Module: global_name
Network Service: Ethernet 2, Ethernet, en1
Network Service: Ethernet 1, Ethernet, en0
PCI Card: ATY,RadeonX1300, Display, Mezzanine
PCI Card: Apple 2 Port 4Gbps Fibre Channel Card, sppci_fibrechannel, Slot-1
PCI Card: Apple 2 Port 4Gbps Fibre Channel Card, sppci_fibrechannel, Slot-1
Parallel ATA Device: MATSHITACD-RW  CW-8124
Fibre Channel Device: SCSI Target Device @ 0
Fibre Channel Device: SCSI Target Device @ 1
Fibre Channel Device: SCSI Target Device @ 2
Fibre Channel Device: SCSI Target Device @ 3
Fibre Channel Device: SCSI Target Device @ 4
Fibre Channel Device: SCSI Target Device @ 5
Fibre Channel Device: SCSI Target Device @ 6
Fibre Channel Device: SCSI Target Device @ 7
Fibre Channel Device: SCSI Target Device @ 0
Fibre Channel Device: SCSI Target Device @ 1
Fibre Channel Device: SCSI Target Device @ 2
Fibre Channel Device: SCSI Target Device @ 3
Fibre Channel Device: SCSI Target Device @ 4
Fibre Channel Device: SCSI Target Device @ 5
Fibre Channel Device: SCSI Target Device @ 6
Fibre Channel Device: SCSI Target Device @ 7
USB Device: Frontpanel Controller, 0x05ac  (Apple Inc.), 0x8261, 0x3d100000
FireWire Device: built-in_hub, Up to 800 Mb/sec


hwmond.log :
Code:


============================================================
Thu Dec 31 17:14:08 CET 2009 - Number of drives change from 11 to 3.
Thu Dec 31 17:18:14 CET 2009 - Number of drives change from 3 to 11.
Thu Dec 31 17:18:26 CET 2009 - Number of drives change from 11 to 4.
Thu Dec 31 17:23:50 CET 2009 - Number of drives change from 4 to 12.
Thu Dec 31 17:23:57 CET 2009 - Number of drives change from 12 to 5.
Fri Jan  1 11:36:50 CET 2010 -
============================================================
Status Summary

Server:
  Host                          : Metadata-1
  Model                         : Xserve1,1
  Uptime                        : 1 minutes
  OS version                    : Mac OS X Server 10.6.2 (10C540)
  Processor                     : 2 x 2000 MHz
  Memory                        : 2048 MB
  BootROM                       : XS11.88Z.0080.B01.0706271533
  Serial                        : CK737011V2Q

Memory:
  Memory Slot "BRANCH 0 CHANNEL 0/DIMM 1"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz
  Memory Slot "BRANCH 0 CHANNEL 1/DIMM 2"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz
  Memory Slot "BRANCH 1 CHANNEL 0/DIMM 3"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz
  Memory Slot "BRANCH 1 CHANNEL 1/DIMM 4"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz

Drives:
  Drive 1               (disk8)   : Normal
  Drive 2               (disk10)   : Normal
  Drive 3               (disk7)   : Normal
  Drive 4               (disk0)   : Normal
  Drive 5               (disk1)   : Normal
  Drive 6               (disk2)   : Normal
  Drive 7               (disk3)   : Normal
  Drive 8               (disk4)   : Normal
  Drive 9               (disk5)   : Normal
  Drive 10               (disk11)   : Normal
  Drive 11               (disk6)   : Normal

Network:
  en1                (  active)   : Normal
  en0                (  active)   : Normal
  fw0                (inactive)   : Normal


============================================================
Fri Jan  1 11:36:53 CET 2010 - Number of drives change from 11 to 3.
Tue Jan  5 04:12:58 CET 2010 -
============================================================
Status Summary

Server:
  Host                          : Metadata-1
  Model                         : Xserve1,1
  Uptime                        : 1 minutes
  OS version                    : Mac OS X Server 10.6.2 (10C540)
  Processor                     : 2 x 2000 MHz
  Memory                        : 2048 MB
  BootROM                       : XS11.88Z.0080.B01.0706271533
  Serial                        : CK737011V2Q

Memory:
  Memory Slot "BRANCH 0 CHANNEL 0/DIMM 1"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz
  Memory Slot "BRANCH 0 CHANNEL 1/DIMM 2"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz
  Memory Slot "BRANCH 1 CHANNEL 0/DIMM 3"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz
  Memory Slot "BRANCH 1 CHANNEL 1/DIMM 4"   : 512MB, ECC DDR2 FB-DIMM, 667 MHz

Drives:
  Drive 1               (disk8)   : Normal
  Drive 2               (disk10)   : Normal
  Drive 3               (disk0)   : Normal
  Drive 4               (disk1)   : Normal
  Drive 5               (disk2)   : Normal
  Drive 6               (disk3)   : Normal
  Drive 7               (disk4)   : Normal
  Drive 8               (disk5)   : Normal
  Drive 9               (disk6)   : Normal
  Drive 10               (disk11)   : Normal
  Drive 11               (disk7)   : Normal

Network:
  en1                (  active)   : Normal
  en0                (  active)   : Normal
  fw0                (inactive)   : Normal

mdc1:~ administrateur$


Notes :

    - I was experiencing hangs while browsing the Xsan volume with the Finder & Terminal. Even a mkdir /Volumes/SANXXX/test was taking forever
    - All the (software) setup is brand new, I don't think the issue was happening before the install.


Questions :
    1. How could I investigate the issue a little bit more ?
    2. Could it be hardware related ?
    3. Do you have any idea why the number of drives keep changing in hwmond.log ?
Back to top
View user's profile Send private message Visit poster's website
d4corp
Could work for Apple
Could work for Apple


Joined: 07 Nov 2008
Posts: 50

PostPosted: Fri Jan 08, 2010 5:20 am    Post subject: Reply with quote

Paniced this morning at 04:23:48… interesting…

Code:
Interval Since Last Panic Report:  685083 sec
Panics Since Last Report:          2
Anonymous UUID:                    3FD888E3-2BFC-4D49-A506-4F51EA26E65E

Fri Jan  8 04:23:48 2010
panic(cpu 2 caller 0x2345e7): "zalloc: \"kalloc.8192\" (169 elements) retry fail 3, kfree_nop_count: 0"@/SourceCache/xnu/xnu-1486.2.11/osfmk/kern/zalloc.c:981
Backtrace (CPU 2), Frame : Return Address (4 potential args on stack)
0x35e5bc78 : 0x21b2bd (0x5cf868 0x35e5bcac 0x223719 0x0)
0x35e5bcc8 : 0x2345e7 (0x5886dc 0x587125 0xa9 0x3)
0x35e5bd68 : 0x2203d8 (0x15342d8 0x1 0x35e5bdb8 0x22678f)
0x35e5bda8 : 0x220418 (0x1698 0x1 0x35e5bdc8 0x2a0596)
0x35e5bdc8 : 0x2104ac (0x1698 0x246 0x35e5be68 0x20fd38)
0x35e5bdf8 : 0x21d5d7 (0x10f8 0x0 0x5d00 0x2903)
0x35e5be38 : 0x2107f7 (0x15d08c00 0x0 0x5088460 0x4f9ace0)
0x35e5be98 : 0x216a5a (0x15d08c00 0x0 0x0 0x0)
0x35e5bf18 : 0x2924f1 (0x35e5bf44 0x0 0x0 0x0)
0x35e5bfc8 : 0x29dfd8 (0x51ceba0 0x1 0x10 0x51ba484)

BSD process name corresponding to current thread: hwmond

Mac OS version:
10C540

Kernel version:
Darwin Kernel Version 10.2.0: Tue Nov  3 10:37:10 PST 2009; root:xnu-1486.2.11~1/RELEASE_I386
System model name: Xserve1,1 (Mac-F4208AC8)

System uptime in nanoseconds: 259559679298749
vm objects:12783796
vm object hash entri:1587120
kernel map entries:2875356
pv_list:1181432
kalloc.16:5365760
kalloc.32:1961984
kalloc.64:30777344
kalloc.128:15536128
kalloc.256:1052672
kalloc.1024:88748032
kalloc.2048:229945344
kalloc.8192:1384448
vm pages:22621324
ipc ports:2338336
threads:1179920
vnodes:13325180
namecache:4740960
HFS node:19780640
HFS fork:7074144
buf.8192:85803008
ubc_info zone:3174240
vnode pager structur:1587120
Kernel Stacks:2375680
PageTables:16293888
Kalloc.Large:4179952
unloaded kexts:
com.apple.iokit.IOUSBHIDDriver   3.8.4 (addr 0x35ec1000, size 0x24576) - last unloaded 22627709646317
loaded kexts:
com.apple.filesystems.autofs   2.1.0
com.apple.driver.AppleHWSensor   1.9.2d0
com.apple.driver.AppleUpstreamUserClient   3.1.0
com.apple.kext.ATIFramebuffer   6.0.6
com.apple.Dont_Steal_Mac_OS_X   7.0.0
com.apple.ATIRadeonX1000   6.0.6
com.apple.driver.AudioIPCDriver   1.1.2
com.apple.driver.AppleIntel8254XEthernet   2.1.1b7
com.apple.driver.AppleSEP   1.4.2
com.apple.driver.AppleIntelMeromProfile   19
com.apple.driver.AppleMCEDriver   1.1.9
com.apple.driver.AppleBMC   2.0.2
com.apple.driver.Apple16X50ACPI   3.0
com.apple.driver.ACPI_SMC_PlatformPlugin   4.0.1d0
com.apple.driver.AppleLPC   1.4.9
com.apple.driver.AppleRAID   4.0.6
com.apple.iokit.SCSITaskUserClient   2.6.0
com.apple.BootCache   31
com.apple.AppleFSCompression.AppleFSCompressionTypeZlib   1.0.0d1
com.apple.driver.AppleFWOHCI   4.4.0
com.apple.driver.AppleLSIFusionMPT   2.5.0
com.apple.driver.AppleUSBHub   3.8.4
com.apple.driver.AppleEFINVRAM   1.3.0
com.apple.driver.AppleIntelPIIXATA   2.5.0
com.apple.driver.AppleUSBUHCI   3.7.5
com.apple.driver.AppleACPIButtons   1.3
com.apple.driver.AppleRTC   1.3
com.apple.driver.AppleUSBEHCI   3.7.5
com.apple.driver.AppleHPET   1.4
com.apple.driver.AppleSMBIOS   1.4
com.apple.driver.AppleACPIEC   1.3
com.apple.driver.AppleAPIC   1.4
com.apple.driver.AppleIntelCPUPowerManagementClient   96.0.0
com.apple.security.sandbox   0
com.apple.security.quarantine   0
com.apple.nke.applicationfirewall   2.1.11
com.apple.driver.AppleIntelCPUPowerManagement   96.0.0
com.apple.driver.AppleUSBMergeNub   3.8.5 - last loaded 22530066423799
com.apple.driver.AppleUSBComposite   3.7.5
com.apple.filesystems.acfsctl   412.3
com.apple.filesystems.acfs   412.3
com.apple.driver.AppleProfileReadCounterAction   17
com.apple.driver.AppleProfileTimestampAction   10
com.apple.driver.AppleProfileThreadInfoAction   14
com.apple.driver.AppleProfileRegisterStateAction   10
com.apple.driver.AppleProfileKEventAction   10
com.apple.driver.AppleProfileCallstackAction   20
com.apple.iokit.IOSurface   73.0
com.apple.iokit.IOBluetoothSerialManager   2.2.4f3
com.apple.iokit.IONDRVSupport   2.0
com.apple.iokit.IOAudioFamily   1.7.2fc1
com.apple.kext.OSvKernDSPLib   1.3
com.apple.kext.ATI1300Controller   6.0.6
com.apple.kext.ATISupport   6.0.6
com.apple.iokit.AppleProfileFamily   41
com.apple.iokit.IOGraphicsFamily   2.0
com.apple.driver.Apple16X50Serial   3.0
com.apple.iokit.IOSerialFamily   10.0.3
com.apple.driver.AppleSMC   3.0.1d2
com.apple.driver.IOPlatformPluginFamily   4.0.1d0
com.apple.iokit.IOFireWireIP   2.0.3
com.apple.iokit.IONetworkingFamily   1.9
com.apple.driver.XsanFilter   402.1
com.apple.iokit.IOSCSIBlockCommandsDevice   2.6.0
com.apple.iokit.IOSCSIMultimediaCommandsDevice   2.6.0
com.apple.iokit.IOBDStorageFamily   1.6
com.apple.iokit.IODVDStorageFamily   1.6
com.apple.iokit.IOCDStorageFamily   1.6
com.apple.iokit.IOATAPIProtocolTransport   2.5.0
com.apple.iokit.IOFireWireFamily   4.1.7
com.apple.iokit.IOSCSIParallelFamily   2.0.0
com.apple.iokit.IOSCSIArchitectureModelFamily   2.6.0
com.apple.iokit.IOATAFamily   2.5.0
com.apple.iokit.IOUSBUserClient   3.8.5
com.apple.driver.AppleEFIRuntime   1.3.0
com.apple.driver.AppleKeyswitch   1.0.5f4
com.apple.iokit.IOHIDFamily   1.6.1
com.apple.iokit.IOUSBFamily   3.8.5
com.apple.iokit.IOSMBusFamily   1.1
com.apple.security.TMSafetyNet   6
com.apple.kext.AppleMatch   1.0.0d1
com.apple.driver.DiskImages   281
com.apple.iokit.IOStorageFamily   1.6
com.apple.driver.AppleACPIPlatform   1.3
com.apple.iokit.IOPCIFamily   2.6
com.apple.iokit.IOACPIFamily   1.3.0
Model: Xserve1,1, BootROM XS11.0080.B01, 4 processors, Dual-Core Intel Xeon, 2 GHz, 2 GB, SMC 1.11f5
Graphics: ATI Radeon X1300, ATY,RadeonX1300, PCIe, 64 MB
Memory Module: global_name
Network Service: Ethernet 2, Ethernet, en1
Network Service: Ethernet 1, Ethernet, en0
PCI Card: ATY,RadeonX1300, Display, Mezzanine
PCI Card: Apple 2 Port 4Gbps Fibre Channel Card, sppci_fibrechannel, Slot-1
PCI Card: Apple 2 Port 4Gbps Fibre Channel Card, sppci_fibrechannel, Slot-1
Parallel ATA Device: MATSHITACD-RW  CW-8124
Fibre Channel Device: SCSI Target Device @ 0
Fibre Channel Device: SCSI Target Device @ 1
Fibre Channel Device: SCSI Target Device @ 2
Fibre Channel Device: SCSI Target Device @ 3
Fibre Channel Device: SCSI Target Device @ 4
Fibre Channel Device: SCSI Target Device @ 5
Fibre Channel Device: SCSI Target Device @ 6
Fibre Channel Device: SCSI Target Device @ 7
Fibre Channel Device: SCSI Target Device @ 0
Fibre Channel Device: SCSI Target Device @ 1
Fibre Channel Device: SCSI Target Device @ 2
Fibre Channel Device: SCSI Target Device @ 3
Fibre Channel Device: SCSI Target Device @ 4
Fibre Channel Device: SCSI Target Device @ 5
Fibre Channel Device: SCSI Target Device @ 6
Fibre Channel Device: SCSI Target Device @ 7
USB Device: Frontpanel Controller, 0x05ac  (Apple Inc.), 0x8261, 0x3d100000
FireWire Device: built-in_hub, Up to 800 Mb/sec
Back to top
View user's profile Send private message Visit poster's website
d4corp
Could work for Apple
Could work for Apple


Joined: 07 Nov 2008
Posts: 50

PostPosted: Wed Jan 20, 2010 2:18 am    Post subject: Reply with quote

After the third consecutive crash in a week, I contacted AppleCare Support. Having no Xsan Applecare Contract, they gave me three choices :
- Buy AppleCare Xsan
- Remove Xsan from Metadata 1
- Stop calling them

I chose the second choice. I've uninstalled Xsan from MDC1, promoting MACPRO6 to metadata 2. A week later, MDC1 was still up and running.

What's more interesting is what was waiting for me this morning :

Quote:
Interval Since Last Panic Report: 649401 sec
Panics Since Last Report: 1
Anonymous UUID: 6FF6D692-C046-4E0B-8F67-A6B82A4D8CD3

Wed Jan 20 03:43:01 2010
panic(cpu 0 caller 0x2345e7): "zalloc: \"buf.8192\" (230 elements) retry fail 3, kfree_nop_count: 0"@/SourceCache/xnu/xnu-1486.2.11/osfmk/kern/zalloc.c:981
Backtrace (CPU 0), Frame : Return Address (4 potential args on stack)
0x36c42f78 : 0x21b2bd (0x5cf868 0x36c42fac 0x223719 0x0)
0x36c42fc8 : 0x2345e7 (0x5886dc 0x59566c 0xe6 0x3)
0x36c43068 : 0x234c33 (0x1845d4c 0x1 0x23ac8254 0x0)
0x36c43088 : 0x2c9a00 (0x1845d4c 0x0 0x36c430b8 0x3)
0x36c430c8 : 0x2ca77c (0x31528830 0x2000 0xa 0x487be3)
0x36c43168 : 0x2caa09 (0x456ee10 0x13f3 0x0 0x2000)
0x36c431a8 : 0x2caac6 (0x2000 0x0 0x0 0x10)
0x36c431c8 : 0x411da3 (0x456ee10 0x13f3 0x0 0x2000)
0x36c43238 : 0x448145 (0x456ee10 0x13f3 0x0 0x36c43294)
0x36c43258 : 0x445a5c (0x4563c04 0x13f3 0x0 0x36c43294)
0x36c43308 : 0x415ef0 (0x457cf10 0x4 0x23ac8004 0x414519)
0x36c438e8 : 0x43ab72 (0x451d004 0x3fa 0x5b87bb8 0x36c43ebc)
0x36c439e8 : 0x2f6bc0 (0x36c43a00 0x3 0x36c43ebc 0x48753f)
0x36c43a38 : 0x2e28c4 (0x456e8dc 0x36c43ebc 0x1 0x36c43f04)
0x36c43f28 : 0x2e2cf9 (0x1000 0x0 0x36c43f5c 0x36c43f50)
0x36c43f78 : 0x4ee947 (0x5218000 0x14386908 0x1648c324 0x1)
0x36c43fc8 : 0x29e3fd (0x14386904 0x0 0x10 0x0)

BSD process name corresponding to current thread: sendmail

Mac OS version:
10C540

Kernel version:
Darwin Kernel Version 10.2.0: Tue Nov 3 10:37:10 PST 2009; root:xnu-1486.2.11~1/RELEASE_I386
System model name: Xserve1,1 (Mac-F4208AC8)

System uptime in nanoseconds: 120031991612207
vm objects:13174516
vm object hash entri:1640160
kernel map entries:2875356
pv_list:1459416
kalloc.16:6057984
kalloc.32:1728512
kalloc.64:36401152
kalloc.128:15736832
kalloc.256:1142784
kalloc.1024:91222016
kalloc.2048:316026880
kalloc.8192:1523712
vm pages:22621324
ipc ports:3090528
threads:1450400
vnodes:13325180
namecache:4720560
HFS node:9093260
HFS fork:3571568
buf.8192:1884160
ubc_info zone:3280320
vnode pager structur:1640160
Kernel Stacks:3293184
PageTables:22024192
Kalloc.Large:2246640
unloaded kexts:
com.apple.iokit.IOUSBHIDDriver 3.8.4 (addr 0x35b61000, size 0x24576) - last unloaded 58799672939167
loaded kexts:
com.apple.filesystems.autofs 2.1.0
com.apple.driver.AppleHWSensor 1.9.2d0
com.apple.driver.AppleUpstreamUserClient 3.1.0
com.apple.Dont_Steal_Mac_OS_X 7.0.0
com.apple.kext.ATIFramebuffer 6.0.6
com.apple.driver.AudioIPCDriver 1.1.2
com.apple.ATIRadeonX1000 6.0.6
com.apple.driver.AppleIntel8254XEthernet 2.1.1b7
com.apple.driver.AppleSEP 1.4.2
com.apple.driver.AppleIntelMeromProfile 19
com.apple.driver.AppleMCEDriver 1.1.9
com.apple.driver.AppleBMC 2.0.2
com.apple.driver.Apple16X50ACPI 3.0
com.apple.driver.ACPI_SMC_PlatformPlugin 4.0.1d0
com.apple.driver.AppleLPC 1.4.9
com.apple.driver.AppleRAID 4.0.6
com.apple.iokit.SCSITaskUserClient 2.6.0
com.apple.BootCache 31
com.apple.AppleFSCompression.AppleFSCompressionTypeZlib 1.0.0d1
com.apple.driver.AppleLSIFusionMPT 2.5.0
com.apple.driver.AppleFWOHCI 4.4.0
com.apple.driver.AppleUSBHub 3.8.4
com.apple.driver.AppleEFINVRAM 1.3.0
com.apple.driver.AppleIntelPIIXATA 2.5.0
com.apple.driver.AppleUSBEHCI 3.7.5
com.apple.driver.AppleUSBUHCI 3.7.5
com.apple.driver.AppleACPIButtons 1.3
com.apple.driver.AppleRTC 1.3
com.apple.driver.AppleHPET 1.4
com.apple.driver.AppleSMBIOS 1.4
com.apple.driver.AppleACPIEC 1.3
com.apple.driver.AppleAPIC 1.4
com.apple.driver.AppleIntelCPUPowerManagementClient 96.0.0
com.apple.security.sandbox 0
com.apple.security.quarantine 0
com.apple.nke.applicationfirewall 2.1.11
com.apple.driver.AppleIntelCPUPowerManagement 96.0.0
com.apple.filesystems.acfsctl 412.3 - last loaded 58696614978227
com.apple.filesystems.acfs 412.3
com.apple.driver.AppleProfileReadCounterAction 17
com.apple.driver.AppleProfileTimestampAction 10
com.apple.driver.AppleProfileThreadInfoAction 14
com.apple.driver.AppleProfileRegisterStateAction 10
com.apple.driver.AppleProfileKEventAction 10
com.apple.driver.AppleProfileCallstackAction 20
com.apple.iokit.IOSurface 73.0
com.apple.iokit.IOBluetoothSerialManager 2.2.4f3
com.apple.iokit.IOAudioFamily 1.7.2fc1
com.apple.kext.OSvKernDSPLib 1.3
com.apple.iokit.IONDRVSupport 2.0
com.apple.iokit.IOFireWireIP 2.0.3
com.apple.iokit.IONetworkingFamily 1.9
com.apple.iokit.AppleProfileFamily 41
com.apple.driver.Apple16X50Serial 3.0
com.apple.iokit.IOSerialFamily 10.0.3
com.apple.driver.AppleSMC 3.0.1d2
com.apple.driver.IOPlatformPluginFamily 4.0.1d0
com.apple.kext.ATI1300Controller 6.0.6
com.apple.kext.ATISupport 6.0.6
com.apple.iokit.IOGraphicsFamily 2.0
com.apple.driver.XsanFilter 402.1
com.apple.iokit.IOSCSIBlockCommandsDevice 2.6.0
com.apple.driver.AppleUSBMergeNub 3.8.5
com.apple.driver.AppleUSBComposite 3.7.5
com.apple.iokit.IOSCSIMultimediaCommandsDevice 2.6.0
com.apple.iokit.IOBDStorageFamily 1.6
com.apple.iokit.IODVDStorageFamily 1.6
com.apple.iokit.IOCDStorageFamily 1.6
com.apple.iokit.IOATAPIProtocolTransport 2.5.0
com.apple.iokit.IOSCSIParallelFamily 2.0.0
com.apple.iokit.IOSCSIArchitectureModelFamily 2.6.0
com.apple.iokit.IOFireWireFamily 4.1.7
com.apple.iokit.IOUSBUserClient 3.8.5
com.apple.iokit.IOATAFamily 2.5.0
com.apple.iokit.IOUSBFamily 3.8.5
com.apple.driver.AppleEFIRuntime 1.3.0
com.apple.driver.AppleKeyswitch 1.0.5f4
com.apple.iokit.IOHIDFamily 1.6.1
com.apple.iokit.IOSMBusFamily 1.1
com.apple.security.TMSafetyNet 6
com.apple.kext.AppleMatch 1.0.0d1
com.apple.driver.DiskImages 281
com.apple.iokit.IOStorageFamily 1.6
com.apple.driver.AppleACPIPlatform 1.3
com.apple.iokit.IOPCIFamily 2.6
com.apple.iokit.IOACPIFamily 1.3.0
Model: Xserve1,1, BootROM XS11.0080.B01, 4 processors, Dual-Core Intel Xeon, 2 GHz, 2 GB, SMC 1.11f5
Graphics: ATI Radeon X1300, ATY,RadeonX1300, PCIe, 64 MB
Memory Module: global_name
Network Service: Ethernet 2, Ethernet, en1
Network Service: Ethernet 1, Ethernet, en0
PCI Card: ATY,RadeonX1300, Display, Mezzanine
PCI Card: Apple 2 Port 4Gbps Fibre Channel Card, sppci_fibrechannel, Slot-1
PCI Card: Apple 2 Port 4Gbps Fibre Channel Card, sppci_fibrechannel, Slot-1
Parallel ATA Device: MATSHITACD-RW CW-8124
Fibre Channel Device: SCSI Target Device @ 0
Fibre Channel Device: SCSI Target Device @ 1
Fibre Channel Device: SCSI Target Device @ 2
Fibre Channel Device: SCSI Target Device @ 3
Fibre Channel Device: SCSI Target Device @ 4
Fibre Channel Device: SCSI Target Device @ 5
Fibre Channel Device: SCSI Target Device @ 6
Fibre Channel Device: SCSI Target Device @ 7
Fibre Channel Device: SCSI Target Device @ 0
Fibre Channel Device: SCSI Target Device @ 1
Fibre Channel Device: SCSI Target Device @ 2
Fibre Channel Device: SCSI Target Device @ 3
Fibre Channel Device: SCSI Target Device @ 4
Fibre Channel Device: SCSI Target Device @ 5
Fibre Channel Device: SCSI Target Device @ 6
Fibre Channel Device: SCSI Target Device @ 7
USB Device: Frontpanel Controller, 0x05ac (Apple Inc.), 0x8261, 0x3d100000
FireWire Device: built-in_hub, Up to 800 Mb/sec


Yes, MDC2 crashed. Same kernel panic caused by zalloc, but this time sendmail crashed. This might be caused by hwmond trying to send a mail, who knows… Anyway sendmail is not configured, so I don't know how it could be used besides sending me notifications…

I thought it could be related to periodic maintenance, but they did run at 03:15, half an hour before the kernel panic. And the third time mdc1 crashed was at 4:30PM, not AM. But interestingly enough, 1,2,4 were all around 4AM.

Code:

mdc2:~ administrateur$ ls -al /var/log/*.out
-rw-r--r--  1 root  wheel  33965 Jan 20 03:15 /var/log/daily.out
-rw-r--r--  1 root  wheel    172 Jan  1 05:30 /var/log/monthly.out
-rw-r--r--  1 root  wheel    350 Jan 16 03:15 /var/log/weekly.out
Back to top
View user's profile Send private message Visit poster's website
lotte
Xsan Master
Xsan Master


Joined: 11 Dec 2008
Posts: 190

PostPosted: Wed Jan 20, 2010 4:44 am    Post subject: Reply with quote

Have you tried to check the Xsan Filesystem?

Lotte
Back to top
View user's profile Send private message
memblin
Been around the blocks
Been around the blocks


Joined: 22 Apr 2009
Posts: 20

PostPosted: Wed Jan 20, 2010 3:14 pm    Post subject: Reply with quote

We've been dealing with this issue for over a year.

Our setup is currently:

2 x Intel MDC 2 ( OSX 10.5.8 )
2 x Promise VTrak E610f for storage
XSan 2.2

1 Data volume
1 Metadata volume

We use this data volume to store small graphics files and print job
information then pull those same assets from the XSan when running the
print jobs on our presses.

In the spring of 2008 before I took this job an apple certified tech came out
to do the Xsan install. I started working here in July the same year, and then
in December of 2008 we started getting these zalloc / kalloc panics.

We worked with AppleCare for almost four months trying to resolve the
issue. We did the standard upgrades as we were told. We were told to
upgrade the memory in the MDC servers since we only had 4gig in each. We
handled that (12gig each now) and still had the trouble.

Around March of 2009, one of the AppleCare folks told us to try forcing the
maxvnodes setting to 90000. Here's a copy / paste from that email.

Make note, we never actually did this. The problem just quit happening
one day and it fell off the radar until it started happening again this past
December and has continued, last panic was yesterday.

*snip*
Chris,
After reviewing this and also doing some internal testing I wanted to make you aware of something.
In the last email I sent the change I had you make is ideal if the MDC always only has 4 GB of RAM which your
server does, or at least did at the time. If you decide to add more RAM to the MDC's you'll want to edit the /etc/rc.server
file differently.
Basically we want to ensure that:
kern.maxnodes=90000 which the change I sent to you does IF the server has 4GB of RAM. If you later add more RAM or even if you don't but think you might later you would want to change the file thusly:

What we want to do is comment out the whole block of code accept for
sysctl -w kern.maxproc=2500
sysctl -w kern.ipc.somaxconn=2500
sysctl -w kern.maxvnodes=90000
*/snip*

I called Apple back again today, referenced our old case number. They said
they have engineers working on this right now already. Apparently a number
of people are having this trouble. My net searches turn up a few other
reports of the same behavoir, often times on MDCs that are running volumes
used for smaller randomly accessed files instead of the larger files that
everyone says XSan does best with.

The tech I spoke with told me that they have been handing out that
maxvnodes 'fix' and having fair results. He sent me a different email with
modified directions that I'm pasting below.

*snip*
Subject: Steps to resolve zalloc/kalloc panic in Xsan on Mac OS X Server 10.5

1. While logged in as root, use your preferred text editor to add the following line to the end of the /etc/rc.server file:
2.
3. sysctl -w kern.maxvnodes=90000
4. Repeat step 1 on all Mac OS X Server v10.5 servers in the Xsan deployment.
5. Reboot the standby Metadata Controllers and Server clients.
6. Failover the volume to a standby Metadata Controller. To do this, open Xsan Admin, click Volumes in the left sidebar, select the volume(s), then choose "Force failover" from the Action Menu in the lower left corner.
7.Reboot the originally-active Metadata Controller.
8.Failover the volume back to the originally-active Metadata Controller, if desired, using the process in step 4 above.
*/snip*

That said, I don't work for Apple. I'm not sure what possible bad things could
happen by monkeying with this setting. They couldn't tell me if I should
update the 90,000 number to something larger since we added an additional
8gig of RAM. I do have an email in to AppleCare about it though referencing
the original ticket when I couldn't find even a tiny whisper about this on the
net.

I did go ahead and set the setting on the current backup MDC and got it
all rebooted. I made the settings change on the primary MDC but haven't
rebooted it yet. I figure I'll fail it over tonight or just wait for it to panic and
fail itself over. I'll try to remember to post back in a couple of weeks when
we figure out if this is a true fix.

*edited for misspeling Promise*


Last edited by memblin on Thu Jan 21, 2010 5:03 pm; edited 1 time in total
Back to top
View user's profile Send private message Visit poster's website
rstasel
Xsan Master
Xsan Master


Joined: 03 Aug 2007
Posts: 120

PostPosted: Thu Jan 21, 2010 1:09 am    Post subject: Reply with quote

as a note, on my Xsan 2.2.1 (and one 2.1.1) clients, running both 10.5 and 10.6, the setting for maxvnodes is higher than 90k.

On the machines with ≥8GB ram, it's set to 150k. On the clients with 4GB of ram, it's 120k. Even the PPC with 4GB of ram, running Xsan 2.1.1 is at 120k.

So something is odd here that your systems would have below a 90k maxvnode setting.

Only thing I can think is, are your MDC's Xserves? I know there are some tweeks Apple makes to some settings in the case of Xserves (I think because they figure no one is using them as desktop clients, they can increase some settings beyond the point you'd want if you were using the machine for "work")? In my case, all of my Xsan clients, and MDCs are Xserves (One G5, 5x 2006 Intel Xserves, and 2x 2009 Intel Xserves).
Back to top
View user's profile Send private message Visit poster's website
memblin
Been around the blocks
Been around the blocks


Joined: 22 Apr 2009
Posts: 20

PostPosted: Thu Jan 21, 2010 5:30 pm    Post subject: Reply with quote

We are running both MDCs on Xserve (intel arch) hardware and OS X 10.5.8

Our metadata controllers have been automatically configuring the maxvnodes
setting to 120000 at 4GB, when we upgraded to 12GB RAM it started setting
them to 150000 just like yours. The kernel panics stopped happening in
March of 2009 so we never got around to trying the 'fix' provided by Apple.

It was the guys in XSan Enterprise Support up at Apple that recommended
dropping the setting manually to 90,000 using the /etc/rc.server file. It didn't
make much sense to me either so we put off the maintenance a bit, and the
kernel panics just stopped so I never made the change.

In December of 2009 when we started having the same kernel panics again
we engaged Apple support again and they recommended the same thing. I
asked him why I would want to lower it and he said that's just how they'd
been handling the kernel panic problem that is apparent in some Xsan
installations running a combination of XSan 2 and MacOS X 10.5.x. I went
ahead and modified both of our MDCs on their recommendation and have
not seen any trouble yet. Will post again with my results if it does anything
other than resolve the issue with the kernel panics. heh

Going back over the post it looks like d4corp is running 10.6.2 so this is
probably not applicable for their current problems.

I did email Apple Support to verify that with 12Gigs of RAM 90000 is still the
*sweet spot* number. Here is that response from them.

*snip*
If you are still running 10.5.8 you would want to leave it at 90000 due to
the panic issue with Xsan and 10.5.x. If you have upgraded to 10.6 that
issue has been addressed and maxvnodes would be set automatically by the
OS. Here is an article that you can use for reference:

Mac OS X Server v10.6: Understanding process limits
http://support.apple.com/kb/HT3854
*/snip*

The original post makes me wonder if this has actually been fixed in 10.6 or
if they just haven't had enough people report in with the problem yet. I'm
hoping for fixed, my boss won't accept the "This is why we recommend
having two metadata controllers instead of just one." that Apple tried to give
us the first time we called on the issue back in 2008. *grin*
Back to top
View user's profile Send private message Visit poster's website
rstasel
Xsan Master
Xsan Master


Joined: 03 Aug 2007
Posts: 120

PostPosted: Thu Jan 21, 2010 10:33 pm    Post subject: Reply with quote

well, it IS a valid reason to have more than a single MDC. =)

Okay, that makes more sense. Though, I haven't seen any problems with Xsan 2.2.1 or 2.1.1 on my 10.5 machines, though, neither are MDCs.

Certainly let us know. 10.6 has been, so far, a wonderful upgrade in our environment. AFP performance is worlds better. And Xsan seems to run better too, even without turning on Native Extended Attributes (which I probably won't do until Summer).
Back to top
View user's profile Send private message Visit poster's website
d4corp
Could work for Apple
Could work for Apple


Joined: 07 Nov 2008
Posts: 50

PostPosted: Fri Jan 22, 2010 1:39 am    Post subject: Reply with quote

Great ! This post is finally living !

I've had two more failures this morning. More info on that asap.

I think the maxvnodes sysctl definitively makes sense. IMHO, it looks like spotlight indexing floods fsevents…

How much RAM do you have on the machine(s) that kernel panics ?
Do you have Spotlight enabled ?
Do you have a lot of files ? (especially small files, such as graphics or home folders ?)
Back to top
View user's profile Send private message Visit poster's website
memblin
Been around the blocks
Been around the blocks


Joined: 22 Apr 2009
Posts: 20

PostPosted: Fri Jan 22, 2010 1:49 pm    Post subject: Reply with quote

We have 12gigs of RAM in both MDCs and are on Xserve hardware.

This is from our hwmond.log for our primary, the secondary has the same
setup.

*/snip*
Server:
Host : <removed>
Model : Xserve2,1
Uptime : 1 minutes
OS version : Mac OS X Server 10.5.8 (9L34)
Processor : 1 x 2800 MHz
Memory : 12288 MB
BootROM : XS21.88Z.006C.B01.0712212323

Memory:
Memory Slot "BRANCH 0 CHANNEL 0/DIMM 1" : 2048MB, ECC DDR2 FB-DIMM, 800 MHz
Memory Slot "BRANCH 0 CHANNEL 1/DIMM 2" : 2048MB, ECC DDR2 FB-DIMM, 800 MHz
Memory Slot "BRANCH 1 CHANNEL 0/DIMM 3" : 2048MB, ECC DDR2 FB-DIMM, 800 MHz
Memory Slot "BRANCH 1 CHANNEL 1/DIMM 4" : 2048MB, ECC DDR2 FB-DIMM, 800 MHz
Memory Slot "BRANCH 0 CHANNEL 0/DIMM 5" : 1024MB, ECC DDR2 FB-DIMM, 800 MHz
Memory Slot "BRANCH 0 CHANNEL 1/DIMM 6" : 1024MB, ECC DDR2 FB-DIMM, 800 MHz
Memory Slot "BRANCH 1 CHANNEL 0/DIMM 7" : 1024MB, ECC DDR2 FB-DIMM, 800 MHz
Memory Slot "BRANCH 1 CHANNEL 1/DIMM 8" : 1024MB, ECC DDR2 FB-DIMM, 800 MHz
*/snip*

That is our memory setup on both MDCs.

We had Spotlight enabled until we got some bad tech support from Promise
where a tech miss-spoke and told us to just pull one of the controllers. After
some corruption do to that it broke our Spotlight and we haven't been able
to take the time to re-index 16TB of data.

We have a TON (over 8,000 files in some directories) of tiny files (web
graphics, jlts, and pre-press PDFs) on the single volume currently hosted
by our MDCs.

Last night was our maintenance night, I was able to go ahead and force a
failover to the MDC that was already modified according to Apples direction
to have maxvnodes=90000. I made the change on the primary after the
failover and rebooted it so we're running with the 'fix' in place now.

Do be careful if you try that setting in /etc/rc.server. It was recommended to
us by Apple since our MDCs are in the 10.5.x family and some Xsan installs
using 10.5.x utilizing their XSan to store many smaller files have been
randomly popping these kernel panics. They say this has been taken care of
in 10.6.x and there shouldn't be a problem because 10.6 adjust the
maxvnodes automatically depending on installed memory.

From reading the rc.server file in 10.5.8, you can see that 10.5 adjusted it
as well. I'll keep the threat informed of what we run into.
Back to top
View user's profile Send private message Visit poster's website
d4corp
Could work for Apple
Could work for Apple


Joined: 07 Nov 2008
Posts: 50

PostPosted: Sat Jan 23, 2010 4:48 am    Post subject: Reply with quote

Thanks for your reply…

That doesn't reassure me…

My MDC are 10.6.2 with 2GB of RAM. I have 699.401 files on the Xsan Volume… Spotlight is activated, but not working as I'm having a permission issue.

Code:
mdc1:~ administrateur$ find /Volumes/SANLaboutique/ -type f |wc -l
  699401


Could you launch this command as well and report your results ? It can take long - took me more than an hour.

The error I'm having is about FSevents - /var/log/kernel.log
Code:
Jan 22 20:45:22 mdc1 kernel[0]: add_fsevent: zalloc sez: 0
Jan 22 20:45:22 mdc1 kernel[0]: add_fsevent: event_zone info: 4096 0x0
Jan 22 20:45:22 mdc1 kernel[0]: add_fsevent: watcher 0x5093004: num dropped 0 rd  452 wr  452 q_size 1024 flags 0x0
Jan 22 20:45:22 mdc1 kernel[0]: add_fsevent: watcher 0x239ce004: num dropped 0 rd 3865 wr 3865 q_size 8192 flags 0xc
Jan 22 20:45:22 mdc1 kernel[0]: add_fsevent: watcher 0x239f8004: num dropped 225294 rd 1496 wr 1495 q_size 8192 flags 0x5
Jan 22 20:46:14 mdc1 kernel[0]: add_fsevent: event queue is full! dropping events (num dropped events: 1081; num events outstanding: 4067).
Jan 22 20:46:14 mdc1 kernel[0]: add_fsevent: kfse_list head 0x355522c0 ; num_pending_rename 29
Jan 22 20:46:14 mdc1 kernel[0]: add_fsevent: zalloc sez: 0

This repeats over and over.

Source code : http://fxr.watson.org/fxr/source/bsd/vfs/vfs_fsevents.c?v=xnu-1228#L623
http://www.opensource.apple.com/source/xnu/xnu-1456.1.26/bsd/vfs/vfs_fsevents.c

FSevents doc : http://www.booktou.com/node/148/0321278542/ch11lev1sec8.html#ch11lev2sec31

Spotlight doc : http://www.booktou.com/node/148/0321278542/ch11lev1sec8.html
Back to top
View user's profile Send private message Visit poster's website
rstasel
Xsan Master
Xsan Master


Joined: 03 Aug 2007
Posts: 120

PostPosted: Sat Jan 23, 2010 2:45 pm    Post subject: Reply with quote

d4corp:

If you do indeed mean that your MDCs have 2GB of ram, this is against the minimum requirements of Xsan, which require 2GB of RAM for all Xsan clients, plus an ADDITIONAL 2GB on the MDCs for each volume hosted. So your MDCs should have a minimum of 4GB of ram (assuming 1 volume hosted). I can certainly see that causing issues...

Hardware Requirements: http://support.apple.com/kb/HT3927

These have been the memory requirements since Xsan 2.0 was released.

Both my MDCs have 8GB of ram, and my SAN has approximately 6M files on it. You can easily tell this by selecting the SAN volume in Disk Utility, which will then show "number of files".

Spotlight is disabled for our setup. But, as I said, our MDCs are 10.6. Previously we were running 10.5.8 on our MDCs, and had not seen this error. Previously one MDC was Intel, the other was PPC. Then both were Intel, still running 10.5.8 with Xsan 2.1.1. We only updated to 10.6.2 and 2.2.1 when both were available around Christmas.
Back to top
View user's profile Send private message Visit poster's website
d4corp
Could work for Apple
Could work for Apple


Joined: 07 Nov 2008
Posts: 50

PostPosted: Sat Jan 23, 2010 8:46 pm    Post subject: Reply with quote

Thanks, you're right about the RAM, I misinterpreted the manual. Thanks for the link to the kbase.

As I told you, I'll put 8GB in each MDC, it should resolve the issue.

I'll keep you informed

Thanks
Back to top
View user's profile Send private message Visit poster's website
rstasel
Xsan Master
Xsan Master


Joined: 03 Aug 2007
Posts: 120

PostPosted: Sat Jan 23, 2010 8:58 pm    Post subject: Reply with quote

d4corp,

Great! Let us know. I imagine upping the ram will resolve the issue. But, I'm just guessing.
Back to top
View user's profile Send private message Visit poster's website
memblin
Been around the blocks
Been around the blocks


Joined: 22 Apr 2009
Posts: 20

PostPosted: Fri Mar 12, 2010 10:50 am    Post subject: Reply with quote

Just a quick follow up on what I had been told by Xsan Support to do to fix our issues with the kernel panics. Forcing the maxvnodes to 90000 did nothing for us we had another random kernel panic today same gig. Zalloc garbage collection goes crazy. I'm back on the phone with them again, figure they'll make us upgrade a few things then tell us again they don't know why it's happening. heh
Back to top
View user's profile Send private message Visit poster's website
Display posts from previous:   
Post new topic   Reply to topic    Xsanity Forums Forum Index -> Troubleshooting All times are GMT - 5 Hours
Goto page 1, 2  Next
Page 1 of 2

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group
Best Viewed on a Mac | Suggested Browser: Whatever floats yer boat.