User Functions
Don't have an account yet? Sign up as a New User
Who's Online
Guest Users: 9
|
| View previous topic :: View next topic |
| Author |
Message |
svavaroe partially protected

Joined: 08 Aug 2006 Posts: 8
|
Posted: Tue Oct 10, 2006 10:35 am Post subject: XSAN crashing in time to time |
|
|
Hi.
I was wandering if anyone outthere would know what's crashing my XSAN in time to time.
My current hardware and software configuration is :
1x XRaid Box with :
First Controller = 2x 250GB Mirrored for MetaData
Second Controller = 7x 500GB RAID5 for VolumeData
1x Brocade Silkworm 3250 Series (3252) Fiber Switch
As for now, I'm migrate'ing from previous setup, then I have the XSAN Controller, and Client
that serves AFP on the same machine. Later on, I'l have them splitted, e.g. seperate Controller and AFP server.
XSAN Controller and Client :
PowerMac G5 2.0Ghz
1.5GB RAM
ATTO Tech Celerity FC-22XH (Dual-channel 2-Gb FC) (connected to Brocade Switch)
Connected to 1Gb on HP Procurve 2848 gigabit Switch (LAN)
XSAN 1.4
OS X Server - 10.4.8
And in the next weeks I'l have the XSAN and AFP server on seperate Apple XServes with 5GB in RAM
after the XSAN migration.
So the problem is, every now and then the XSAN just crashes.
Crash = The Volume dosn't appear any more on the controller,AFP server.
In my system.log this is whats logged just before the crash
| Code: |
Oct 10 12:36:33 servername fsm[427]: Xsan FSS 'DataVOL[1]': PANIC: /Library/Filesystems/Xsan/bin/fsm "OpHangLimitSecs exceeded VOP-Setattr 183.28 secs Conn[1] Thread-0x186de00 Pqueue-0x3031c8 Workp-0x492
Oct 10 12:36:33 servername KernelEventAgent[35]: tid 00000000 received VQ_NOTRESP event (1)
Oct 10 12:36:33 servername fsm[427]: PANIC: /Library/Filesystems/Xsan/bin/fsm "OpHangLimitSecs exceeded VOP-Setattr 183.28 secs Conn[1] Thread-0x186de00 Pqueue-0x3031c8 Workp-0x4923618 MsgQ-0x4923608 Msg-
Oct 10 12:36:33 servername fsm[427]: Xsan FSS 'DataVOL[1]': PANIC: wait 3 secs for journal to flush
Oct 10 12:36:33 servername KernelEventAgent[35]: tid 00000000 type 'acfs', mounted on '/Volumes/DataVOL', from '/dev/disk3', not responding
Oct 10 12:36:33 servername KernelEventAgent[35]: tid 00000000 found 1 filesystem(s) with problem(s)
Oct 10 12:36:34 servername kernel[0]: Reconnecting to FSS 'DataVOL'
Oct 10 12:36:34 servername kernel[0]: FSS on host 10.100.0.10 not responding, retrying...
Oct 10 12:36:36 servername fsm[427]: Xsan FSS 'DataVOL[1]': PANIC: aborting threads now.
Oct 10 12:36:57 servername servermgrd: xsan: [38/302DA0] ERROR: get_fsmvol_at_index: Could not connect to FSM because Connect to FSM failed - Connection refused
Oct 10 12:36:57 servername servermgrd: xsan: [38/302DA0] ERROR: get_quotas_for_fsmvol_named(DataVOL): Could not connect to FSM because Connect to FSM failed - Connection refused
|
Can anyone out there describe, or has the knowledge of that "OpHangLimitSecs" mean?
Are there any sucess stories, and some configuration that I need to do on a Brocade Silkworm Fiber Switches. ?
This is driving me crazy...
Thanks alot people.
Best regards,
Svavaroe
Reykjavik - Iceland[/code] |
|
| Back to top |
|
 |
|
|
You cannot post new topics in this forum You cannot reply to topics in this forum You cannot edit your posts in this forum You cannot delete your posts in this forum You cannot vote in polls in this forum
|
Powered by phpBB © 2001, 2005 phpBB Group
|
|