| Author |
Message |
BenT Knows DNS is the answer

Joined: 12 Jun 2007 Posts: 39
|
Posted: Tue Dec 22, 2009 5:54 pm Post subject: XSAN 2 Admin - Computers unreachable or offline |
|
|
I've got this problem at a few sites, and I'm wondering if anyone has a trick to reliably fix it:
- Computers in XSAN2 admin show "UNREACHABLE OR OFFLINE"
- All computers can be controlled via Screensharing, can ping on both MD and LAN IPs, and can be resolved by hostname.
- If you Force Remove the computer from the SAN, and then add it back by specifying IP, it appears as READY (until you quit XSAN Admin and relaunch).
- A computer showing as UNREACHABLE OR OFFLINE will still mount the XSAN volume(s) on each boot without error.
- Only some client machines are affected, but the same machines show the problem on both MDCs.
It seems to be something DNS related, as it only happens on sites where the XSAN has its own Apple DNS service but there is also a corporate DNS server present on the same network.
I was hoping there is some host file entry or some plist tweak to make XSAN Admin stop "forgetting" how to communicate with these clients. |
|
|
 |
abstractrude Xsan Master

Joined: 13 Mar 2008 Posts: 864
|
Posted: Tue Dec 22, 2009 6:40 pm Post subject: |
|
|
You're on static addresses, right?
Clients don't really need entries in DNS.... |
|
|
 |
BenT Knows DNS is the answer

Joined: 12 Jun 2007 Posts: 39
|
Posted: Tue Dec 22, 2009 6:43 pm Post subject: |
|
|
Yes, all the XSAN client systems have static addresses for both the public LAN interface and the Metadata interface.
It seems to be Bonjour fighting with DNS that causes the issue.
Not sure if it makes a difference, but if you look at Inspector for the clients, it always lists their metadata interface address for IP, but their public LAN DNS name. |
|
|
 |
abstractrude Xsan Master

Joined: 13 Mar 2008 Posts: 864
|
Posted: Tue Dec 22, 2009 6:49 pm Post subject: |
|
|
| Are your controllers both pointing to the same DNS server, and is that server forwarding requests up the tree properly? Why don't you remove the entries for the clients? I don't put entries in for clients and have never had issues. That said, make sure you keep DNS working so the controllers resolve both ways (forward and reverse). |
|
|
 |
BenT Knows DNS is the answer

Joined: 12 Jun 2007 Posts: 39
|
Posted: Tue Dec 22, 2009 6:53 pm Post subject: |
|
|
Yes, both controllers are pointing to the same DNS servers, with the same DNS search order.
I suspect the problem is some sort of conflict between the entries in the local Mac-based DNS servers and the Corporate servers.
The local servers have the zones xsan.companyname.com and metadata.companyname.com, while the corporate server just hosts companyname.com.
However, there are entries for the client machines with the same hostname and IPs in both the xsan.companyname.com and companyname.com zones.
I'll give your suggestion a go by removing the clients completely from the local DNS servers, and see if the behaviour changes. |
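
One way to see which client names are defined in more than one zone is to compare the zones' records side by side. The following is only a rough sketch: the zone names, hostnames, and addresses are made-up placeholders, and it assumes you have already exported each zone's A records into a simple name-to-IP mapping by some other means.

```python
from collections import defaultdict


def find_overlapping_hosts(zones):
    """Return {short_name: {zone: ip}} for names that appear in more than one zone.

    `zones` maps a zone name to a dict of {short hostname: IP address}.
    Names are compared case-insensitively, as DNS does.
    """
    seen = defaultdict(dict)
    for zone, records in zones.items():
        for name, ip in records.items():
            seen[name.lower()][zone] = ip
    return {name: by_zone for name, by_zone in seen.items() if len(by_zone) > 1}


if __name__ == "__main__":
    # Hypothetical data for illustration only.
    example_zones = {
        "xsan.example.com": {"mdc1": "192.0.2.1", "edit1": "192.0.2.11"},
        "example.com": {"edit1": "192.0.2.11", "mail": "192.0.2.50"},
    }
    for name, by_zone in sorted(find_overlapping_hosts(example_zones).items()):
        print(name, by_zone)
```

Any name this reports is a candidate for removal from one of the zones, which is the experiment described above.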
|
|
 |
JesusAli Xsan Master

Joined: 25 Jul 2008 Posts: 151
|
Posted: Thu Dec 24, 2009 2:00 pm Post subject: DNS Worksheet Thread |
|
|
Hello, a little bit back, Aaron Freimark posted a great "DNS Worksheet" in a thread where someone seemed to be having a DNS problem.
He wanted to try to fix it with Host Config files, but the general consensus was that it was better to NAIL DOWN all DNS first because it would be easier to maintain in the long run.
Here is that Thread:
http://www.xsanity.com/forum/viewtopic.php?t=6123
Aaron's Worksheet is about three-quarters of the way down.
Good Luck. If you run the worksheet, please tell us what you find. |
|
|
 |
BenT Knows DNS is the answer

Joined: 12 Jun 2007 Posts: 39
|
Posted: Wed Jan 06, 2010 6:49 pm Post subject: |
|
|
Unfortunately this problem is still persisting at one site.
DNS servers are just the 2 MDCs (no public DNS servers in the mix at all), all tests on Aaron's worksheet return expected values (and I also ran through the client names/IPs and these return correct too).
3 of the 7 client systems show up as UNREACHABLE OR OFFLINE in XSAN Admin, yet their System Preferences values match the MDCs and all other clients.
If you remove them with FORCE REMOVE then add them back to XSAN Admin they show up as READY and can be managed as normal..... but only until you quit XSAN Admin and re-launch it, then you get the UNREACHABLE OR OFFLINE status again.
The dig results for the problem clients are correct for both the forward and reverse lookups.
So right now it looks like DNS setup is perfect, but XSAN Admin is still having troubles....
I'm not sure if this is related, and I've seen this at multiple XSAN2 sites, but when you do any operation in XSAN Admin, the first entry in the computers list gets duplicated. After a while you get a lot of duplicates. The only way to clear this is to disconnect the XSAN Admin session, and set it up again through the wizard. |
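
The forward and reverse lookups mentioned in this post can also be repeated from a script on the machine running XSAN Admin. This is a minimal sketch, not a replacement for dig: it goes through the operating system's resolver (which may consult sources other than the DNS servers), whereas dig queries DNS directly, so a difference between the two results may itself be a useful clue. The function name and the result layout are my own.

```python
import socket
import sys


def check_forward_reverse(hostname,
                          forward=socket.gethostbyname,
                          reverse=socket.gethostbyaddr):
    """Resolve hostname to an IP, resolve that IP back, and compare the names."""
    result = {"hostname": hostname, "ip": None,
              "reverse_name": None, "consistent": False}
    try:
        ip = forward(hostname)
        result["ip"] = ip
        # gethostbyaddr returns (primary name, aliases, addresses).
        reverse_name = reverse(ip)[0]
        result["reverse_name"] = reverse_name
    except OSError as exc:  # covers socket.gaierror and socket.herror
        result["error"] = str(exc)
        return result
    # Ignore case and any trailing dot when comparing the two names.
    result["consistent"] = (
        reverse_name.rstrip(".").lower() == hostname.rstrip(".").lower()
    )
    return result


if __name__ == "__main__":
    # Pass fully qualified client names on the command line.
    for name in sys.argv[1:]:
        print(check_forward_reverse(name))
```

Running it with the fully qualified names of both a working and a problem client makes it easy to compare the two results.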
|
|
 |
nrausch Xsan Master

Joined: 14 Sep 2007 Posts: 202
|
Posted: Wed Jan 06, 2010 11:30 pm Post subject: |
|
|
Ben,
Just curious if the client machines were cloned?
I had a very similar issue (especially with the repeated listings) a while back. I found out the client machines had been cloned from an image that had xsan client software installed. Removing all the clients and reinstalling xsan client software individually on clients solved it...
The other thing to check would be local firewalls on clients, and any switches that may be blocking necessary ports for xsan admin... |
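
A quick way to test for a blocked port is to attempt a TCP connection from the admin machine to the client. The sketch below assumes nothing about which ports XSAN Admin actually uses (the thread does not say); the host and port are supplied on the command line and are placeholders you would need to fill in.

```python
import socket
import sys


def port_is_reachable(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # refused, timed out, unreachable, or name lookup failed
        return False


if __name__ == "__main__":
    if len(sys.argv) != 3:
        print("usage: portcheck.py HOST PORT")
    else:
        host, port = sys.argv[1], int(sys.argv[2])
        state = "reachable" if port_is_reachable(host, port) else "NOT reachable"
        print(f"{host}:{port} is {state}")
```

A failure here only shows that the TCP connection could not be made; it does not distinguish a firewall from a service that is not running.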
|
|
 |
JesusAli Xsan Master

Joined: 25 Jul 2008 Posts: 151
|
Posted: Thu Jan 07, 2010 12:35 am Post subject: |
|
|
YES! I had the same problem in my setup when two labs were imaged from a master that had Xsan activated before duplication!
The culprit is a particular file inside the /Library/Filesystems/Xsan/config/ directory (I'm away from an Xsan right now, so double-check that path).
The file's name is "UUID" (short for "universally unique identifier"). You can trash it and restart the computer, and a new UUID file will be generated.
Please let us know if that was the culprit! |
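
Before trashing anything, it may be worth confirming that two machines really do share an identifier. The sketch below assumes you have already gathered the contents of each client's UUID file by some other means (for example with Apple Remote Desktop); the hostnames and values shown are invented for illustration.

```python
from collections import defaultdict


def find_shared_uuids(uuid_by_host):
    """Return {uuid: [hosts]} for every UUID reported by more than one host."""
    hosts_by_uuid = defaultdict(list)
    for host, uuid_text in uuid_by_host.items():
        # Normalise whitespace and case so trivial differences are ignored.
        hosts_by_uuid[uuid_text.strip().lower()].append(host)
    return {u: sorted(hosts) for u, hosts in hosts_by_uuid.items() if len(hosts) > 1}


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    collected = {
        "lab1-01": "AAAAAAAA-0000-0000-0000-000000000001\n",
        "lab1-02": "aaaaaaaa-0000-0000-0000-000000000001",
        "lab1-03": "AAAAAAAA-0000-0000-0000-000000000003",
    }
    for uuid_value, hosts in find_shared_uuids(collected).items():
        print(uuid_value, "is shared by", ", ".join(hosts))
```

An empty result would suggest that cloning is not the cause and the UUID file can be left alone.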
|
|
 |
BenT Knows DNS is the answer

Joined: 12 Jun 2007 Posts: 39
|
Posted: Thu Jan 07, 2010 7:34 am Post subject: |
|
|
2 of the clients showing the problem are running server and were not cloned.
All the other clients were from a master SOE - but the SOE did NOT have XSAN software installed (it was a post-SOE step).
I'll run some tests trashing the UUID file - thanks for the suggestion. |
|
|
 |
peterk Could work for Apple

Joined: 18 Mar 2008 Posts: 50
|
Posted: Mon Jan 18, 2010 3:48 pm Post subject: |
|
|
Hi Ben,
Did you find a fix for your problem? I had a similar problem today: two fresh XServer installs (10.6.2, XS 2.2.1), DNS records on a separate machine.
The first weird thing I noticed was that when I ran changeip -checkhostname, changeip told me that the host name "mainMDC" doesn't match the DNS name, which is "mainmdc". I checked DNS and there was a mainMDC record, and I am pretty sure I never used the string mainmdc anywhere (mainMDC is the convention I have been using for years ...). I fixed it with changeip, checked again, and it looked OK. I also noticed that the installer installed and activated DNS on both MDCs (this never happened on 10.5). So I stopped the DNS services, restarted, checked the DNS names, launched XSAN Admin, and saw the same problem. I am pretty sure my DNS is working correctly.
I had no time to search further, but I'm curious if you have some news on this issue.
Thank you.
Peter |
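
On the "mainMDC" versus "mainmdc" point: DNS itself compares names without regard to case, so the two spellings refer to the same record as far as a DNS server is concerned. How changeip compares them internally is not something this thread establishes. The small sketch below only illustrates the difference between an exact string comparison and a DNS-style one; the function name is my own.

```python
def same_dns_name(a, b):
    """Compare two host names the way DNS does: ignoring case and a trailing dot."""
    def normalize(name):
        return name.rstrip(".").lower()
    return normalize(a) == normalize(b)


if __name__ == "__main__":
    configured = "mainMDC"
    from_dns = "mainmdc"
    print("exact match:    ", configured == from_dns)
    print("DNS-style match:", same_dns_name(configured, from_dns))
```

If a tool reports a mismatch only because of letter case, keeping the configured host name in lower case avoids the question entirely.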
|
|
 |
BenT Knows DNS is the answer

Joined: 12 Jun 2007 Posts: 39
|
Posted: Mon Jan 18, 2010 5:43 pm Post subject: |
|
|
Unfortunately not yet.
When restarting XSAN Admin, I still get a few of the clients reporting UNREACHABLE OR OFFLINE, and one of the clients gets multiple entries in the COMPUTERS list.
I have to FORCE REMOVE the problem machines, then re-add them, then XSAN Admin works for a time with all machines showing READY.
I'm suspecting some sort of UUID file problem, but I've yet to go through every machine to strip and re-create the file. |
|
|
 |
JesusAli Xsan Master

Joined: 25 Jul 2008 Posts: 151
|
Posted: Tue Jan 19, 2010 1:54 pm Post subject: |
|
|
After you Force Remove a station, I recommend running the latest version of Xsan Uninstaller on it. Completely wipe out the Xsan file system on the station.
When you download the Xsan 2.2 install CD, you can right click the installer and get to the Xsan installer pkg. Then also download the Xsan File System Update pkg. You can push them both out to station(s) with Apple Remote Desktop.
btw, you may be able to run the Uninstaller that way too. |
|
|
 |
cthomasquinlan Been around the blocks

Joined: 20 Jan 2010 Posts: 21
|
Posted: Wed Jan 20, 2010 1:21 pm Post subject: |
|
|
| It sounds simplistic, but I've resolved this exact issue before by removing all Keychain Access entries for the MDCs and clients; if they were entered improperly at some point, or have changed, this could resolve the issue. I've had DNS working, everything pinging correctly, and machines authenticating initially, but upon relaunch some came back UNREACHABLE OR OFFLINE. The steps: close Xsan Admin, remove all MDC and client entries in Keychain Access, then relaunch Xsan Admin and reauthenticate the machines, creating new entries. |
|
|
 |
|