We've been struggling with an odd AD issue at one site for days now, and I'm running out of ideas to try. I'm hoping someone has seen something like this in the past and can point me in a direction. I suspect DNS, but haven't been able to find anything definitive.
We have one subnet that every Mac client cannot look up AD accounts, even though they appear bound to AD. Affected machines are both Macbook Airs and Minis, and either 10.12.3 or 10.12.4. If these users take their computers to another subnet, they have no issues, and anyone who visits the site with this subnet has the problem while there.
Time sync is not an issue, and our usual tricks of restarting opendirectoryd and/or flushing directory and DNS caches doesn't work. Unbinding and rebinding also does not resolve the issue; the machines unbind and rebind fine, but are unable to authenticate AD users or even look up AD accounts. The machines also appear to be fine in Active Directory, with the proper machine record in the proper OU, and appear correctly in Windows DNS. We use Infoblox (managed by another State agency) for DHCP, and our own Windows DNS for DNS at all our sites. This subnet appears to be configured identically to all our other (40 ish) subnets.
I can sometimes work around the issue by manually configuring DNS servers on the client, but I'm not convinced that that hasn't been a coincidence, since I can't recreate that 100% of the time.
On the clients, aside from not being able to communicate with AD:
- opendirectoryd is approx 100% CPU
- I get "Invalid Path" and "DS Error: -14009 (eDSUnknownNodeName)" when trying to use dscl to browse AD
- User lookups with id result in "unknown user"
- If I crank up odutil logging, I see "Module: ActiveDirectory - DDNS update - failure -- '10.118.185.201' - exit status 2 -- ; TSIG error with server: tsig verify failure" in the logs
We've pretty much exhausted internal resources, as well as help we've brought in from the group that handles Infoblox. We've also tagged in a Microsoft AD/DNS consultant who is also at a loss.
Has anyone here seen behavior like this in the past? I've pasted a log file [LINK REMOVED] that shows me rejoining the domain, then errors.