TL;DR I’m getting DNS resolution timeouts. My problem feels like an issue with my ISP (Spectrum residential) but seeking some guidance, tips-n-tricks, etc for how to troubleshoot this. If it is an ISP issue, how to raise this with them so as not to go round and round with their tier 1 support?
Details
I’m getting intermittent “Internet” issues on my clients (laptops, tablets, etc) and found pdns_recursor throwing errors on resolve timeouts. My setup looks like client → pihole (192.168.1.5) → vyos → Internet. For VyOS, I’m basically running with the quick start guides DNS setup:
me@vyos# show service dns forwarding
allow-from 192.168.1.0/24
allow-from 10.64.0.0/16
cache-size 0
listen-address 192.168.1.1
listen-address 10.64.200.1
listen-address 10.64.20.1
listen-address 10.64.30.1
listen-address 10.64.40.1
listen-address 10.64.60.1
listen-address 10.64.150.1
listen-address 10.64.0.1
listen-address 10.64.50.1
The 10.64 space is a to-be realized VLAN segmentation. ATM everything is operating off the 192.168.1.0/24 space.
My setup has been operating in it’s current running config for 2 months without this issue, which is further making me think this is something with Spectrum.
me@vyos# run show system image
The system currently has the following image(s) installed:
1: 1.4-rolling-202108221610
2: 1.4-rolling-202107010537 (default boot) (running image)
3: 1.4-rolling-202105090417
Example errors I’m seeing:
Sep 2 09:59:28 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:39105 during resolve of 'xmpp013.zoom.us' because: Too much time waiting for ns-1772.awsdns-29.co.uk|A, timeouts: 2, throttles: 3, queries: 25, 7742msec
Sep 2 09:59:28 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:39105 during resolve of 'xmpp013.zoom.us' because: Too much time waiting for ns-1772.awsdns-29.co.uk|A, timeouts: 2, throttles: 0, queries: 22, 7754msec
Sep 2 09:59:29 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:41095 during resolve of 'eastus2-prod-2.notifications.teams.microsoft.com' because: Too much time waiting for cloudapp.azure.com|A, timeouts: 3, throttles: 1, queries: 19, 7184msec
Sep 2 09:59:29 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:41095 during resolve of 'eastus2-prod-2.notifications.teams.microsoft.com' because: Too much time waiting for cloudapp.azure.com|A, timeouts: 3, throttles: 1, queries: 19, 7215msec
Sep 2 09:59:29 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:41095 during resolve of 'eastus2-prod-2.notifications.teams.microsoft.com' because: Too much time waiting for cloudapp.azure.com|A, timeouts: 2, throttles: 1, queries: 21, 7166msec
Sep 2 09:59:29 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:41095 during resolve of 'eastus2-prod-2.notifications.teams.microsoft.com' because: Too much time waiting for cloudapp.azure.com|A, timeouts: 3, throttles: 2, queries: 20, 7170msec
Sep 2 09:59:30 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:53402 during resolve of 'ic3.events.data.microsoft.com' because: Too much time waiting for eastus.cloudapp.azure.com|A, timeouts: 3, throttles: 2, queries: 18, 7514msec
Sep 2 09:59:30 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:53402 during resolve of 'ic3.events.data.microsoft.com' because: Too much time waiting for eastus.cloudapp.azure.com|A, timeouts: 2, throttles: 5, queries: 23, 7524msec
Sep 2 09:59:30 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:56607 during resolve of 'ic3.events.data.microsoft.com' because: Too much time waiting for japaneast.cloudapp.azure.com|A, timeouts: 2, throttles: 0, queries: 20, 7529msec
Sep 2 09:59:30 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:56607 during resolve of 'ic3.events.data.microsoft.com' because: Too much time waiting for japaneast.cloudapp.azure.com|A, timeouts: 2, throttles: 3, queries: 23, 7533msec
Sep 2 09:59:30 vyos pdns_recursor[2968]: Sending SERVFAIL to 192.168.1.5:54315 during resolve of 'xmpp003.zoom.us' because: Too much time waiting for ns-888.awsdns-47.net|A, timeouts: 1, throttles: 2, queries: 23, 7159msec
My assumption is that I can’t just send those SERVFAIL messages to Spectrum without offering some additional proof their mucking with my DNS queries?