On 02/10/2024 17:00, Dominik Derigs via Dnsmasq-discuss wrote:
Hey all,


having this configurable makes sense particularly thinking about the multitude of IoT devices often receiving not all that much attention from their manufacturers. A particular example is the camera Tapo C-310 which seems to be widely used and has often been reported in the Pi-hole forums to cause a huge amount of queries due to exactly this truncation method we are talking about here (when dnsmasq is used with DNSSEC, the query A time.nist.gov results in truncation).


Said Tapo camera is not behaving nicely and does not retry over TCP. Instead, it retries the same query roughly one second later, resulting in a whooping 86,400 identical queries, all being truncated. A workaround is querying this domain periodically manually over TCP but it is a rather ugly workaround.


If there would be an option to allow truncated content to remain, I would indeed find it very useful for situations like the one mentioned above. I confirmed that the camera could successfully reads the returned CNAME/A record. This is what I got when simply applying the submitted patch (but keeping the logging):


Oct  2 17:46:53 dnsmasq[1861633]: 25 127.0.0.1/45027 query[A] time.nist.gov from 127.0.0.1 Oct  2 17:46:53 dnsmasq[1861633]: 25 127.0.0.1/45027 forwarded time.nist.gov to 127.0.0.1#5335
Oct  2 17:46:53 dnsmasq[1861633]: 26 dnssec-query[DS] gov to 127.0.0.1#5335
Oct  2 17:46:53 dnsmasq[1861633]: 26 reply gov is DS for keytag 2536, algo 13, digest 2 Oct  2 17:46:53 dnsmasq[1861633]: 27 dnssec-query[DS] nist.gov to 127.0.0.1#5335 Oct  2 17:46:53 dnsmasq[1861633]: 28 dnssec-query[DNSKEY] gov to 127.0.0.1#5335 Oct  2 17:46:53 dnsmasq[1861633]: 28 reply gov is DNSKEY keytag 2536, algo 13 Oct  2 17:46:53 dnsmasq[1861633]: 28 reply gov is DNSKEY keytag 35496, algo 13 Oct  2 17:46:53 dnsmasq[1861633]: 27 reply nist.gov is DS for keytag 33751, algo 8, digest 2 Oct  2 17:46:53 dnsmasq[1861633]: 29 dnssec-query[DNSKEY] nist.gov to 127.0.0.1#5335 Oct  2 17:46:53 dnsmasq[1861633]: 29 reply nist.gov is DNSKEY keytag 18303, algo 8 Oct  2 17:46:53 dnsmasq[1861633]: 29 reply nist.gov is DNSKEY keytag 33751, algo 8 Oct  2 17:46:53 dnsmasq[1861633]: 30 dnssec-query[DS] glb.nist.gov to 127.0.0.1#5335 Oct  2 17:46:53 dnsmasq[1861633]: 30 reply glb.nist.gov is DS for keytag 56235, algo 7, digest 1 Oct  2 17:46:53 dnsmasq[1861633]: 30 reply glb.nist.gov is DS for keytag 4395, algo 7, digest 1 Oct  2 17:46:53 dnsmasq[1861633]: 31 dnssec-query[DNSKEY] glb.nist.gov to 127.0.0.1#5335
Oct  2 17:46:53 dnsmasq[1861633]: 31 reply glb.nist.gov is truncated[DNSKEY]
Oct  2 17:46:53 dnsmasq[1861633]: 25 127.0.0.1/45027 validation result is TRUNCATED
Oct  2 17:46:53 dnsmasq[1861633]: 25 127.0.0.1/45027 reply is truncated


For testing, "dig +ignore + notcp time.nist.gov" is handy as it is guaranteed to be truncated with dnsmasq and DNSSEC enabled if the domain isn't already in your local cache through a preceding TCP query.




I'm in the process of testing some code that fixes this problem. What's happening here is not that the answer to the original query is too large and truncated, it's that the answer to a subsidiary query needed to validate the original answer is too large (in this case DNSKEY glb.nist.gov) All existing dnsmasq releases return the answer to the original query with the truncated bit set, even if the answer in fact fits, just to cause the client to retry with TCP so that the subsidiary query can be done over TCP and avoid truncation.

The new code swaps to TCP for subsidiary DNSSEC queries without having to return an artificially truncated answer to the original query and, once the validation is complete, returns the validated answer over UDP. This is much cleaner, saves a round trip to the client, and avoids breakage with clients which fail to do TCP properly.

It's not the whole solution, however. If the answer to a query exceeds the truncation limit, then no client without TCP support can get an good answer. DNSSEC adds one more possible state. That's when the answer without DNSSEC records will fit untruncated, but the addition of signatures makes the answer too large. Dnsmasq passes on the query with the DO bit set to make upstream add the signature RRs and if that causes truncation then the truncated answer goes back to client. In theory, dnsmasq could switch to TCP, and get the validated answer. It then strips the RRSIG records from the answer, and if it then fits, returns it via UDP. Only if the answer still doesn't fit (or the client set the DO bit, and wants RRSIGS) is TCP fallback by the client required.

I'm minded to aee if the code I have can be extended to handle this state.

Simon.


Best,

Dominik


Am 10/2/24 um 12:30 AM schrieb Petr Menšík:

I think Simon has pointed out this is intentional. Partial reply is incomplete and for well behaving clients carries not useful information. It would use TCP anyway instead, therefore it adds just additional data.


I would consider clients not falling back to TCP as broken. TCP is not considered optional nowadays. Attempts to fix such clients by relying on incomplete responses instead are wrong.


It might make sense to allow enabling such behaviour by configuration, if you have broken software not able to workaround. But I would insist that software is broken and therefore sending incomplete responses should be only workaround for them, not something behaved by default.


Is that something using Alpine C library?


Cheers,
Petr


On 30/09/2024 06:39, Rahul Thakur via Dnsmasq-discuss wrote:
Hi Simon,

So what do you think of my reasoning for this patch? Do you agree?

Best regards,
Rahul Thakur
------------------------------------------------------------------------
*From:* Rahul Thakur <[email protected]>
*Sent:* 25 September 2024 15:29
*To:* Simon Kelley <[email protected]>; dnsmasq- [email protected] <[email protected]> *Subject:* Re: [Dnsmasq-discuss] [PATCH 1/1] forward.c: fix handling of truncated response
Hi Simon,

Thanks for responding to this patch, please find my justification for this patch as follows:

I think rfc 2181 is defining the behaviour for DNS server and not DNS proxy.

I am relying on and referring to rfc 5625 while making this change.

In section 4.4 (https://datatracker.ietf.org/doc/html/ rfc5625#section-4.4 <https://datatracker.ietf.org/doc/html/ rfc5625#section-4.4>), the rfc 5625 states,

   If a proxy must unilaterally truncate a response, then the proxy MUST
   set the TC bit.  Similarly, proxies MUST NOT remove the TC bit from
   responses.

Dnsmasq is ofcourse complying to this behaviour and not meddling with the TC bit while setting the answers to 0. But, if I read further section 4.4.1 (https://datatracker.ietf.org/doc/html/ rfc5625#section-4.4.1 <https://datatracker.ietf.org/doc/html/ rfc5625#section-4.4.1>),

 Whilst TCP transport is not strictly mandatory, it is supported by
   the vast majority of stub resolvers and recursive servers.

So, this indicates that it is not mandatory that the client ignores this truncated response. This is further supported by section 6.1.3.2 of rfc 1123 (https://datatracker.ietf.org/doc/html/ rfc1123#section-6.1.3.2). In paragraph 3 of the DISCUSSION in section 6.1.3.2, it states,

                 Whether it is possible to use a truncated answer
                 depends on the application.

Hence, when dnsmasq explicitly deletes the answers, then it deprives clients that do not fallback to TCP and are happy with the truncated response to be able to resolve their queries.
Fix such client to fall back to TCP instead of making dnsmasq to provide it. Or ensure it uses EDNS0 with buffer big enough to receive whole message.

To me, it sounds like a better strategy to forward the truncated response as is to the client and let the client decide what it wants to do rather than forcefully dropping the answers.

Best regards,
Rahul Thakur





------------------------------------------------------------------------
*From:* Dnsmasq-discuss <dnsmasq-discuss- [email protected]> on behalf of Simon Kelley <[email protected]>
*Sent:* 25 September 2024 13:39
*To:* [email protected] <dnsmasq- [email protected]> *Subject:* Re: [Dnsmasq-discuss] [PATCH 1/1] forward.c: fix handling of truncated response
I think that this is legitimate behaviour. RFC 2181 para 9 says

    Where TC is set, the partial RRSet that would not completely fit may
    be left in the response.  When a DNS client receives a reply with TC
    set, it should ignore that response, and query again, using a
    mechanism, such as a TCP connection, that will permit larger replies.

Which means the contents (or lack of them) of the answer, auth and
additional sections has to be ignored by the client anyway.

Do you have a standards reference which says otherwise? Test suites can
tell you either that behaviour has changed over releases or that
behaviour differs from other implementations. They cant tell you that
behaviour is correct.

There is a subtle reason for the code being as it is. Dnsmasq
has various functions which change the contents of a packet being
returned, and these can't reliably be applied to a truncated packet, so
data in a truncated packet may (for instance) disclose DNS data which
should be blocked.

The patch is, in any case, broken because it gratuitously removes the
call to the logging code.


Cheers,

Simon.

On 24/09/2024 11:01, Rahul Thakur via Dnsmasq-discuss wrote:
> From: Rahul Thakur <[email protected]>
>
> the handling of truncated reponse is broken in 2.90. The answers
> are removed before forwarding in case TC bit is set, which
> seems incorrect.
>
> test details-
> the regression was caught by a CDrouter run and this change fixes
> the regression.
> ---
>   src/forward.c | 7 -------
>   1 file changed, 7 deletions(-)
>
> diff --git a/src/forward.c b/src/forward.c
> index 10e7496..c893d84 100644
> --- a/src/forward.c
> +++ b/src/forward.c
> @@ -782,13 +782,6 @@ static size_t process_reply(struct dns_header *header, time_t now, struct server
>        server->flags |= SERV_WARNED_RECURSIVE;
>       }
>
> -  if (header->hb3 & HB3_TC)
> -    {
> -      log_query(F_UPSTREAM, NULL, NULL, "truncated", 0);
> -      header->ancount = htons(0);
> -      header->nscount = htons(0);
> -      header->arcount = htons(0);
> -    }
>
>     if  (!(header->hb3 & HB3_TC) && (!bogusanswer || (header->hb4 & HB4_CD)))
>       {

--
Petr Menšík
Software Engineer, RHEL
Red Hat,https://www.redhat.com/
PGP: DFCF908DB7C87E8E529925BC4931CA5B6C9FC5CB

_______________________________________________
Dnsmasq-discuss mailing list
[email protected]
https://lists.thekelleys.org.uk/cgi-bin/mailman/listinfo/dnsmasq-discuss

_______________________________________________
Dnsmasq-discuss mailing list
[email protected]
https://lists.thekelleys.org.uk/cgi-bin/mailman/listinfo/dnsmasq-discuss


_______________________________________________
Dnsmasq-discuss mailing list
[email protected]
https://lists.thekelleys.org.uk/cgi-bin/mailman/listinfo/dnsmasq-discuss

Reply via email to