Re: [Doh] A question on the mix of DNS and HTTP semantics

Patrick McManus <pmcmanus@mozilla.com> Sun, 18 March 2018 09:22 UTC

Return-Path: <pmcmanus@mozilla.com>
X-Original-To: doh@ietfa.amsl.com
Delivered-To: doh@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BBE46127241 for <doh@ietfa.amsl.com>; Sun, 18 Mar 2018 02:22:46 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 2.101
X-Spam-Level: **
X-Spam-Status: No, score=2.101 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, RCVD_IN_SBL_CSS=3.335, SPF_SOFTFAIL=0.665] autolearn=no autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id POGDxFgOr7BW for <doh@ietfa.amsl.com>; Sun, 18 Mar 2018 02:22:44 -0700 (PDT)
Received: from linode64.ducksong.com (linode6only.ducksong.com [IPv6:2600:3c02::f03c:91ff:fe6e:e8da]) by ietfa.amsl.com (Postfix) with ESMTP id 9423C1200FC for <doh@ietf.org>; Sun, 18 Mar 2018 02:22:44 -0700 (PDT)
Received: from mail-ot0-f179.google.com (mail-ot0-f179.google.com [74.125.82.179]) by linode64.ducksong.com (Postfix) with ESMTPSA id 7ED003A021 for <doh@ietf.org>; Sun, 18 Mar 2018 05:22:39 -0400 (EDT)
Received: by mail-ot0-f179.google.com with SMTP id y11-v6so14521314otg.0 for <doh@ietf.org>; Sun, 18 Mar 2018 02:22:39 -0700 (PDT)
X-Gm-Message-State: AElRT7GyXOp+kc56wvu4brMGuFEZVbTNM3sPK4v/2Bkc4zfc2kH2+kaI j7yKTyVDaa2niUKh79gMe9zdWCkTuLfh0VZeTRQ=
X-Google-Smtp-Source: AG47ELt//Sbc66Ht1q+nYpie/feCmdwEFPJLryGtLy5up7OTvRs6MqO1l8ZKkxKePp6Gi7Z66A4rm9NOGUWW46kSyGI=
X-Received: by 2002:a9d:213c:: with SMTP id i57-v6mr3178402otb.85.1521364959194; Sun, 18 Mar 2018 02:22:39 -0700 (PDT)
MIME-Version: 1.0
Received: by 10.74.66.212 with HTTP; Sun, 18 Mar 2018 02:22:38 -0700 (PDT)
In-Reply-To: <CA+9kkMB7awRfW9jUmY9Q-1p+w3VLtpG5DxhF3s7Q58nEMZeX3w@mail.gmail.com>
References: <CA+9kkMB7awRfW9jUmY9Q-1p+w3VLtpG5DxhF3s7Q58nEMZeX3w@mail.gmail.com>
From: Patrick McManus <pmcmanus@mozilla.com>
Date: Sun, 18 Mar 2018 09:22:38 +0000
X-Gmail-Original-Message-ID: <CAOdDvNpt-d+8epv80q4eofG4X-K83=zJdmFV67Z2z+4GNOQwxw@mail.gmail.com>
Message-ID: <CAOdDvNpt-d+8epv80q4eofG4X-K83=zJdmFV67Z2z+4GNOQwxw@mail.gmail.com>
To: Ted Hardie <ted.ietf@gmail.com>
Cc: doh@ietf.org
Content-Type: multipart/alternative; boundary="000000000000f776f00567ac61f8"
Archived-At: <https://mailarchive.ietf.org/arch/msg/doh/e1LavZ14XvwmB9mpxaiVEN_0_Zc>
Subject: Re: [Doh] A question on the mix of DNS and HTTP semantics
X-BeenThere: doh@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: DNS Over HTTPS <doh.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/doh>, <mailto:doh-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/doh/>
List-Post: <mailto:doh@ietf.org>
List-Help: <mailto:doh-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/doh>, <mailto:doh-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 18 Mar 2018 09:22:47 -0000

Hi Ted, thanks for writing this down. I think I understand it a bit better
now.

I think the most relevant point is that DoH is the use of HTTP as a
substrate. As a user of HTTP RFC7231 is very relevant (HTTP status codes)


On Sat, Mar 17, 2018 at 5:42 PM, Ted Hardie <ted.ietf@gmail.com> wrote:

>   In particular, the HTTP semantics for redirection (e.g. HTTP response
> 307 or 308) and client error (e.g. HTTP response 403 or 451) may be
> difficult to map back into DNS errors when the target is a traditional DNS
> client.
>
>
7231 defines 2xx as success. If your code is not 2xx then your DoH request
did not succeed and you can feel confident in saying that you do not have a
dns response to the dns request that was made in the http request.
Conversely, a 2xx response needs to carry a DNS reply to your DNS request.
(now that can be a DNS failure - nxdomain, etc, but the 2xx at the HTTP
level is saying a valid DNS response to your request is contained therein..
i.e. DoH has succeeded).

non 2xx levels of response do not carry the answer to the DNS query. The
message in those responses is basically just FYI. HTTP allows you to put
any media type you want in there, so I'm not going to tell you you cannot
put DNS information in the body of a 404 if you think that's good FYI, but
I am going to say the body of a 4xx is not the dns answer to the DoH
request that generated it.

A client may or may not be able to go further based on the response code.
e.g. a 3xx code can obviously be taken further with another request, as can
401, or maybe even 405. There is a general expectation that clients that
can act on http response codes other than 2xx do so, but its their
decision. So I would say all HTTP clients, including DoH clients but not in
any special way, are expected to follow redirects within the scope of how
the particular redirect is defined, and the message body of the [345]xx is
not very interesting.

As noted above, if you don't end this process with a 2xx you don't have a
DNS response to the DNS query of the original request - the transport has
failed. I would say you should map that into the traditional DNS in the
same fashion as any other transport failure.. e.g. UDP timing out, or TCP
generating RST, or TLS failing verification. An HTTP non-success isn't
going to give you rich DNS information.. if the failure is at the DNS level
and not the DOH level, then the DNS failure can be carried in a 2xx HTTP
response.

I hope all that helps. I hope that's what things like BCP56bis are aimed at
- we don't want to write down all the HTTP rules for every application that
uses HTTP as a substrate each time. And of course I would expect that new
DoH clients would reuse old HTTP stacks where possible as they should
already encapsulate these rules.


> To take an example from the redirection side, if a client has configured
> an ordered list of recursive resolvers and gets a redirect from the DOH
> server, it could theoretically either follow the redirect or move down its
> own list (treating the redirect as an offered member for the list, but
> sorted below the current list).  The behavior for that might be influenced
> by the body included in the response. A 307 with a response body containing
> the UDP wire format DNS response for SRVFAIL, for example, might cause the
> DNS client to move down the ordered list, where a 307 with no response body
> might be handled entirely within HTTP's semantics, prior to returning a
> message to the DNS client code.  (Note that the choice between these two
> may result in different privacy considerations, especially if resolvers
> later in the list do not use DOH or DNS over TLS).
>

The udp body in the 307 that you posit is not the dns response to the
original dns question it is just FYI and doesn't have a programmatic
interpretation in the way the 307 and the Location response header do. The
expectation is that if you would like to complete that original request you
should make a new request to the redirected location. Aborting or pushing a
new approach on top of the stack is fine, but I feel like you feel you're
getting more information from that 307 than you really are. All you know is
its in progress and how to continue.



> Similarly, it was not clear to me whether a response like 451 could
> contain a UDP wireformat body and, if so, what it would be.  If it contains
> no body, the DNS implementation might continue attempting to query for the
> information..  If it contains a REFUSED RCODE, in contrast, it would see a
> policy-based error.  The document simply provides very little guidance on
> how to handle cases like these.
>
>
Similarly to above, 451 is defined by 7225. DoH seeks to _not_ restate all
the existing HTTP definitions, it just seeks to use them. If DoH was doing
something inconsistent with 7225 that would imo probably be a doh bug. In
the case of 451 In this case 7225 defines the response body as a human
readable description of the censorship - so I don't think wireformat would
be a good choice. (7225 doesn't make an exception for non human clients, of
which there have always been many). It is interesting to note how the code
itself is indeed actionable - the DoH client might choose to repeat the
same query of a VPN or Tor knowing this information.



> There appear to be two extreme positions possible:
>
> 1) DOH servers should, as much as possible, encode all the semantics in
> UDP wireformat DNS messages and return appropriate messages when using
> HTTP's redirection, authorization, or caching semantics.  Clients can then
> act on the DNS messages within a structure that may include other recursive
> resolvers.
>

I hope this message has shed some light on this. wireformat in a non 2xx
response, while legal, is not the response to the dns query in the http
request. I would say its probably not a good idea either, but ymmv.


> 2) DOH clients should use HTTP semantics as faithfully as possible,
> returning UDP wireformat DNS messages only after the full HTTP protocol
> mechanics have been worked out and synthesizing DNS messages (such as
> SRVFAIL) when these are implied by the HTTP exchange.
>
>
I hope the situation is a bit clearer that this - but yes this is closer.


> The document leans more toward 2 than 1, but does not do so in a way that
> makes it mandatory; you would appear to be able to write a conformant
> server or client using either theory (or much in between).  I suspect that
> leaning is due to the web application use case's presumed popularity
> compared to the traditional DNS client use case, but I could be wrong.
>
> I think I disagree here. While you can indeed stick wireformat into all
kinds of non 2xx responses, I do not believe it means what you thought it
meant when you wrote this :)



> Whatever the cause, I believe that additional clarity here is important
> and that some text specifically on the interplay in semantics would be
> useful.
>

The text you cited in the top of your mail (which I've trimmed here
prematurely) was meant to be part of that, but more important the "over
HTTP" parts are meant to make it clear that HTTP semantics carry the DNS
semantics.

I'm certainly happy to add a sentence or two to help make that more clear
(suggestions?) but I don't want to start defining/reiterating HTTP
semantics in the DoH doc - we've already got lots of documents (i.e.
rfc723x.. which are all included normatively by 7230 which doh includes
normatively) that do that and its not in doh's charter to mess those things
anyhow.


>