Re: [alto] Warren Kumari's Discuss on draft-ietf-alto-xdom-disc-04: (with DISCUSS)

Warren Kumari <warren@kumari.net> Wed, 19 December 2018 00:48 UTC

Return-Path: <warren@kumari.net>
X-Original-To: alto@ietfa.amsl.com
Delivered-To: alto@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 4B84F131203 for <alto@ietfa.amsl.com>; Tue, 18 Dec 2018 16:48:50 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.901
X-Spam-Level:
X-Spam-Status: No, score=-1.901 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_MED=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001] autolearn=unavailable autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=kumari-net.20150623.gappssmtp.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id hfD-Cgie20CJ for <alto@ietfa.amsl.com>; Tue, 18 Dec 2018 16:48:48 -0800 (PST)
Received: from mail-wm1-x344.google.com (mail-wm1-x344.google.com [IPv6:2a00:1450:4864:20::344]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 91BF41311A7 for <alto@ietf.org>; Tue, 18 Dec 2018 16:48:43 -0800 (PST)
Received: by mail-wm1-x344.google.com with SMTP id a62so4414722wmh.4 for <alto@ietf.org>; Tue, 18 Dec 2018 16:48:43 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=kumari-net.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=D82G5tcBsx1RDr53N9OtLdgALbj/+JUbxh6xw0/DglQ=; b=hURKbHVu6G+GYVIIn3JO0aD7ry3qzBPXlSAKYzduqBf+hF6hOnUccxVDSEeO3eCByM aR64CMCS0Crqpo8xVMfAZt0SXdgK2Hul2vJy6p+9OPpRc5fcgjdG0Rzt2x/f2X/P2lC6 wwRbFZiP2FrI6oRTBieuksUapAszWlRIWIVCJ4IztgicbgmFpw8deghXvOAoVSD5uhvs wpt2Y6ELJ7cVHjrLTAk4MKgrawq8z6fpLCtbibdVPNNEwR9WkQ7vc+NuqRxZR/JW9e12 pPWnw91ki8NDd9vDWz5g32ieZfIDBuOFNfwpKkOgHwPJJTaKBJ+kQZpnGd9Qsr6YIIq3 ZWgg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=D82G5tcBsx1RDr53N9OtLdgALbj/+JUbxh6xw0/DglQ=; b=Rddlrswfp+z22InUwACGgQd6uy75CBIwphF3iegQdBhDlcPDn99Sv5F93PhuwhCUVn imBI/dP+WdDo4beUCt0tYq6o+Sr6p/A+Sz581LpwgPVr9YTTqu3V6nCkedpNgEX5Fhlm 9e71CuHTwKTXzXQ49VpzddQNvpfy0FUNG+hBZNcFigHXw0fKt048xu9PbOcR6PWL4mLw kGFSksrmQkxD+TDLgpw5Xrl98AdCgVZNunYnjzJGt7fOCQ4s13WPUFHZP8GNsnY+y8qK K29Il5xx2Mki4my/82MuoJWACbXZAcihwjb23imgVpc3ZKNsXV4nXG/xo74c3m5dDbJf M1Ng==
X-Gm-Message-State: AA+aEWZF+ehEB1sDtg5AobYy4R2vS5WR+odnJr9zIc1u2NhUCtKmqhDO Ea92IaoDlBfBxzomwVOu/Yfq2CidATMWCAv+yKahKrnFZWc=
X-Google-Smtp-Source: AFSGD/WI6OiF3yP2goT0A0TUjtbF9d0IEvH9GLK9tRLLURHL1gLloCvimiUrsrhi7u0htzaHh1Wn3KrFbkm3nvttIGU=
X-Received: by 2002:a1c:ce0e:: with SMTP id e14mr5354366wmg.53.1545180521601; Tue, 18 Dec 2018 16:48:41 -0800 (PST)
MIME-Version: 1.0
References: <154516546501.5516.6260976046434363502.idtracker@ietfa.amsl.com> <20181218232300.634bfvvny6jh3r3f@gw01.ehlo.wurstkaes.de>
In-Reply-To: <20181218232300.634bfvvny6jh3r3f@gw01.ehlo.wurstkaes.de>
From: Warren Kumari <warren@kumari.net>
Date: Tue, 18 Dec 2018 19:48:04 -0500
Message-ID: <CAHw9_iL+n0d0CejmD==ciROOTpUkjjLPSdZuYjbVxAgim5dojw@mail.gmail.com>
To: Sebastian Kiesel <ietf-alto@skiesel.de>
Cc: The IESG <iesg@ietf.org>, draft-ietf-alto-xdom-disc@ietf.org, Jan Seedorf <jan.seedorf@hft-stuttgart.de>, alto-chairs@ietf.org, alto@ietf.org
Content-Type: multipart/alternative; boundary="0000000000001aaa33057d556036"
Archived-At: <https://mailarchive.ietf.org/arch/msg/alto/cYIajy8un4HNVTp0_qG5Y9qQwOI>
Subject: Re: [alto] Warren Kumari's Discuss on draft-ietf-alto-xdom-disc-04: (with DISCUSS)
X-BeenThere: alto@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Application-Layer Traffic Optimization \(alto\) WG mailing list" <alto.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/alto>, <mailto:alto-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/alto/>
List-Post: <mailto:alto@ietf.org>
List-Help: <mailto:alto-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/alto>, <mailto:alto-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 19 Dec 2018 00:48:50 -0000

On Tue, Dec 18, 2018 at 6:23 PM Sebastian Kiesel <ietf-alto@skiesel.de>
wrote:

> Warren,
>
> apologies if I restate something obvious, but the key issue for this
> discussion is probably that the XDOM procedure is NOT supposed to be run
> on your laptop, but instead on a centralized server such as a Content
> Delivery Networks's (CDN) HTTP redirect server, a P2P tracker, etc.,
> i.e., a "resource directory" in the terminology defined in RFC 5693.
>
>
Ah, you might be restating something obvious, but I'd certainly missed the
above bit / had gotten that part backwards...


> Let me give an example:
>
> 1. You (or some software on your laptop) try to access
> http://softwarecompany.example.com/really-big-service-pack.bin
> --> DNS lookups for A records, TCP connection, HTTP GET ...
>
> 2. The server behind http://softwarecompany.example.com calls
> XDOM( $ENV{REMOTE_ADDR}, "ALTO:https" ) , i.e. with your laptop's
> IP address, or the "public" IP address of the outermost NAT in front
> of your laptop, as a parameter.
>
> 3. If the XDOM procedure succeeds, it will return the URI of an
> ALTO server (typically the one provided by your ISP).
>
> 4. The software on the HTTP server will contact said ALTO server to get
> more information about topology, routing costs, etc. from the point of
> view of your ISP.  Based on these ALTO informations, it will choose one
> of serveral known servers that can provide really-big-service-pack.bin
> to your laptop; it chooses the one that gives highest throughput and/or
> causes least costs for traffic.  It then returns an appropriate HTTP
> redirect to your laptop.
>
> 5. Your laptop will get the large file from there.
>
>
>
> Of course, all this overhead only makes sense for larger data transfers,
> probably not for regular "web surfing". That alone might limit the
> number of XDOM invocations.
>
>
> The DNS queries caused by the XDOM procedure will hit:
>
> - The recursive name server next to the HTTP redirect server,
>   or the tracker, etc. --> reasonable sizing of this name server
>   is the duty of the CDN operator, tracker operater etc.
>
> - The authoritative name servers of the ISPs. If XDOM starts to take off
>   and is used by many CDNs or trackers, the resulting load on these name
>   servers might become an incentive for the ISPs to put the NAPTR RRs in
>   place so that the first or second query succeeds.
>
> - If ISPs won't install the NAPTR RRs, XDOM escalates and the queries
>   will hit the authoritative servers of the RIRs etc.  But, their
>   answers (both NAPTR RRs or NXDOMAIN) can be cached in the recursive
>   name servers next to the redirect servers or trackers.  As said, we
>   assume the XDOM procedure will not run on billions of PCs, laptops and
>   smartphones, but only on some (ten)thousands of CDN servers, P2P
>   trackers, etc. in the Internet. Each of the adjacent recursive name
>   servers will probably learn quicky all the answers on a /16 or /8
>   level, and a cached result can be reused when the XDOM procedure is
>   called for a different IP address from the same /8 or /16.  Therefore,
>   the load on RIR's servers should be manageable as well.
>
>
> Does this sound reasonable?
>

I'm still somewhat uncomfortable with some of this (it feels like moving
costs onto $others), and the caching of the "negative" answer isn't very
long (but this could be adjusted).... but, my "somewhat uncomfortable" does
not rise to the level of a discuss, so I will clear it.
(not in front of right machine at the moment)

W



>
>
> Thanks,
>
> Sebastian
>
>
>
>
>
>
> On Tue, Dec 18, 2018 at 12:37:45PM -0800, Warren Kumari wrote:
> > Warren Kumari has entered the following ballot position for
> > draft-ietf-alto-xdom-disc-04: Discuss
> >
> > When responding, please keep the subject line intact and reply to all
> > email addresses included in the To and CC lines. (Feel free to cut this
> > introductory paragraph, however.)
> >
> >
> > Please refer to
> https://www.ietf.org/iesg/statement/discuss-criteria.html
> > for more information about IESG DISCUSS and COMMENT positions.
> >
> >
> > The document, along with other ballot positions, can be found here:
> > https://datatracker.ietf.org/doc/draft-ietf-alto-xdom-disc/
> >
> >
> >
> > ----------------------------------------------------------------------
> > DISCUSS:
> > ----------------------------------------------------------------------
> >
> > Note: I have not completed my review in detail (and so it may be answered
> > further down), but I wanted to get this in early...
> >
> > I'm in no way an ALTO expert (I can barely spell it), so am hoping that
> I'm
> > missing something obvious, but I'm really concerned by the scaling
> implications
> > / cost shifting of this.
> >
> > Let's say this suddenly becomes very popular -- Apple includes this in
> the iOS
> > App Store / iMessage app, or Chrome / Firefox decides to start doing
> this to
> > find the best datacenter to send traffic to or something...
> >
> > Until the huge majority of ISPs start answering with these records for
> all of
> > their subnets, it seems like there could be a sizable amount of traffic
> hitting
> > a: the ISPs recursive servers, b: RIRs, and possibly c: AS112 servers.
> >
> > E.g: The address I get when I lookup www.google.com is 216.58.193.164.
> > These are the lookups I'd need to do (I think!) if my $application (or,
> more
> > worrying, framework / browser) were to use this:
> >
> > wkumari$ dig +nocomment +nostats +nocmd NAPTR 164.193.58.216.in-addr.arpa
> > ;164.193.58.216.in-addr.arpa.   IN      NAPTR
> > 193.58.216.in-addr.arpa. 59     IN      SOA     ns1.google.com.
> > dns-admin.google.com. 226022060 900 900 1800 60
> >
> > wkumari$ dig +nocomment +nostats +nocmd NAPTR 193.58.216.in-addr.arpa
> > ;193.58.216.in-addr.arpa.       IN      NAPTR
> > 193.58.216.in-addr.arpa. 59     IN      SOA     ns1.google.com.
> > dns-admin.google.com. 225983176 900 900 1800 60
> >
> > wkumari$ dig +nocomment +nostats +nocmd NAPTR 58.216.in-addr.arpa
> > ;58.216.in-addr.arpa.           IN      NAPTR
> > 216.in-addr.arpa.       1539    IN      SOA     z.arin.net.
> dns-ops.arin.net.
> > 2017026288 1800 900 691200 10800
> >
> > wkumari$ dig +nocomment +nostats +nocmd NAPTR 216.in-addr.arpa
> > ;216.in-addr.arpa.              IN      NAPTR
> > 216.in-addr.arpa.       1665    IN      SOA     z.arin.net.
> dns-ops.arin.net.
> > 2017026288 1800 900 691200 10800
> >
> > This is 4 lookups per host / app / connection hitting my recursive
> servers. In
> > addition 2 of them hit Google's resolvers, and 2 hit ARINs. Yes, ARIN
> already
> > gets many "reverse" queries, and my recursive already does lots of
> lookups, but
> > the document doesn't (that I could see) discuss the potential fallout
> from
> > potentially *lots* more load. Caching is only slightly effective here --
> there
> > are many many subnets, and e.g the ARIN NoData,NoError response will be
> cached
> > for 1800 seconds (30 minutes).
> >
> > There are other examples -- for example, my laptop is currently on
> > 192.168.0.65. If I try connect to 192.168.1.2 using an app which
> implements
> > this, I'll have 4 queries hitting my recursive server (3 of which will
> get
> > NXDOMAIN) and 192.in-addr.arpa. hitting ARINs servers.
> >
> > I'm assuming that I must be missing something obvious here, because I
> cannot
> > see how the above sounds reasonable.
> >
> >
> >
> >
>


-- 
I don't think the execution is relevant when it was obviously a bad idea in
the first place.
This is like putting rabid weasels in your pants, and later expressing
regret at having chosen those particular rabid weasels and that pair of
pants.
   ---maf