Re: [OPSAWG] AD review of draft-ietf-opsawg-large-flow-load-balancing (draft response)

Benoit Claise <bclaise@cisco.com> Tue, 22 April 2014 21:49 UTC

Return-Path: <bclaise@cisco.com>
X-Original-To: opsawg@ietfa.amsl.com
Delivered-To: opsawg@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1A1CF1A027C for <opsawg@ietfa.amsl.com>; Tue, 22 Apr 2014 14:49:44 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -9.772
X-Spam-Level:
X-Spam-Status: No, score=-9.772 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RP_MATCHES_RCVD=-0.272, SPF_PASS=-0.001, USER_IN_DEF_DKIM_WL=-7.5] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id EVoriUdKhB5V for <opsawg@ietfa.amsl.com>; Tue, 22 Apr 2014 14:49:38 -0700 (PDT)
Received: from aer-iport-4.cisco.com (aer-iport-4.cisco.com [173.38.203.54]) by ietfa.amsl.com (Postfix) with ESMTP id 5DCFA1A0278 for <opsawg@ietf.org>; Tue, 22 Apr 2014 14:49:36 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=cisco.com; i=@cisco.com; l=48603; q=dns/txt; s=iport; t=1398203371; x=1399412971; h=message-id:date:from:mime-version:to:cc:subject: references:in-reply-to; bh=tXNsbbzQ3K7Y4NtfHIlprfwTLdd8rmpWn34S4MrIhk8=; b=NeupyQfy5ib90lDle4hDEukI4sg634UIJECRCReZDB/sdIwWL+zjmD5R 6QvZnum0YfE13p5xGrHnsboaK8vUWbe3czHRemOZy56/lshczaSweZTnL rA6AXnTqisBwWfRrsJFK51VWJlFqXS3RMldN1TtR5EXzaT51exDt7kO+r Y=;
X-IronPort-Anti-Spam-Filtered: true
X-IronPort-Anti-Spam-Result: AhEFAHLiVlOQ/khL/2dsb2JhbABZDoI0RE/EdoEWFnSCJQEBAQQtSwEQCxEEAQEBCQwKAQEGBwkDAgECAQ8lCQgGAQwBBQIBAQWIJAMRDcgiDYVqF4w+gUFXBgEKhC4ElwKBboE3hSKGJ4VSgnFCO4Et
X-IronPort-AV: E=Sophos; i="4.97,906,1389744000"; d="scan'208,217"; a="19511811"
Received: from ams-core-2.cisco.com ([144.254.72.75]) by aer-iport-4.cisco.com with ESMTP; 22 Apr 2014 21:49:29 +0000
Received: from [10.60.67.86] (ams-bclaise-8915.cisco.com [10.60.67.86]) by ams-core-2.cisco.com (8.14.5/8.14.5) with ESMTP id s3MLnSvs025826; Tue, 22 Apr 2014 21:49:28 GMT
Message-ID: <5356E3E8.9060106@cisco.com>
Date: Tue, 22 Apr 2014 23:49:28 +0200
From: Benoit Claise <bclaise@cisco.com>
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:24.0) Gecko/20100101 Thunderbird/24.4.0
MIME-Version: 1.0
To: ramki Krishnan <ramk@Brocade.com>, Anoop Ghanwani <anoop@alumni.duke.edu>
References: <CA+-tSzxDpD2V7Q15Jjgzz2A+d5Gn_92YQ-1_Zvx2AP=s5AWpxA@mail.gmail.com> <534BD465.4090503@cisco.com> <CA+-tSzxh-R_4W+bqy7ATrjVf5cmPx29Oo_371BcrZXDODFTEug@mail.gmail.com> <534D2030.3020809@cisco.com> <CA+-tSzy4+nswSyj27cKSTKr3jo=sZXpo1Eut894B+=7h_30zsg@mail.gmail.com> <534E4505.9080806@cisco.com> <CA+-tSzw7Mhdp5DbxSSudwGvRYMVTT4rYzP6biVvGxC8Xsm2J7A@mail.gmail.com> <C7634EB63EFD984A978DFB46EA5174F2C004004482@HQ1-EXCH01.corp.brocade.com>
In-Reply-To: <C7634EB63EFD984A978DFB46EA5174F2C004004482@HQ1-EXCH01.corp.brocade.com>
Content-Type: multipart/alternative; boundary="------------000001080505080609000803"
Archived-At: http://mailarchive.ietf.org/arch/msg/opsawg/SG4PTGuwn1DbBOIQsP5np1XOzKg
Cc: "opsawg@ietf.org" <opsawg@ietf.org>
Subject: Re: [OPSAWG] AD review of draft-ietf-opsawg-large-flow-load-balancing (draft response)
X-BeenThere: opsawg@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: OPSA Working Group Mail List <opsawg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/opsawg>, <mailto:opsawg-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/opsawg/>
List-Post: <mailto:opsawg@ietf.org>
List-Help: <mailto:opsawg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/opsawg>, <mailto:opsawg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 22 Apr 2014 21:49:44 -0000

Thanks.
The IETF LC has been requested.

Regards, Benoit
>
> Hi Benoit,
>
> Please find updated draft below. We have addressed all your comments.
>
> http://datatracker.ietf.org/doc/draft-ietf-opsawg-large-flow-load-balancing/
>
> Thanks,
>
> Anoop & Ramki
>
> *From:*ghanwani@gmail.com [mailto:ghanwani@gmail.com] *On Behalf Of 
> *Anoop Ghanwani
> *Sent:* Thursday, April 17, 2014 11:11 AM
> *To:* Benoit Claise
> *Cc:* ramki Krishnan; opsawg@ietf.org
> *Subject:* Re: AD review of 
> draft-ietf-opsawg-large-flow-load-balancing (draft response)
>
> Thanks Benoit.  We'll figure out how to account for this in the draft 
> and will have the update posted in the next few days.
>
> Anoop
>
> On Wed, Apr 16, 2014 at 1:53 AM, Benoit Claise <bclaise@cisco.com 
> <mailto:bclaise@cisco.com>> wrote:
>
> Anoop,
>
>     Benoit,
>
>     Please see inline.
>
>     On Tue, Apr 15, 2014 at 5:04 AM, Benoit Claise <bclaise@cisco.com
>     <mailto:bclaise@cisco.com>> wrote:
>
>     On 14/04/2014 19:09, Anoop Ghanwani wrote:
>
>         Hi Benoit,
>
>         I will work on the editorials shortly and I'm removing those
>         from the discussion.   See below:
>
>         On Mon, Apr 14, 2014 at 5:28 AM, Benoit Claise
>         <bclaise@cisco.com <mailto:bclaise@cisco.com>> wrote:
>
>         Hi Anoop,
>
>         Thanks for the new draft version.
>         I removed some of the points
>
>             On Tue, Feb 18, 2014 at 7:55 AM, Benoit Claise
>             <bclaise@cisco.com <mailto:bclaise@cisco.com>> wrote:
>
>                 -
>
>                     A number of routers support sampling techniques such as sFlow [sFlow-
>
>                     v5, sFlow-LAG], PSAMP [RFC 5475] and NetFlow Sampling [RFC 3954].
>
>                     For the purpose of large flow identification, sampling must be
>
>                     enabled on all of the egress ports in the router where such
>
>                     measurements are desired.
>
>                 I don't understand the second sentence.
>                 One way to read this is: sampling must be _enabled _on
>                 all of the egress ports where such measurements are
>                 desired.
>                     Ok, this is an obvious statement. If the
>                 measurements are desired, enable them
>
>             Yes,
>
>                 Or maybe you want to say: _sampling _must be enabled
>                 on all of the egress ports where such measurements are
>                 desired.
>                     This is a false statement: if you have the choice
>                 between sampling and non sampling, use non sampling
>                 measurements.
>                 Or maybe you want to say: sampling must be enabled on
>                 _all _of the egress ports where such measurements are
>                 desired.
>                     This is a false statement: if I have ECMP on 2
>                 links, and only one of them can't do non sampling,
>                 then we should not force
>                     sampling on both links.
>                 You see, I'm confused.
>
>                 You miss a couple of key messages:
>                 - if unsampled measurements are available, use those.
>                 - egress means where LAG/ECMP are enabled (this is
>                 important for the paragraph starting with "If egress
>                 sampling is not available, ingress sampling can
>                 suffice since the central management entity use")
>
>             We were not intending to discuss a mix sampling and
>             non-sampling interfaces in the same router, but this is a
>             reasonable point and it will be clarified (i.e. we will
>             state that it's possible to mix sampled and non sampled
>             interfaces as long as the function of large flow
>             detection/identification can be performed).
>
>         You're still missing the point that unsampled measurements is
>         better than sampled ones.
>
>         We do point this out in Section 4.3.4.
>
>         http://tools.ietf.org/html/draft-ietf-opsawg-large-flow-load-balancing-10#section-4.3.4
>
>         >>>
>
>                  As link speeds get higher, sampling rates are typically reduced
>
>                  to keep the number of samples manageable which places a lower
>
>                  bound on the detection time.  With automatic hardware
>
>                  recognition, large flows can be detected in shorter windows on
>
>                  higher link speeds since every packet is accounted for in
>
>                  hardware [NDTM  <http://tools.ietf.org/html/draft-ietf-opsawg-large-flow-load-balancing-10#ref-NDTM>].
>
>         >>>
>
>     I've seen that, but why do you equate automatic _hardware
>     _recognition to unsampled measurements.
>     Whether it's done in hardware of software is orthogonal.
>
>     OK, I think I see the reason for the disconnect.  In the draft we
>     only talked about automatic hardware recognition and sampling as
>     methods for large flow recognition.  It seems you're suggesting
>     there's a third way -- unsampled measurements (likely in hardware)
>     but use of software for the actual recognition of large flows from
>     those measurements?  Can you confirm?  If so, we can add that to
>     the draft as well.
>
> I see two only ways: sampled and unsampled.
> Both could be done in hardware (most likely on router; This is what 
> you called automatic hardware recognition, AFAICT) or software.
> Whether it's done in hardware of software is orthogonal to the 
> mechanism in this document.
>
> I hope this clarifies
>
> Regards, Benoit
>
>
>
>
>             Is this what you mean by:
>
>             It is possible that a router may have line cards that support a
>
>             sampling technique while other line cards support automatic hardware
>
>             detection of large flows.
>
>             It's not very clear.
>
>         No, this does not address your point.  This is talking about
>         the case where line cards have different capabilities, rather
>         than a line card that supports both.
>
>         Since we already have the advantages and disadvantages listed
>         in 4.3.4, do you still see a need for explicitly mentioning
>         that automatic hardware detection is to be preferred over
>         sampling if both are available?
>
>         We did debate the point about accuracy quite a bit among the
>         authors.  The question is -- does that level of accuracy
>         really matter for the large flow case?
>
>     Maybe not (for the details:
>     http://dl.acm.org/citation.cfm?id=1791959), but I don't understand
>     why you want to limit this mechanism to sampling only. Simply
>     telling that sampled data could be good enough, but if you have
>     unsampled data, you will get a better accuracy.
>
> Thanks for the reference.
>
>         Since we are dealing with flows that need to consume a certain
>         percent of the link bandwidth, sampling, if configured correctly,
>
>     And you don't go in the details of "sampling, if configured
>     correctly"...
>
> There are suggestions in some of the references (e.g. the DevoFlow 
> paper), but there are also other references, e.g.
>
> http://www.sflow.org/packetSamplingBasics/index.htm.   This is a 
> general sampling problem, rather than something that was introduced by 
> this draft.  If you think it would be useful to add something (or 
> maybe just pointers to the references), that can be done.
>
> Anoop
>