Re: [Idnet] Intelligence-Defined Network Architecture and Call for Interests

Oscar Mauricio Caicedo Rendon <omcaicedo@unicauca.edu.co> Fri, 31 March 2017 13:48 UTC

In-Reply-To: <52152b22-0178-0531-c26d-bb15a5c53ec8@gmail.com>
References: <3B110B81B721B940871EC78F107D848CF33029@DGGEMM506-MBS.china.huawei.com> <20170331030720.GF4808@spectre> <9a761473-d188-5a1a-d31e-6ad991dfd1e5@nokia-bell-labs.com> <0e924c06-b8fb-48fa-5137-f4de120b03db@nokia-bell-labs.com> <b6b93efd-9d78-f4f7-80db-411a39a745a6@inria.fr> <52152b22-0178-0531-c26d-bb15a5c53ec8@gmail.com>
From: Oscar Mauricio Caicedo Rendon <omcaicedo@unicauca.edu.co>
Date: Fri, 31 Mar 2017 08:48:14 -0500
Message-ID: <CABo5upVx75UOqpidSRRc2kmDp74XB+_7QF4hQ_W-LEdrUhih5Q@mail.gmail.com>
To: Brian E Carpenter <brian.e.carpenter@gmail.com>
Cc: =?UTF-8?B?SsOpcsO0bWUgRnJhbsOnb2lz?= <jerome.francois@inria.fr>, idnet@ietf.org
Content-Type: multipart/alternative; boundary=94eb2c122e6adaebfb054c0710c0
Archived-At: <https://mailarchive.ietf.org/arch/msg/idnet/yY2blz8JHnlu0eHhkTtGtSa1SVo>
Subject: Re: [Idnet] Intelligence-Defined Network Architecture and Call for Interests

Hi everyone.

I think the third option is the most viable.

On a related note, the BRASIL data set, which has been used for IP traffic
classification, is available at
https://www.cl.cam.ac.uk/research/srg/netos/projects/brasil/data/index.html
Furthermore, the following links point to interesting papers that use ML
for IP traffic classification:

- *Lightweight Application Classification for Network Management.*
https://www.cl.cam.ac.uk/research/srg/netos/projects/brasil/publications/2007-jiang2007lightweight.pdf

- *A Machine Learning Approach for Efficient Traffic Classification.*
https://www.cl.cam.ac.uk/research/srg/netos/projects/brasil/publications/2007-li2007machine.pdf

- *Experience with High-Speed Automated Application-Identification for
Network-Management.*
https://www.cl.cam.ac.uk/research/srg/netos/projects/brasil/publications/atoz.ancs09.pdf

Regards,

On Fri, Mar 31, 2017 at 7:47 AM, Brian E Carpenter <
brian.e.carpenter@gmail.com> wrote:

> > 1) on anonymization: this is far from being simple...
>
> Yes, and another issue is that it may even make the data useless. I hit
> that problem a few years ago when evaluating stateless hash algorithms for
> use in the IPv6 flow label. The problem was that anonymization would change
> the addresses in a pcap trace in such a way that some regularity in the
> original data would be replaced by pseudo-random bits, which would
> completely falsify the results from a statistical viewpoint. I think that
> in the same way, ML results from anonymized data might be meaningless - and
> there is no way to know that.
>
> Regards
>    Brian
>
> On 31/03/2017 17:57, Jérôme François wrote:
> > Hi,
> >
> > Data is essential, but sharing it is very difficult (I'm not an
> > operator) due to privacy and also legal issues.
> > Here are a few comments:
> > 1) on anonymization: this is far from being simple and I do not believe
> > that anybody can guarantee that no sensitive data will leak (and in the
> > end this will be the responsibility of the operator who has shared the
> > data).
> > -> That is why sharing in a non-public manner (under an NDA) is still
> > the most viable solution (in my opinion)
> >
> > 2) characterizing the data we would like to access is clearly a first
> > step (even if many of us would like to have raw data)
> >
> > 3) Another possibility is to run the algorithms you want to test on a
> > remote platform without accessing the data and get back the results
> > (here too some safeguards may be needed to prevent a program from trying
> > to extract sensitive data as results!)
> >
> > jerome
> >
> > On 31/03/2017 at 06:15, Laurent Ciavaglia wrote:
> >> Also CAIDA has multiple measurement tools to collect data.
> >> Some IETFers are specialists of these tools and report findings on
> >> data collected via these tools.
> >>
> >> Might be something to look at.
> >>
> >> Also, and with this I'm finished for today, documenting the
> >> characteristics of the requested data set(s) might help.
> >>
> >> BR, Laurent.
> >>
> >> On 31/03/2017 06:12, Laurent Ciavaglia wrote:
> >>> Hello,
> >>>
> >>> A couple of suggestions:
> >>>     - Why not raise this point / request in various WGs (e.g. in the
> >>> OPS Area) and RGs (e.g. MAPRG, NMRG)?
> >>>     - Why not work with the IETF meeting NOC to collect data sets?
> >>>
> >>> BR, Laurent.
> >>>
> >>> On 31/03/2017 05:07, Pedro Martinez-Julia wrote:
> >>>> Hi all,
> >>>>
> >>>> That's it and I insist that, if anybody in the mailing list is bound to
> >>>> an operator or a similar organization that manages huge amounts of data,
> >>>> just the data we need, please try to provide it. We will be able to help
> >>>> you to build some anonymization solution, if needed. Thank you.
> >>>>
> >>>> Regards,
> >>>> Pedro
> >>>>
> >>>
> >>> --
> >>>
> >>> Laurent Ciavaglia
> >>>
> >>> Nokia, Bell Labs
> >>>
> >>>
> >>>
> >>> +33 160 402 636
> >>>
> >>> route de Villejust - Nozay, France
> >>>
> >>> linkedin.com/in/laurent.ciavaglia
> >>>
> >>
> >> _______________________________________________
> >> IDNET mailing list
> >> IDNET@ietf.org
> >> https://www.ietf.org/mailman/listinfo/idnet
> >
>
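
To make Brian's point in the quoted thread concrete, here is a minimal
sketch of how anonymization can replace a regularity in the addresses with
pseudo-random bits. Everything in it is an assumption for illustration only:
the synthetic "trace" (256 hosts in the documentation prefix 192.0.2.0/24),
the salted SHA-256 mapping, and the helper names anonymize and
distinct_prefixes. It is not the method used in the flow label evaluation,
nor a recommended anonymization scheme.

```python
import hashlib
import ipaddress


def anonymize(addr: str, salt: bytes = b"hypothetical-secret") -> str:
    """Map an IPv4 address to a pseudo-random one via a salted hash.

    Illustrative only: the salt and the scheme are assumptions, not a
    vetted anonymization method.
    """
    digest = hashlib.sha256(salt + ipaddress.IPv4Address(addr).packed).digest()
    # Use the first 4 bytes of the digest as the replacement address.
    return str(ipaddress.IPv4Address(digest[:4]))


def distinct_prefixes(addrs, prefix_len: int = 24) -> int:
    """Count how many distinct /prefix_len networks the addresses fall into."""
    return len(
        {ipaddress.ip_network(f"{a}/{prefix_len}", strict=False) for a in addrs}
    )


# Synthetic trace: 256 hosts that all share one /24 (a strong regularity).
original = [f"192.0.2.{i}" for i in range(256)]
anonymized = [anonymize(a) for a in original]

print("distinct /24s before:", distinct_prefixes(original))
print("distinct /24s after: ", distinct_prefixes(anonymized))
```

Before anonymization all addresses fall into a single /24; afterwards they
are scattered over many unrelated prefixes, so any statistic that depends on
prefix locality (hash distribution, or features fed to an ML classifier)
would be measured on structure that is no longer there.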



-- 
*Oscar Mauricio Caicedo Rendón*
*PhD Computer Science - Federal University of Rio Grande do Sul*
*Full Professor - University of Cauca*

-- 

------------------------------
*University of Cauca: Committed to quality.*