Re: [v6ops] MAC table shortage in IPv6 networks caused by multiple IPv6 prefixes/addresses//FW: New Version Notification for draft-liu-v6ops-running-multiple-prefixes-01.txt

"Liubing (Leo)" <leo.liubing@huawei.com> Fri, 11 July 2014 08:11 UTC

Return-Path: <leo.liubing@huawei.com>
X-Original-To: v6ops@ietfa.amsl.com
Delivered-To: v6ops@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9A2631B2AAE for <v6ops@ietfa.amsl.com>; Fri, 11 Jul 2014 01:11:37 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.852
X-Spam-Level:
X-Spam-Status: No, score=-4.852 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_MED=-2.3, RP_MATCHES_RCVD=-0.651, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id esZHZzwCaIA8 for <v6ops@ietfa.amsl.com>; Fri, 11 Jul 2014 01:11:35 -0700 (PDT)
Received: from lhrrgout.huawei.com (lhrrgout.huawei.com [194.213.3.17]) (using TLSv1 with cipher RC4-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 3C8E81B2AAD for <v6ops@ietf.org>; Fri, 11 Jul 2014 01:11:33 -0700 (PDT)
Received: from 172.18.7.190 (EHLO lhreml405-hub.china.huawei.com) ([172.18.7.190]) by lhrrg02-dlp.huawei.com (MOS 4.3.7-GA FastPath queued) with ESMTP id BHA11154; Fri, 11 Jul 2014 08:11:31 +0000 (GMT)
Received: from nkgeml409-hub.china.huawei.com (10.98.56.40) by lhreml405-hub.china.huawei.com (10.201.5.242) with Microsoft SMTP Server (TLS) id 14.3.158.1; Fri, 11 Jul 2014 09:11:30 +0100
Received: from NKGEML506-MBX.china.huawei.com ([169.254.3.24]) by nkgeml409-hub.china.huawei.com ([10.98.56.40]) with mapi id 14.03.0158.001; Fri, 11 Jul 2014 16:11:26 +0800
From: "Liubing (Leo)" <leo.liubing@huawei.com>
To: Andrew Yourtchenko <ayourtch@cisco.com>
Thread-Topic: [v6ops] MAC table shortage in IPv6 networks caused by multiple IPv6 prefixes/addresses//FW: New Version Notification for draft-liu-v6ops-running-multiple-prefixes-01.txt
Thread-Index: AQHPnC1ik94AZHMtAkW0lrev3UVEV5uaFjug
Date: Fri, 11 Jul 2014 08:11:25 +0000
Message-ID: <8AE0F17B87264D4CAC7DE0AA6C406F453D8F2AB4@nkgeml506-mbx.china.huawei.com>
References: <8AE0F17B87264D4CAC7DE0AA6C406F453D8EEA21@nkgeml506-mbx.china.huawei.com> <8AE0F17B87264D4CAC7DE0AA6C406F453D8F1C32@nkgeml506-mbx.china.huawei.com> <alpine.DEB.2.02.1407091226000.7929@uplift.swm.pp.se> <CFE32281.2067C%evyncke@cisco.com> <alpine.DEB.2.02.1407091710020.7929@uplift.swm.pp.se> <alpine.OSX.2.00.1407091840270.99248@ayourtch-mac> <8AE0F17B87264D4CAC7DE0AA6C406F453D8F291C@nkgeml506-mbx.china.huawei.com> <alpine.OSX.2.00.1407101220310.93503@ayourtch-mac>
In-Reply-To: <alpine.OSX.2.00.1407101220310.93503@ayourtch-mac>
Accept-Language: en-US, zh-CN
Content-Language: zh-CN
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [10.111.98.132]
Content-Type: text/plain; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
X-CFilter-Loop: Reflected
Archived-At: http://mailarchive.ietf.org/arch/msg/v6ops/0jn-9pfhshhXAm8OnYUzxQmQD7E
Cc: "v6ops@ietf.org" <v6ops@ietf.org>
Subject: Re: [v6ops] MAC table shortage in IPv6 networks caused by multiple IPv6 prefixes/addresses//FW: New Version Notification for draft-liu-v6ops-running-multiple-prefixes-01.txt
X-BeenThere: v6ops@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: v6ops discussion list <v6ops.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/v6ops>, <mailto:v6ops-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/v6ops/>
List-Post: <mailto:v6ops@ietf.org>
List-Help: <mailto:v6ops-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/v6ops>, <mailto:v6ops-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 11 Jul 2014 08:11:37 -0000

Hi Andrew,

> > [Bing] Would you like to share more details of the network? E.g. it is a
> dual-stack or something else.
> 
> It's dualstack, the vast majority of the clients are on a single wireless
> segment (the total number of wired devices is ~300-400, so I treat them as
> noise.
> 
> The wireless clients are addressed with SLAAC, and are outside of my control
> (conference attendees).
> 
> The segment has no officially designated servers.
> 
> The configuration from the first hop router is below:
> 
> interface Vlan4
>   description !!! WIRELESS CLIENTS !!! CiscoLive2014 !!!
>   .. IPv4 configuration omitted ..
>   ipv6 address FE80::1 link-local
>   ipv6 address 2001:4D38:A:400::1/64
>   ipv6 nd reachable-time 1800000
>   ipv6 nd prefix default 86400 9000 off-link no-autoconfig
>   ipv6 nd prefix 2001:4D38:A:400::/64 604800 86400 off-link
>   ipv6 nd prefix 2001:4D38:A:6800::/64 604800 0 off-link
>   ipv6 nd router-preference High
>   ipv6 nd ra lifetime 9000
>   no ipv6 redirects
>   no ipv6 unreachables
>   ipv6 verify unicast source reachable-via rx
> 
> ipv6 route 2001:4D38:A:400::/64 vlan4
> 
> Note a few bits that are specific for the conference setup, and some are
> specific to the topology:
> 
> 1) maximally high lifetimes (RAs are throttled on their way to the clients
> down to 1 RA in 10 minutes).
> 
> 2) clearing on-link bit: results in clients never trying to resolve the other
> clients' addresses => less memory usage on the portable devices, smaller ND
> caches there.
> 
> 3) default no-autoconfig: to force the necessity to add the prefixes manually,
> to force the operator to always have the "ipv6 nd prefix ..."
> with the correct config.
> 
> 4) two prefixes, one with zero preferred lifetime: there was a second VLAN
> for disaster recovery purposes, where the clients would have ended up if
> there was a catastrophic failure no the main pair of the wireless controllers.
> That VLAN had a mirrored configuration.
> Experimentally, I've found such a configuration to cause a rapid deprecation
> of the "wrong" prefix on a iOS and OSX, resulting in minimal disruption to
> them in case of switchover.
> 
> 5) interface route for the prefix, to send the traffic to it (configuring the
> prefix "off-link" had removed the directly connected route).

[Bing] Andrew, thanks much for sharing the details. 

> > It is quite often that a host would have at least 3 IPv6 addresses: a
> >link-local, a global and a privacy address. If ULAs are enabled,
> >DHCPv6/SLAAC co-existing, there might be more IPv6 addresses.
> 
> Right. I was well within the constraints for the first hop, so opted to not have
> DHCPv6. If I wanted a tighter control, I'd have turned off SLAAC and used
> stateful DHCPv6. Maybe I'll experiment with it this year.
> 
> > So I was just guessing, maybe the 1.7x only represent that part of the
> nodes is enabling IPv6?
> 
> The %% of the nodes running IPv6 was about 80-85%
> (http://2014.ciscolive-ipv6.com/munin/ipv6noc/ipv6noc/neighbors_dualstac
> k_percentage.html
> - the dips are the the night times, a lot of internal apps were IPv4 so the
> neighbor entries went away)
> 
> I calculated this %% by taking the "show arp" and "show ipv6 neigh"
> outputs and correlating the hosts by mac address.
> 
> As I said Apple devices (they are the most aggressive today wrt the privacy
> addresses) are ~50% of the network, so maybe in another setup the figures
> would be different.
> 
> Do you have some numbers you could share ?

[Bing] The problem was encountered by the co-author (Yang Bo) when doing projects in some campus/enterprise networks. I'm sorry we don't have the specific statistics data like above, since we don't own the networks. 

Now there are some enterprise/campus networks under real use or considering using L2 networks. Some are aiming at better user isolation through VLANs (some even consider QinQ mechanism); while some are aiming less configuration/management than the traditional L3 networks. So there would be thousands of hosts aggregated to the core switch (normally there are two core switches stacked together, but only share one cache space). As IPv6 is beginning real use, for example, some of the campus networks are already dual-stack, and the majority of the hosts are Win 7, we once observed in one campus that DHCPv6/SLAAC are both enabled, each Win 7 host had 4 IPv6 addr (SLAAC+DHCPv6+Privacy+link-local)+1 IPv4 addr. 

Current high-end switch can certainly fulfill the thousands of users' IP-MAC entries, however, the forwarding performance would be a significant redundancy, which would cost more budget than expected. This is something that won't happen in IPv4, so I thought maybe this could be considered as an IPv6 networks operational issue, and took the issue into the draft and gain some opinions in the list.

> > Plus, an IPv6-MAC binding would cost more cache space than an IPv4-MAC
> binding (2-4 times due to implementation).
> >
> >> I re-read the original mail starting this thread and the problem
> >> description there to me boils down to two issues:
> >>
> >> 1) one IP node has more IP addresses in version 6 than in version 4.
> >> 2) an IP address takes more lookup space in version 6 than in version 4.
> >>
> >> Do I grok it right ?
> > [Bing] Basically yes. This is one of the problems cause by multiple
> > IPv6 addresses in draft-liu-v6ops-running-multiple-prefixes.
> 
> Ok, then I think that's a tricky one - we can not decrease the number of bits
> in IPv6 address, nor get the end hosts to go back to ARP for address
> resolution :-)
>
> Also, if iOS 8.0 indeed changes the MAC address upon the wireless
> connection, the problem is just about to get much worse and not
> IPv6-specific ;-)
> 
> So, I think there are several means to combat the lookup space churn:
> 
> 1) IPv6-only networks. We can't do that today and IPv4 is relatively not
> *so* much but it's something.
> 
> 2) Table maintenance/cleanup mechanisms. These appear to work today
> reasonably.
> 
> 3) provisioning hardware according to anticipated scale - and either using
> larger router boxes, or splitting large L3 domains into several smaller
> segments, like Mikael mentioned.
> 
> 4) using DHCPv6-only allocation of addresses, with one address per host
> assignment.
> 
> Do you think there is something more that is needed besides the above
> approaches to steer the addresses on the host ?

[Bing] Thanks for the proposed approaches. It seems that's what we can do so far in operation perspective.
As the opinions raised in the mailing list, people tend to consider it as a network design issue rather than operational issue. I will reflect the opinions in the next version of the draft.

Best regards,
Bing

> --a