[precis] names and usernames

Peter Saint-Andre <stpeter@stpeter.im> Sun, 12 February 2017 23:29 UTC

Return-Path: <stpeter@stpeter.im>
X-Original-To: precis@ietfa.amsl.com
Delivered-To: precis@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id C5068129404 for <precis@ietfa.amsl.com>; Sun, 12 Feb 2017 15:29:57 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.701
X-Spam-Level:
X-Spam-Status: No, score=-2.701 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=stpeter.im header.b=A2kvUOzh; dkim=pass (1024-bit key) header.d=messagingengine.com header.b=Ca86nPTf
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id HXhGZagjRMaq for <precis@ietfa.amsl.com>; Sun, 12 Feb 2017 15:29:56 -0800 (PST)
Received: from new1-smtp.messagingengine.com (new1-smtp.messagingengine.com [66.111.4.221]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 569C71293FE for <precis@ietf.org>; Sun, 12 Feb 2017 15:29:56 -0800 (PST)
Received: from compute2.internal (compute2.nyi.internal [10.202.2.42]) by mailnew.nyi.internal (Postfix) with ESMTP id 7238319F6; Sun, 12 Feb 2017 18:29:55 -0500 (EST)
Received: from frontend1 ([10.202.2.160]) by compute2.internal (MEProxy); Sun, 12 Feb 2017 18:29:55 -0500
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d=stpeter.im; h= content-transfer-encoding:content-type:date:from:message-id :mime-version:subject:to:x-me-sender:x-me-sender:x-sasl-enc :x-sasl-enc; s=mesmtp; bh=CIcMfwlSsNJkrDkR4paMt0V6VCs=; b=A2kvUO zhlGIh8dBbM3I1HfZ3naQErbr2YXnfJphOEVvqWvYPafbYz7tBkcxF9oqVB4usQS BPGwH6whDyEVVQ0ek/itC5adxGCHNASb/vDfAAYgYOpQXzrTbkC5NXlB20CcmFVZ Cf3N2u83pUqeKX66U0Q45nc9BdiEacyGsn8uc=
DKIM-Signature: v=1; a=rsa-sha1; c=relaxed/relaxed; d= messagingengine.com; h=content-transfer-encoding:content-type :date:from:message-id:mime-version:subject:to:x-me-sender :x-me-sender:x-sasl-enc:x-sasl-enc; s=smtpout; bh=CIcMfwlSsNJkrD kR4paMt0V6VCs=; b=Ca86nPTf1bug/LyAGaXLiF5iCuMqOTdsftybNhowwtFKeS Sr/nl7Im5vloVmDO39KKX70Wy6ZawS3qRoa84+s5fqrazHFPpt2TLbKA/U3vbBRU PL8qeeCW+URcJNCtEDq5tzKdGOW6uCtakkiecTo1rhrkt968Cb0bKUWQkTF/0=
X-ME-Sender: <xms:8--gWGRAahSL34QUZQQCa-dRO5WNU5xTo53NlJshK582QRQrvDCIgg>
X-Sasl-enc: Sq65uIfPvOHyyNeVHJGqG+vY0M9C3hab9cxYzO5beW3L 1486942195
Received: from aither.local (unknown [76.25.4.24]) by mail.messagingengine.com (Postfix) with ESMTPA id DDC247E06B; Sun, 12 Feb 2017 18:29:54 -0500 (EST)
To: "precis@ietf.org" <precis@ietf.org>
From: Peter Saint-Andre <stpeter@stpeter.im>
Message-ID: <cbc41f53-8d39-76ad-a2a7-276d50db9bac@stpeter.im>
Date: Sun, 12 Feb 2017 16:29:54 -0700
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:45.0) Gecko/20100101 Thunderbird/45.7.1
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 7bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/precis/YS6gZd5d0CsQ9yst6FWWLnuUd_4>
Subject: [precis] names and usernames
X-BeenThere: precis@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Preparation and Comparison of Internationalized Strings <precis.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/precis>, <mailto:precis-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/precis/>
List-Post: <mailto:precis@ietf.org>
List-Help: <mailto:precis-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/precis>, <mailto:precis-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 12 Feb 2017 23:29:57 -0000

John Klensin has brought to my attention that it is currently impossible 
to represent some people's names in PRECIS usernames because some of the 
relevant Unicode code points are disallowed by the IdentifierClass 
defined in RFC 7564 (and thus by the UsernameCaseMapped and 
UsernameCasePreserved profiles defined in RFC 7613).

First, RFC 7564 disallows "default ignorable" code points in the 
IdentifierClass. However, as I understand it some of these code points 
are need to represent characters in names that might be desirable to 
people living within communities that use Indic script and eastern 
Arabic script (e.g., Persian and writing systems derived from Persian). 
In particular, the Unicode Standard specifies that ZWJ and ZWNJ are 
"default ignorable" and it seems that these code points are especially 
important in this context.

Second, apparently some Chinese family names are typically written 
(especially outside the People's Republic of China) using characters 
that the Unicode Consortium assigns to non-BMP code points, or assigns 
in the BMP but as compatibility decomposable characters (and thus 
disallowed by RFC 7564 in the IdentifierClass).

I'm not sure whether we can solve these problems (internationalization 
is messy and we've never tried to guarantee that any particular name or 
preferred string could be represented in PRECIS usernames), but input 
from people with a deeper understanding of these issues would be 
appreciated. I have attempted to reach out to relevant experts, and will 
report back to this list with any findings.

In the meantime, I plan to submit revised I-Ds addressing other issues 
with the PRECIS specifications sometime this evening.

Peter