Re: [xmpp] Unicode Version Interop Concerns in JIDs

"Sam Whited" <sam@samwhited.com> Fri, 13 September 2019 22:20 UTC

Return-Path: <sam@samwhited.com>
X-Original-To: xmpp@ietfa.amsl.com
Delivered-To: xmpp@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 02EA7120116 for <xmpp@ietfa.amsl.com>; Fri, 13 Sep 2019 15:20:52 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.701
X-Spam-Level:
X-Spam-Status: No, score=-2.701 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=samwhited.com header.b=ZV1trocQ; dkim=pass (2048-bit key) header.d=messagingengine.com header.b=bKbEem6G
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id JAwxikMNu2MY for <xmpp@ietfa.amsl.com>; Fri, 13 Sep 2019 15:20:50 -0700 (PDT)
Received: from wout5-smtp.messagingengine.com (wout5-smtp.messagingengine.com [64.147.123.21]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 329C61200FA for <xmpp@ietf.org>; Fri, 13 Sep 2019 15:20:50 -0700 (PDT)
Received: from compute4.internal (compute4.nyi.internal [10.202.2.44]) by mailout.west.internal (Postfix) with ESMTP id 385F354E for <xmpp@ietf.org>; Fri, 13 Sep 2019 18:20:49 -0400 (EDT)
Received: from imap34 ([10.202.2.84]) by compute4.internal (MEProxy); Fri, 13 Sep 2019 18:20:49 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=samwhited.com; h=mime-version:message-id:in-reply-to:references:date:from:to :subject:content-type:content-transfer-encoding; s=fm3; bh=JuKGs KkTQWPaSXwxZEU8Wr3Cxo2vJkxxHiQzvL3xcRg=; b=ZV1trocQMFeE9a8ZP91X6 t4x+XlGZ7Sd1dfgHUc24iE4n9dCrGJYBCZcJ1+YxFBbDqxuiSkHqoO13OCNkOU1e X6TyJby581DsJRtN2Q8kN2hBv6JhIhEbzb7oVwY4Mte1NQ2Y+II2nME8uNQRo1aC PpU3i/H0kL5xFaGzeuIOsU12GynjZC/yi/g/wCfhHE/uzb/YCjKeua6acSLaH+/y lYmtsVtGhbqcCpOK1vgPKvhU13c4jyn+KSPfEstwY2+mUu/6VE1pWbX+OY9IV+tu TJupRmOGTdDfm06ceOQZ/iVxQufWAKZe2U7jHeUo5n23a6ubBUe7ZFlni8wshMC8 w==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=content-transfer-encoding:content-type :date:from:in-reply-to:message-id:mime-version:references :subject:to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender :x-sasl-enc; s=fm3; bh=JuKGsKkTQWPaSXwxZEU8Wr3Cxo2vJkxxHiQzvL3xc Rg=; b=bKbEem6Gl/1JkG4s2uf+8Z0Zc3Dh1uKaRMvkHkKv5QoUIzgdpAePq5oUM dS32SLsIT8HrCBSN6ubQYB3JfvKTFJAsF40IvHsUW0QLG2GJI1DOtrfDsXLmywFK 4xlgSaapaxyk82F329JddKaxhE23NV6vYfGVF763HGMlOHgUKOmZSqGJfrhWAYjT F/DIfINNXbMS6umqBLPCS19QT0vp6CzmU7UPOUIYlOCZtlw2hJ3dgAttAr54ewXQ gLV+dGwkfattxTdzjb60KBEPpMLf44/d78nkhHzVlVTDOA8wPtRAPHLnrSTEICBd oxca9jSnat54PwNnrjH+EL/Iv5EIA==
X-ME-Sender: <xms:QBZ8XYnGVl0wpTgxxcF9QC398pV-mLOxxnzqbXLyDWuXf-EZw4YhwA>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedufedrtdekgddtfecutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecunecujfgurhepofgfggfkjghffffhvffutgfgsehtqh ertderreejnecuhfhrohhmpedfufgrmhcuhghhihhtvggufdcuoehsrghmsehsrghmfihh ihhtvggurdgtohhmqeenucfrrghrrghmpehmrghilhhfrhhomhepshgrmhesshgrmhifhh hithgvugdrtghomhenucevlhhushhtvghrufhiiigvpedt
X-ME-Proxy: <xmx:QBZ8Xe_PvPsFcluui_QhXEYx0KnrZGSJSE1S336YgkOojjlKg64_tg> <xmx:QBZ8XZC_Y7TS85RIFAKqq6SKzp01txuYqS5lrcuvzRARRq1q0-jkUA> <xmx:QBZ8XeTas6JyTKSu2xsVC0s9zVRN8-_KRTVZl2iiQPATpnjrvmu6Fg> <xmx:QBZ8XfBAK5E_QJusIhtY1-soxmlsluYQ1yk4DylstsY2NuL0L5sf3g>
Received: by mailuser.nyi.internal (Postfix, from userid 501) id 590241460061; Fri, 13 Sep 2019 18:20:48 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
User-Agent: Cyrus-JMAP/3.1.7-237-gf35468d-fmstable-20190912v1
Mime-Version: 1.0
Message-Id: <8e5f9f1b-3968-42fe-8882-dce84a47734f@www.fastmail.com>
In-Reply-To: <b1eccf9b-e43b-9c38-c589-81c4a042487f@geekplace.eu>
References: <dbbb91ba-9116-50f7-fefa-09ef2bd5991d@ik.nu> <b1eccf9b-e43b-9c38-c589-81c4a042487f@geekplace.eu>
Date: Fri, 13 Sep 2019 22:20:28 +0000
From: Sam Whited <sam@samwhited.com>
To: xmpp@ietf.org
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Archived-At: <https://mailarchive.ietf.org/arch/msg/xmpp/2VsMKp-FsDlQ_kCJgTEfT3vlmCE>
Subject: Re: [xmpp] Unicode Version Interop Concerns in JIDs
X-BeenThere: xmpp@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: XMPP Working Group <xmpp.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xmpp>, <mailto:xmpp-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xmpp/>
List-Post: <mailto:xmpp@ietf.org>
List-Help: <mailto:xmpp-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xmpp>, <mailto:xmpp-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 13 Sep 2019 22:20:52 -0000

On Fri, Sep 13, 2019, at 07:35, Florian Schmaus wrote:
> So it would be great if runtimes would provide a way to load a new
> Unicode Character Database [1] without updating the runtime
> environment. Something like Character.loadUnicodeDb("unicode-
> 12.1..dat")

The various Go Unicode libraries do this (more or less). They provide
build options that allow you to pick a Unicode version, so you could
always build with the latest one or possibly link in multiple (I think,
I haven't tried that but I don't see why it wouldn't work).

> Now I know that we may not have not much control over what
> functionality runtime environments provide. However, nothing prevents
> you from using (or implementing) a Unicode library with that such
> feature. I am considering implementing something like that in [2]. The
> whole process could eventually be automated, which, I believe, is
> solving the issue Ralph describes.

Sort of. Having libraries that link in UCD data or similar makes things
easier, but we still have to decide what's supported, what special cases
need to be worked around (I'm sure there will be some odd bugs and what
not that crop up with different versions), do we do negotiation or
support a specific range of versions, what if you're using an old
version of the XMPP client that doesn't have the new data files shipped
with it or linked in, etc.

I agree Unicode related libraries should support multiple versions, I
just don't think that really solves the problem of how we address
that in the XMPP specs and how we address compatibility between
clients and servers.

—Sam

-- 
Sam Whited