Re: [Idna-update] IDNA and combining sequences (was: Re: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>)

"John Levine" <johnl@taugh.com> Thu, 15 March 2018 19:43 UTC

Return-Path: <johnl@iecc.com>
X-Original-To: idna-update@ietfa.amsl.com
Delivered-To: idna-update@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 58C9B124C27 for <idna-update@ietfa.amsl.com>; Thu, 15 Mar 2018 12:43:00 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.761
X-Spam-Level:
X-Spam-Status: No, score=-1.761 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.249, SPF_PASS=-0.001, T_RP_MATCHES_RCVD=-0.01, URIBL_BLOCKED=0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1536-bit key) header.d=iecc.com header.b=TqPMz31G; dkim=pass (1536-bit key) header.d=taugh.com header.b=OVvTOg/W
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 2cNW9fx8f9F2 for <idna-update@ietfa.amsl.com>; Thu, 15 Mar 2018 12:42:58 -0700 (PDT)
Received: from gal.iecc.com (gal.iecc.com [IPv6:2001:470:1f07:1126:0:43:6f73:7461]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 79D0912D77C for <idna-update@ietf.org>; Thu, 15 Mar 2018 12:42:58 -0700 (PDT)
Received: (qmail 42253 invoked from network); 15 Mar 2018 19:42:57 -0000
DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=iecc.com; h=date:message-id:from:to:cc:subject:in-reply-to:mime-version:content-type:content-transfer-encoding; s=a509.5aaaccc1.k1803; bh=nwsK7jYEW4qKy/+S70uD3S5JhjhfLR75a94epSlj9JM=; b=TqPMz31Ggb/aqKHGftEErvVc48k6wBZ0Ajac/6z6LI5qNkPNBbWMpUWbMcdYOa7hFNmyPVVtzkN6mAbn20ZdzvTJ4tsgvg+ciz6oROz255zoaG0+ByLn9Z9b8WRNkantGawQR5LQRDDCIMXMNjC7J0FCCsqNiIe94rtDnppihKVNZrX4nGc/prgcMV1mLgWVly35U2u9PQ/vpR3AZm1S/YJl+0XgGY+fuYudg6FepkllciIB2IbIDHaw7iJNeGZl
DKIM-Signature: v=1; a=rsa-sha256; c=simple; d=taugh.com; h=date:message-id:from:to:cc:subject:in-reply-to:mime-version:content-type:content-transfer-encoding; s=a509.5aaaccc1.k1803; bh=nwsK7jYEW4qKy/+S70uD3S5JhjhfLR75a94epSlj9JM=; b=OVvTOg/WEs8PC6WRogczy18qsC4mS7gmXkGP7e2AoNBr1XmmJSleE5jnp5BxztaX1xZF4bNyU5HtfqPmWo6/8bAWws4Fa2RDUvF2nPnH93YYzy3kwDwG5HyxldgSjuCP+M5ObSr7DxuFiY9b4Mj/jW1opT4wm4A1wnixUKKgZTS50vg+KE659erP8LtNpj43B7R3omaiad1IPAzNFw+g3bcWZiR+kwUFNgDo+Ymuc4zUxd/JkxEE60ZG/uyvFzRP
Received: from ary.local ([IPv6:2001:470:1f07:1126::78:696d:6170]) by imap.iecc.com ([IPv6:2001:470:1f07:1126::78:696d:6170]) with ESMTP via TCP6; 15 Mar 2018 19:42:57 -0000
Received: by ary.local (Postfix, from userid 501) id CF68F22C3BA1; Thu, 15 Mar 2018 15:42:56 -0400 (AST)
Date: 15 Mar 2018 15:42:56 -0400
Message-Id: <20180315194256.CF68F22C3BA1@ary.local>
From: "John Levine" <johnl@taugh.com>
To: idna-update@ietf.org
Cc: asmusf@ix.netcom.com
In-Reply-To: <1420573e-4d9d-7853-ffdc-7fc7c2290598@ix.netcom.com>
Organization: Taughannock Networks
X-Headerized: yes
Mime-Version: 1.0
Content-type: text/plain; charset=utf-8
Content-transfer-encoding: 8bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/idna-update/shSJZrkNFkpufudJ-I7FXyE_x5s>
Subject: Re: [Idna-update] IDNA and combining sequences (was: Re: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>)
X-BeenThere: idna-update@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: "Internationalized Domain Names in Applications \(IDNA\) implementation and update discussions" <idna-update.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idna-update>, <mailto:idna-update-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idna-update/>
List-Post: <mailto:idna-update@ietf.org>
List-Help: <mailto:idna-update-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idna-update>, <mailto:idna-update-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 15 Mar 2018 19:43:00 -0000

In article <1420573e-4d9d-7853-ffdc-7fc7c2290598@ix.netcom.com> you write:
>> I think I can follow them OK.  The characters each are characterized
>> as consonant, various types of vowels, tones, and diacritics.  The
>> ordering rules would make more sense to me as regular expressions.
>Check out the bottom of the HTML version of each scripts LGR file.
>
>Click on
>
>https://www.icann.org/sites/default/files/lgr/lgr-2-thai-script-01jun17-en.html
>
>and go to WLE Rules. You should see a table of regexes.

I was more thinking of regexes for an entire valid string, not for
each rule, but close enough.

R's,
John