Re: [Idna-update] IDNA and combining sequences (was: Re: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>)

"Asmus Freytag (c)" <asmusf@ix.netcom.com> Tue, 13 March 2018 23:47 UTC

Return-Path: <asmusf@ix.netcom.com>
X-Original-To: idna-update@ietfa.amsl.com
Delivered-To: idna-update@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 5F40A1201F8 for <idna-update@ietfa.amsl.com>; Tue, 13 Mar 2018 16:47:24 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.72
X-Spam-Level:
X-Spam-Status: No, score=-2.72 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ix.netcom.com; domainkeys=pass (2048-bit key) header.from=asmusf@ix.netcom.com header.d=ix.netcom.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 3G-hvlPHmOOf for <idna-update@ietfa.amsl.com>; Tue, 13 Mar 2018 16:47:21 -0700 (PDT)
Received: from elasmtp-dupuy.atl.sa.earthlink.net (elasmtp-dupuy.atl.sa.earthlink.net [209.86.89.62]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 85D35126DFB for <idna-update@ietf.org>; Tue, 13 Mar 2018 16:47:21 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ix.netcom.com; s=dk12062016; t=1520984841; bh=GfyVBzLQJuYBHugSvMpiGZcWO+dPSPZP2qU9 A+/w27Y=; h=Received:Subject:To:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:Content-Type: Content-Transfer-Encoding:Content-Language:X-ELNK-Trace: X-Originating-IP; b=GzMDuCxZmUrfbqGHvAbL2AXrTpSTiP2hYKs3lBgn65FeSC OJanzRNrPKfakZmC/W05RGlyIesYg6iAZMOeyXpoKvgN+PEDttQocosZg/9AilgKfrZ 14b5lgCc9KD3wImUu5XYEcJpEAz4hDizHw/F2nnGp7NgzyKwvUVQrZHRAn/tiAZY5C1 W7SXUY4ZY0HAuNxJZIL5/sv+2fhvtyK3bCEdIxBI2VBQMvZ8nRIgSC6arnq5z/7ushW tvCoVBriYsUOD/JeSQNLITytV1wSS2HRqh9Wno4uv++eQDYTS/AyaFurYGhTBJCLSzo 9+71QIWwRU4zh3WRzLDW82uGw3dg==
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk12062016; d=ix.netcom.com; b=sad0BEnweBQ3jdN5uzavpepOLUPqzdAuupzxvUmY7x+HeVZK+fX1XZK/1ZDCOAx6PvDfyaPru+6e9TpClErMNBDRFfQsnJJr4Jcil3KTI0Qo+VQwGjP0ulMiysjeriKn423Jk3w/qEdlFdFdLoMDpnHSPK/A/BscQ/Boy6ci5Mq4SpXH+OilYJ83pLyx9UiZUlmrOcbsm49tO0uM/fZEDm1J0QV02PVMIC5G6aH5uvYldKZt8j3hMMBgHe7dny6m+mnoLyW58+j9CCH/o3ghJ1CFgEXEIlgSEWHDWr7p6SjCQWTkpPKPDZZVqYpIErd3VsUVjaAh+5byPPLbuF/yEw==; h=Received:Subject:To:References:From:Message-ID:Date:User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Transfer-Encoding:Content-Language:X-ELNK-Trace:X-Originating-IP;
Received: from [46.21.151.107] (helo=[10.4.47.190]) by elasmtp-dupuy.atl.sa.earthlink.net with esmtpa (Exim 4) (envelope-from <asmusf@ix.netcom.com>) id 1evtdX-000EXT-Rb; Tue, 13 Mar 2018 19:47:20 -0400
To: John Levine <johnl@taugh.com>, idna-update@ietf.org
References: <20180313214745.86AA8228D992@ary.local>
From: "Asmus Freytag (c)" <asmusf@ix.netcom.com>
Message-ID: <1420573e-4d9d-7853-ffdc-7fc7c2290598@ix.netcom.com>
Date: Tue, 13 Mar 2018 16:47:17 -0700
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0
MIME-Version: 1.0
In-Reply-To: <20180313214745.86AA8228D992@ary.local>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Content-Language: en-US
X-ELNK-Trace: 464f085de979d7246f36dc87813833b2c1627926350bb93fd0fd2bb49320ec6434d70163d6cf65b6350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 46.21.151.107
Archived-At: <https://mailarchive.ietf.org/arch/msg/idna-update/96exxareNMDk4JsVjLx6B6qgT64>
Subject: Re: [Idna-update] IDNA and combining sequences (was: Re: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>)
X-BeenThere: idna-update@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: "Internationalized Domain Names in Applications \(IDNA\) implementation and update discussions" <idna-update.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idna-update>, <mailto:idna-update-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idna-update/>
List-Post: <mailto:idna-update@ietf.org>
List-Help: <mailto:idna-update-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idna-update>, <mailto:idna-update-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 13 Mar 2018 23:47:24 -0000

On 3/13/2018 2:47 PM, John Levine wrote:
> In article <73252c78-361c-c8cb-e392-c5b4c41deded@ix.netcom.com> you write:
>> Here's a link to the human readable version of the Thai LGR:
>> https://www.icann.org/sites/default/files/lgr/lgr-2-thai-script-01jun17-en.html
>> I would be curious, whether you can make sense out of the description
>> for Thai in the Root Zone. I also invite you to compare that LGR to some
>> of the gTLD IDN tables for Thai.
> I think I can follow them OK.  The characters each are characterized
> as consonant, various types of vowels, tones, and diacritics.  The
> ordering rules would make more sense to me as regular expressions.
Check out the bottom of the HTML version of each scripts LGR file.

Click on

https://www.icann.org/sites/default/files/lgr/lgr-2-thai-script-01jun17-en.html

and go to WLE Rules. You should see a table of regexes.

Let me know if you'd like to see any additional information.
A./
>
> R's,
> John
>