Re: [Ietf-languages] Khmer orthographic reform

Richard Wordingham <> Thu, 31 October 2019 00:25 UTC

Return-Path: <>
Received: from localhost (localhost []) by (Postfix) with ESMTP id 2924B12022C for <>; Wed, 30 Oct 2019 17:25:10 -0700 (PDT)
X-Virus-Scanned: amavisd-new at
X-Spam-Flag: NO
X-Spam-Score: -1.999
X-Spam-Status: No, score=-1.999 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: (amavisd-new); dkim=pass (2048-bit key)
Received: from ([]) by localhost ( []) (amavisd-new, port 10024) with ESMTP id dg3WPE1cxdD6 for <>; Wed, 30 Oct 2019 17:25:07 -0700 (PDT)
Received: from ( []) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by (Postfix) with ESMTPS id D2AC9120018 for <>; Wed, 30 Oct 2019 17:25:06 -0700 (PDT)
Received: from JRWUBU2 ([]) by cmsmtp with ESMTP id PyGui7O8z5GlNPyGuiz3A0; Thu, 31 Oct 2019 00:25:04 +0000
X-Originating-IP: []
X-Spam: 0
X-Authority: v=2.3 cv=frbymmwf c=1 sm=1 tr=0 a=yrOAJgItaIMndimPI+pDLQ==:117 a=yrOAJgItaIMndimPI+pDLQ==:17 a=jpOVt7BSZ2e4Z31A5e1TngXxSK0=:19 a=IkcTkHD0fZMA:10 a=8390bBgGAAAA:8 a=IxFLyuZUAAAA:8 a=ZpnIj9rRAAAA:8 a=17rJItsl5FcP3bsfAdwA:9 a=QEXdDO2ut3YA:10 a=ONfENc2Xldf9O1kC1nT1:22 a=rp6CNfvUyW0OTBmOTiB_:22 a=RELIQkRzTb6Ho8CLp0UT:22
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;; s=meg.feb2017; t=1572481504; bh=8Ly63L3lRjpgS6WsDhWDKEQs7BBvhBu4EGcFLlViH64=; h=Date:From:To:Subject:In-Reply-To:References; b=gyC8DjRPjt7qG5t7wqZFFA6IAhKb9LoteTkYM8br8LfK3jSBErbcPLhXd+gteC2n/ 4ZATPhbp66kjqjM0PRJYfB4K6N+MzE6MZAaPNuUMtB2R9/+LIdWp0h4tQoeQRAokg6 BEylWUQFUAmsgaqWJ3uGlpLo+jpld0weZuM7V30JUBgDXkGb+i6KgziSncBqH/Wpp7 ghM/3TJ7tG+vq86SLQ7ImU4hTjuw47Wj8WCljoOndXjdVroC483iSZsIGZEYZxIXyz aOu7vFcJInDQvNkMW19HVboADMwVtt3uOwBCzWQS/uYxGzoKbBZfsuE5ThWqXNDzez uii/OvRorKJOg==
Date: Thu, 31 Oct 2019 00:25:03 +0000
From: Richard Wordingham <>
Message-ID: <20191031002503.17a667d0@JRWUBU2>
In-Reply-To: <>
References: <>
X-Mailer: Claws Mail 3.13.2 (GTK+ 2.24.30; i686-pc-linux-gnu)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
X-CMAE-Envelope: MS4wfP0eck4jGWFK92lw4V8CTxVPxMiu8QIPxcmze7IuG0oPNNQX3opRyaePnVx3pajUtm7gacHApKtcrBrklSyIffLgunNA1N4NwDDdf1fADCZh/tzagOV3 7SzgpuC8QuLnLW+3BsiwuL7OP/aiBJnTtTwHXf4wLNuz5fREEBzGMwDQ
Archived-At: <>
Subject: Re: [Ietf-languages] Khmer orthographic reform
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <>
List-Unsubscribe: <>, <>
List-Archive: <>
List-Post: <>
List-Help: <>
List-Subscribe: <>, <>
X-List-Received-Date: Thu, 31 Oct 2019 00:25:10 -0000

On Wed, 30 Oct 2019 12:00:18 +0100
Élie Roux <> wrote:

> Dear all,
> I have some data in Khmer that I need to use two tags for:
> - one for Khmer as written before the orthographic reforms of the
> XXth c.
> - one for Khmer written according to said reforms
> The situation has been explained to me as follows:
> """
> The reform is not exactly clear cut, unfortunately. Let me see if I
> can find any sources.

One generally available resource, very interesting but very limited, is
Michel Antelme's article ‘Inventaire provisoire des
charactères et divers siɡnes des écritures khmères pré-modernes
et modernes employés pour la notation du khmer, du siaois et du
pāli’ - .
We may need a new control sequence or character for the old writing of
what he transliterates as _kantora_, a word for 'mouse'.  (We
definitely need a few more graphical characters.)

From p20, it would seems that a key boundary may be the abandonment of
subscript consonants for CVC clusters.