Re: [idn] Re: permission <draft-ietf-idn-ace37-00.txt (attach)

"Edmon" <edmon@neteka.com> Thu, 05 July 2001 23:31 UTC

Received: from psg.com (exim@psg.com [147.28.0.62]) by ietf.org (8.9.1a/8.9.1a) with SMTP id TAA00077 for <idn-archive@lists.ietf.org>; Thu, 5 Jul 2001 19:31:44 -0400 (EDT)
Received: from lserv by psg.com with local (Exim 3.31 #1) id 15IISj-00028V-00 for idn-data@psg.com; Thu, 05 Jul 2001 16:23:09 -0700
Received: from [66.48.4.181] (helo=neteka.com) by psg.com with smtp (Exim 3.31 #1) id 15IISi-00028P-00 for idn@ops.ietf.org; Thu, 05 Jul 2001 16:23:09 -0700
Message-ID: <00a201c105a8$1a31cb00$1001a8c0@neteka.com>
From: Edmon <edmon@neteka.com>
To: Soobok Lee <lsb@postel.co.kr>, idn@ops.ietf.org
References: <5.1.0.14.1.20010705084656.02b0e138@mail.viagenie.qc.ca> <006601c10569$ffa20e00$1001a8c0@neteka.com> <003901c105a4$401cf6e0$ed1bd9d2@postel.co.kr>
Subject: Re: [idn] Re: permission <draft-ietf-idn-ace37-00.txt (attach)
Date: Thu, 05 Jul 2001 19:13:24 -0400
MIME-Version: 1.0
Content-Type: text/plain; charset="ks_c_5601-1987"
Content-Transfer-Encoding: 7bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 5.50.4522.1200
X-MimeOLE: Produced By Microsoft MimeOLE V5.50.4522.1200
Sender: owner-idn@ops.ietf.org
Precedence: bulk
Content-Transfer-Encoding: 7bit

> Hi,
>
Great hearing from you.  Actually, I sh
ould give you some credit cause you sta
rted the codepoint reordering thing fir
st, though the ACE37 idea really came a
bout earlier than I saw your draft.  An
yway, I think my code block shifting me
chanism is much more simple and can suc
cessfully get 21+ han ideographs which
is much more acceptable than DUDE's cur
rent state.

> When the diff value is less than 0xF,
what is the ACE37 form for it ?
>
When diff<=0x7F it will be in the 7bit
form, that is
<b4><b32>

the reason <b4> has to be used is that
it signifies how many more characters a
re used for the particular codepoint (or
what I termed as codepoint bracket).  If
only a <b32> is used, then you wont know
whether it is a 15 bit form or a 22 bit
form.
Therefore, if diff=0xF, ACE37=wf

> And any example C doe for ACE37  avai
lable on the web ?
>
Not yet, David & I will work on it and
post it on the web if more people are i
nterested in the idea and think that it
could be better than DUDE.  For now you
can check out my excel worksheet to see
how it works. http://www.dnsii.org/ace3
7/ace37-encode.xls (for encoding), http
://www.dnsii.org/ace37/ace37-decode.xls
(for decoding).

Edmon




> Thanks.
>
> Soobok
>
> > > >Chung & Leung
[Page 6]
> > > >ACE37       ACE Utilizing All 37
Alphanumeric Characters      July 2001
> > > >
> > > >    The following table explains
how base-4 characters are combined
> with
> > > >    base-32 characters to form a
representation of a diff (key:
> b4=base-
> > > >    4, b32=base-32):
> > > >
> > > >              diff value
|bits|       ACE37 Form
> > >
 >        -------------------------
|----|----------------------------
> > > >                  diff<=0x7F
|  7 | <b4><b32>
> > > >            0x80<=diff<=0x7FFF
| 15 | <b32><b32><b32>
> > > >          0x8000<=diff<=0x1FFFF
| 17 | w<b4><b32><b32><b32>
> > > >         0x20000<=diff<=0xFFFFF
| 20 | ww<b32><b32><b32><b32>
> > > >        0x100000<=diff<=0x10FFFF
| 22 | <b4>w<b32><b32><b32><b32>
> > > >
> > > >    Note that the "bits" column
represents the maximum number of
> > > >    significant bits for the giv
en diff value.  For example when
> > > >    diff<=0x7F, the maximum value
is 0b1111111, therefore the number of
>
>
> ----- Original Message -----
> From: "Edmon" <edmon@neteka.com>
> To: "Natalia Syracuse" <nsyracus@ietf
.org>; "David Leung"
> <david@neteka.com>; "Marc Blanchet" <
Marc.Blanchet@viagenie.qc.ca>;
> <idn@ops.ietf.org>
> Cc: <jseng@pobox.org.sg>
> Sent: Friday, July 06, 2001 12:48 AM
> Subject: [idn] Re: permission <draft-
ietf-idn-ace37-00.txt (attach)
>
>
> > Hi all,
> >
> > I was unaware that the workgroup no
longer accepts new drafts.  Anyway, I
> > have drafted a new ACE based on the
simplicity of DUDE which has hugely
> > improved compression.  Worst case s
cenario CJK could have 21 han
> characters!
> > Attached below is a copy of the dra
ft (for my original submission), you
> can
> > also find it at http://www.dnsii.or
g/idn-ace37-00.txt (easier to read) and
> > hopefully in the i-d-n.net website
soon.
> >
> > ACE37 is based on the one-pass one-
mode scheme of DUDE (diiferential XOR),
> > then utilizes a simple code block s
hifting (similar to the reference
> points
> > in the AMC series) to hugely increa
se the capacity for CJK (worst case
> > scenario 21 han characters!) and th
en utilizes base-32 for compression (as
> > in LACE) (DUDE and AMC-w/v uses bas
e-32 only for flagging).  In addition
> to
> > base-32, a base-4 scheme is introdu
ced by using the remaining characters
> > {wxyz}.  These contain 2 bits of ch
aracter information and doubles as an
> > indicator for codepoint brackets.
All the while, the algorithm is kept to
> > be as simple as DUDE.
> >
> > Hopefully you might find that it is
interesting and appropriate to be
> > considered as an ACE within the IET
F.  Afterall, it was intended to be an
> > integrated version of the three pri
mary ACEs: DUDE, LACE and the AMC
> series,
> > identified by the ACE design team r
eport.
> >
> > Looking forward to all your inputs.
> >
> > Edmon
>