Re: [xml2rfc] [xml2rfc-dev] [Rfc-markdown] New xml2rfc release: v2.12.1

Henrik Levkowetz <henrik@levkowetz.com> Tue, 30 October 2018 18:41 UTC

Return-Path: <henrik@levkowetz.com>
X-Original-To: xml2rfc@ietfa.amsl.com
Delivered-To: xml2rfc@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 1E3D1130E26; Tue, 30 Oct 2018 11:41:14 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.9
X-Spam-Level:
X-Spam-Status: No, score=-1.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id korFsezoYdbZ; Tue, 30 Oct 2018 11:41:11 -0700 (PDT)
Received: from zinfandel.tools.ietf.org (zinfandel.tools.ietf.org [IPv6:2001:1890:126c::1:2a]) (using TLSv1.2 with cipher DHE-RSA-AES128-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id DBD24130E36; Tue, 30 Oct 2018 11:41:11 -0700 (PDT)
Received: from h-37-140.a357.priv.bahnhof.se ([94.254.37.140]:52727 helo=[192.168.1.126]) by zinfandel.tools.ietf.org with esmtpsa (TLS1.2:DHE_RSA_AES_256_CBC_SHA256:256) (Exim 4.80) (envelope-from <henrik@levkowetz.com>) id 1gHYws-000599-Mm; Tue, 30 Oct 2018 11:41:08 -0700
Content-Type: text/plain; charset="us-ascii"
Mime-Version: 1.0 (1.0)
From: Henrik Levkowetz <henrik@levkowetz.com>
X-Mailer: iPad Mail (15F79)
In-Reply-To: <a6d979a3-9b27-84b2-1561-15b185c39bb5@nostrum.com>
Date: Tue, 30 Oct 2018 19:40:58 +0100
Cc: Julian Reschke <julian.reschke@gmx.de>, Miek Gieben <miek@miek.nl>, xml2rfc@ietf.org, xml2rfc-dev@ietf.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <5306F385-5593-4AF7-92E1-A1200EF4C7FF@levkowetz.com>
References: <E1gH6Wn-0002ca-CU@durif.tools.ietf.org> <20181029204446.xwjkdfdpj7sbhvsr@miek.nl> <0455bd48-3392-bb4c-f510-c57489792606@levkowetz.com> <20181029214325.5i3rrv55dpamvbsa@miek.nl> <a614b58e-0d4c-67d0-77f9-f5b6b66d7ff1@levkowetz.com> <d0a3693b-87b5-c8e9-8137-9d61805f58f0@gmx.de> <f01715aa-ee1b-ec80-0657-7d19b23da98a@levkowetz.com> <1d03c466-bd4a-934e-18f0-f9ac5bc2225e@gmx.de> <a6d979a3-9b27-84b2-1561-15b185c39bb5@nostrum.com>
To: Robert Sparks <rjsparks@nostrum.com>
X-SA-Exim-Connect-IP: 94.254.37.140
X-SA-Exim-Rcpt-To: xml2rfc@ietf.org, miek@miek.nl, julian.reschke@gmx.de, henrik@levkowetz.com, rjsparks@nostrum.com, xml2rfc-dev@ietf.org
X-SA-Exim-Mail-From: henrik@levkowetz.com
X-SA-Exim-Version: 4.2.1 (built Mon, 26 Dec 2011 16:24:06 +0000)
X-SA-Exim-Scanned: Yes (on zinfandel.tools.ietf.org)
X-Clacks-Overhead: GNU Terry Pratchett
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/LjPH_9I0XkRn-B7DLuWOZO24ar4>
Subject: Re: [xml2rfc] [xml2rfc-dev] [Rfc-markdown] New xml2rfc release: v2.12.1
X-BeenThere: xml2rfc@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <xml2rfc.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xml2rfc/>
List-Post: <mailto:xml2rfc@ietf.org>
List-Help: <mailto:xml2rfc-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 30 Oct 2018 18:41:24 -0000

Hi Robert,

> On 30 Oct 2018, at 19:34, Robert Sparks <rjsparks@nostrum.com> wrote:
> 
> Meta-comment/question:
> 
> Is this a discussion about the vocabulary that we should be tracking as an issue on the vocabulary document (to at least make it easier to find later)?

I've tried to capture the relevant bits in my implementation notes draft in parallel with writing the code.  The next rev will be submitted when submissions open again.

Best regards,

        Henrik

> 
> RjS
> 
>> On 10/30/18 2:20 AM, Julian Reschke wrote:
>>> On 2018-10-30 07:29, Henrik Levkowetz wrote:
>>> 
>>> 
>>>> On 2018-10-30 05:57, Julian Reschke wrote:
>>>>> On 2018-10-29 22:57, Henrik Levkowetz wrote:
>>>>> ...
>>>>> Country-specific address formatting.  Without it, you'll be stuck with the
>>>>> fallback, which is very americentric for the text output, somewhat less so
>>>>> for the html output (but not very refined).  I'd hate to see it go.
>>>>> ...
>>>> 
>>>> Example? Smells a bit like overkill, given that v3 has simplified
>>>> address formatting (postalLine).
>>> 
>>> Example: In Norwegian and Swedish address format, the postal code goes
>>> before the city, not after the region code.  This is correct:
>>> 
>>>    Midtskogveien 18
>>>    2020 Skedsmokorset
>>>    Norway
>>> 
>>> The current html specification would instead have produced, depending on
>>> how you work around its problems, one of
>>> 
>>>    Midtskogveien 18
>>>    Skedsmokorset
>>>    2020
>>>    Norway
>>> 
>>> or
>>> 
>>>    Midtskogveien 18
>>>    Skedsmokorset, 2020
>>>    Norway
>>> 
>>> which is a slighter degree of mangling than for places like various areas
>>> of Japan, that needs city-region.  Here is a Japanese example. This is
>>> correct (except that it's also a translation, not just a romanization):
>>> 
>>>    Japan
>>>    112-0001
>>>    Tokyo-to
>>>    Bunkyo-ku
>>>    4-3-2 Hakusan
>>>    3rd Floor Room B
>>> 
>>> and the RFC7992 specification would have rendered it as either this,
>>> preserving the semantic labelling of the parts, but loosing the city-region
>>> part:
>>> 
>>>    3rd Floor Room B
>>>    4-3-2 Hakusan
>>>    Tokyo-to, 112-0001
>>>    Japan
>>> 
>>> or would loose all semantic labelling through forced use of postalLine for
>>> all entries.
>>> 
>>> This problem has however been solved to a large degree by modern libraries.
>>> If you can do this *right*, why continue to do it wrong?
>> 
>> Because it's complex, and now you are relying on a specific implementation. Is the algorithm documented?
>> 
>>> postalLine would have been a bit of an improvement, if it hadn't been
>>> handicapped by throwing out interesting and valid information through
>>> the choice of not permitting any of the existing elements that made sense
>> 
>> Well, we can discuss it. <postalLine> is the way it is because the design team thought that the level of detail that we had before simply was unneeded and just made things too complex. I guess this discussion supports this point :-)
>> 
>>> for the location's address.  A better choice would have been to add
>>> city-region and postal-box as new address elements, and specify the use
>>> of country-specific formatting.  The latter is something I could do.
>>> I haven't done the former, even if it would have been tempting, given
>>> that it would have given a pretty complete set of labelled address entities
>>> to work with.
>>> 
>>> Take the statistics that are available here:
>>>    https://datatracker.ietf.org/stats/document/yearly/country/
>> 
>> Understood. I'm just not sure that the fact these kind of stats exists (and apparently are already done with heuristics) justifies the complexity of the vocabulary.
>> 
>>> Forcing the removal of country (and region, and city) labelling if your
>>> address needs a non-US address format makes producing that kind of info
>>> harder; and it also removes the vCard/hCard annotation made possible by
>>> having properly identified address parts.  The current code provides the
>>> correct hCard property labels also for non-US addresses.
>>> 
>>> I'm *tired* of having software only work right for one country out of
>>> many, and it's not necessary any longer.  Look at all the variations
>> 
>> Yes, that's why we took the complexity out of the vocabulary.
>> 
>>> across the globe, and come and tell me that it is preferred not to do
>>> things as right as you can for most of them:
>>> 
>>>    https://en.wikipedia.org/wiki/Postal_address
>>> 
>>> Finally, given an old conversation you had about address formatting, the
>>> rendering of the bottom address here may please you:
>>> 
>>> https://durif.tools.ietf.org/src/xml2rfc/html/show/elements.html#name-authors-addresses 
>>> ...
>> 
>> Best regards, Julian
>> 
>> _______________________________________________
>> xml2rfc-dev mailing list
>> xml2rfc-dev@ietf.org
>> https://www.ietf.org/mailman/listinfo/xml2rfc-dev
> 
>