Re: [xml2rfc] [xml2rfc-dev] [Rfc-markdown] New xml2rfc release: v2.12.1

Robert Sparks <rjsparks@nostrum.com> Tue, 30 October 2018 18:34 UTC

Return-Path: <rjsparks@nostrum.com>
X-Original-To: xml2rfc@ietfa.amsl.com
Delivered-To: xml2rfc@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 89C8E130DD2; Tue, 30 Oct 2018 11:34:17 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.88
X-Spam-Level:
X-Spam-Status: No, score=-1.88 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, T_SPF_HELO_PERMERROR=0.01, T_SPF_PERMERROR=0.01] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ULs3GZK-whpm; Tue, 30 Oct 2018 11:34:15 -0700 (PDT)
Received: from nostrum.com (raven-v6.nostrum.com [IPv6:2001:470:d:1130::1]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 97406127333; Tue, 30 Oct 2018 11:34:15 -0700 (PDT)
Received: from unescapeable.local ([47.186.18.66]) (authenticated bits=0) by nostrum.com (8.15.2/8.15.2) with ESMTPSA id w9UIYAdK061212 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NO); Tue, 30 Oct 2018 13:34:10 -0500 (CDT) (envelope-from rjsparks@nostrum.com)
X-Authentication-Warning: raven.nostrum.com: Host [47.186.18.66] claimed to be unescapeable.local
To: Julian Reschke <julian.reschke@gmx.de>, Henrik Levkowetz <henrik@levkowetz.com>, Miek Gieben <miek@miek.nl>
Cc: xml2rfc@ietf.org, xml2rfc-dev@ietf.org
References: <E1gH6Wn-0002ca-CU@durif.tools.ietf.org> <20181029204446.xwjkdfdpj7sbhvsr@miek.nl> <0455bd48-3392-bb4c-f510-c57489792606@levkowetz.com> <20181029214325.5i3rrv55dpamvbsa@miek.nl> <a614b58e-0d4c-67d0-77f9-f5b6b66d7ff1@levkowetz.com> <d0a3693b-87b5-c8e9-8137-9d61805f58f0@gmx.de> <f01715aa-ee1b-ec80-0657-7d19b23da98a@levkowetz.com> <1d03c466-bd4a-934e-18f0-f9ac5bc2225e@gmx.de>
From: Robert Sparks <rjsparks@nostrum.com>
Message-ID: <a6d979a3-9b27-84b2-1561-15b185c39bb5@nostrum.com>
Date: Tue, 30 Oct 2018 13:34:09 -0500
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:52.0) Gecko/20100101 Thunderbird/52.9.1
MIME-Version: 1.0
In-Reply-To: <1d03c466-bd4a-934e-18f0-f9ac5bc2225e@gmx.de>
Content-Type: text/plain; charset="utf-8"; format="flowed"
Content-Transfer-Encoding: 8bit
Content-Language: en-US
Archived-At: <https://mailarchive.ietf.org/arch/msg/xml2rfc/OIY_Gxq0lySAZNU5G1xcIL0vkPw>
Subject: Re: [xml2rfc] [xml2rfc-dev] [Rfc-markdown] New xml2rfc release: v2.12.1
X-BeenThere: xml2rfc@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: <xml2rfc.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/xml2rfc/>
List-Post: <mailto:xml2rfc@ietf.org>
List-Help: <mailto:xml2rfc-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/xml2rfc>, <mailto:xml2rfc-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 30 Oct 2018 18:34:18 -0000

Meta-comment/question:

Is this a discussion about the vocabulary that we should be tracking as 
an issue on the vocabulary document (to at least make it easier to find 
later)?

RjS

On 10/30/18 2:20 AM, Julian Reschke wrote:
> On 2018-10-30 07:29, Henrik Levkowetz wrote:
>>
>>
>> On 2018-10-30 05:57, Julian Reschke wrote:
>>> On 2018-10-29 22:57, Henrik Levkowetz wrote:
>>>> ...
>>>> Country-specific address formatting.  Without it, you'll be stuck 
>>>> with the
>>>> fallback, which is very americentric for the text output, somewhat 
>>>> less so
>>>> for the html output (but not very refined).  I'd hate to see it go.
>>>> ...
>>>
>>> Example? Smells a bit like overkill, given that v3 has simplified
>>> address formatting (postalLine).
>>
>> Example: In Norwegian and Swedish address format, the postal code goes
>> before the city, not after the region code.  This is correct:
>>
>>    Midtskogveien 18
>>    2020 Skedsmokorset
>>    Norway
>>
>> The current html specification would instead have produced, depending on
>> how you work around its problems, one of
>>
>>    Midtskogveien 18
>>    Skedsmokorset
>>    2020
>>    Norway
>>
>> or
>>
>>    Midtskogveien 18
>>    Skedsmokorset, 2020
>>    Norway
>>
>> which is a slighter degree of mangling than for places like various 
>> areas
>> of Japan, that needs city-region.  Here is a Japanese example. This is
>> correct (except that it's also a translation, not just a romanization):
>>
>>    Japan
>>    112-0001
>>    Tokyo-to
>>    Bunkyo-ku
>>    4-3-2 Hakusan
>>    3rd Floor Room B
>>
>> and the RFC7992 specification would have rendered it as either this,
>> preserving the semantic labelling of the parts, but loosing the 
>> city-region
>> part:
>>
>>    3rd Floor Room B
>>    4-3-2 Hakusan
>>    Tokyo-to, 112-0001
>>    Japan
>>
>> or would loose all semantic labelling through forced use of 
>> postalLine for
>> all entries.
>>
>> This problem has however been solved to a large degree by modern 
>> libraries.
>> If you can do this *right*, why continue to do it wrong?
>
> Because it's complex, and now you are relying on a specific 
> implementation. Is the algorithm documented?
>
>> postalLine would have been a bit of an improvement, if it hadn't been
>> handicapped by throwing out interesting and valid information through
>> the choice of not permitting any of the existing elements that made 
>> sense
>
> Well, we can discuss it. <postalLine> is the way it is because the 
> design team thought that the level of detail that we had before simply 
> was unneeded and just made things too complex. I guess this discussion 
> supports this point :-)
>
>> for the location's address.  A better choice would have been to add
>> city-region and postal-box as new address elements, and specify the use
>> of country-specific formatting.  The latter is something I could do.
>> I haven't done the former, even if it would have been tempting, given
>> that it would have given a pretty complete set of labelled address 
>> entities
>> to work with.
>>
>> Take the statistics that are available here:
>>    https://datatracker.ietf.org/stats/document/yearly/country/
>
> Understood. I'm just not sure that the fact these kind of stats exists 
> (and apparently are already done with heuristics) justifies the 
> complexity of the vocabulary.
>
>> Forcing the removal of country (and region, and city) labelling if your
>> address needs a non-US address format makes producing that kind of info
>> harder; and it also removes the vCard/hCard annotation made possible by
>> having properly identified address parts.  The current code provides the
>> correct hCard property labels also for non-US addresses.
>>
>> I'm *tired* of having software only work right for one country out of
>> many, and it's not necessary any longer.  Look at all the variations
>
> Yes, that's why we took the complexity out of the vocabulary.
>
>> across the globe, and come and tell me that it is preferred not to do
>> things as right as you can for most of them:
>>
>>    https://en.wikipedia.org/wiki/Postal_address
>>
>> Finally, given an old conversation you had about address formatting, the
>> rendering of the bottom address here may please you:
>>
>> https://durif.tools.ietf.org/src/xml2rfc/html/show/elements.html#name-authors-addresses 
>>
>> ...
>
> Best regards, Julian
>
> _______________________________________________
> xml2rfc-dev mailing list
> xml2rfc-dev@ietf.org
> https://www.ietf.org/mailman/listinfo/xml2rfc-dev