Re: [weirds] Internationalization Issues

Byron Ellacott <bje@apnic.net> Wed, 24 October 2012 05:13 UTC

Return-Path: <bje@apnic.net>
X-Original-To: weirds@ietfa.amsl.com
Delivered-To: weirds@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 674BD21F8CC9 for <weirds@ietfa.amsl.com>; Tue, 23 Oct 2012 22:13:02 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[AWL=-0.000, BAYES_00=-2.599, NO_RELAYS=-0.001, WEIRD_PORT=0.001]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ZIkvK04Umg5k for <weirds@ietfa.amsl.com>; Tue, 23 Oct 2012 22:13:01 -0700 (PDT)
Received: from asmtp.apnic.net (asmtp.apnic.net [IPv6:2001:dc0:2001:11::199]) by ietfa.amsl.com (Postfix) with ESMTP id DDC9C21F8CCA for <weirds@ietf.org>; Tue, 23 Oct 2012 22:13:00 -0700 (PDT)
Received: from [IPv6:2001:dc0:a000:4:819d:25c8:1fc:5926] (unknown [IPv6:2001:dc0:a000:4:819d:25c8:1fc:5926]) (using TLSv1 with cipher AES128-SHA (128/128 bits)) (No client certificate requested) by asmtp.apnic.net (Postfix) with ESMTP id 75FF1B6850; Wed, 24 Oct 2012 15:12:58 +1000 (EST)
Content-Type: multipart/signed; boundary="Apple-Mail=_75993298-E671-4CD4-ACF5-5E562136562C"; protocol="application/pkcs7-signature"; micalg="sha1"
Mime-Version: 1.0 (Mac OS X Mail 6.2 \(1499\))
From: Byron Ellacott <bje@apnic.net>
In-Reply-To: <50877817.9060805@nic.ad.jp>
Date: Wed, 24 Oct 2012 15:12:58 +1000
Message-Id: <1045CB70-C1E9-48F9-9E3C-6F9032581650@apnic.net>
References: <CCA6B194.DE13%andy@arin.net> <6C6109C6-0FA3-40A0-9562-A8F55F178003@apnic.net> <50875975.8090908@jprs.co.jp> <50877817.9060805@nic.ad.jp>
To: MasaYUKI Okada <okadams@nic.ad.jp>
X-Mailer: Apple Mail (2.1499)
Cc: weirds@ietf.org
Subject: Re: [weirds] Internationalization Issues
X-BeenThere: weirds@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "WHOIS-based Extensible Internet Registration Data Service \(WEIRDS\)" <weirds.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/weirds>, <mailto:weirds-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/weirds>
List-Post: <mailto:weirds@ietf.org>
List-Help: <mailto:weirds-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/weirds>, <mailto:weirds-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 24 Oct 2012 05:13:02 -0000

Thank you both for your input.  If you are intending to implement a weirds service down the line, what would be your expectations on language support in the protocol?

  Byron

On 24/10/2012, at 3:09 PM, MasaYUKI Okada <okadams@nic.ad.jp> wrote:

> Byron-san,
> 
> I'm Masayuki Okada at JPNIC.
> 
> Our WHOIS system(whois.nic.ad.jp:43) will response English(ASCII)
> if '/e' is put into the character of the last of an input. 
> 
> By default, it answers in Japanese(ISO-2022-JP). 
> 
> About main registration records, JPNIC are collecting both Japanese 
> and English as an mandantory.
> 
> just for reference:
>  - How to use JPNIC WHOIS - 
>   http://www.nic.ad.jp/en/db/whois/
> 
> --
> Masayuki Okada
> JPNIC
> 
> (2012/10/24 11:59), Kentaro Mori wrote:
>> Byron-san and folks,
>> 
>> I'm Kentaro Mori from JPRS.
>> (sorry for late reply)
>> 
>> For Whois, JPRS collects English data as well as native (Japanese) one
>> at the time registrant applies to .JP domain name registration.
>> Additionally, the English data doesn't cover all of Japanese data,
>> e.g., it is partial.
>> ISO-2022-JP as character-set was a normal choice when JPRS (more
>> correctly, JPNIC at that time) started Whois service, though we may have
>> alternative choice such as UTF-8 now.
>> 
>> --Kentaro
>> 
>> (2012/10/22 9:56), Byron Ellacott wrote:
>>> 
>>> On 19/10/2012, at 9:33 PM, Andy Newton <andy@arin.net> wrote:
>>> 
>>>> On 10/18/12 8:40 PM, "Byron Ellacott" <bje@apnic.net> wrote:
>>>> 
>>>>> Indicate to the end user that it's not a native language?
>>>>> Auto-translate?
>>> 
>>> Murray has the right sense of what I meant, for both of these.
>>> 
>>>>> Negotiate for native language with Accepts-Language, if indicated as
>>>>> possible via a Vary header?
>>>> 
>>>> That's HTTP layer stuff. We're talking about embedding multiple language
>>>> tags in the response.
>>> 
>>> Are we?  I thought draft-sheng-weirds-icann-rws-dnrd-01 sect. 7.3
>>> suggested a single language tag for the entire response, with "possible
>>> considerations" of multiple language tags.
>>> 
>>> But, with this point, what I'm suggesting is that the user of a particular
>>> client likely has one or a few preferred languages, which they could
>>> potentially indicate to the server, in the event that the server has multiple
>>> translations.  This would be applicable for mixed language responses as
>>> well as single language responses, since it only indicates a client preference,
>>> not a strict requirement.
>>> 
>>> My primary perspective on this entire subject is that whatever mechanisms
>>> or systems indicate language or language preference need to be optional,
>>> and should support reasonable use cases for current or likely future operators.
>>> I think there's a use case for language preference indication, as per below,
>>> and I think Ning is suggesting a use case for tagging the language of an
>>> entire response, inline in the response.  What are your (collective "your")
>>> thoughts on how reasonable these use cases are?
>>> 
>>>>> Some RDAP services will not support multiple languages meaningfully, but
>>>>> there are existing whois services that provide (non-standard, varying)
>>>>> ways to indicate a preferred language on query, with multiple language
>>>>> options available for many response fields.
>>>> 
>>>> Can you provide an example of one of these services so we can query it?
>>>> That would go a long way in helping shape this need, I would think. Are
>>>> there registries that collect contact data in multiple languages?
>>> 
>>> $ whois -h whois.nic.ad.jp -- 'NET 113.32.19.157'
>>> $ whois -h whois.nic.ad.jp -- 'NET 113.32.19.157 /e'
>>> 
>>> $ whois -h whois.jprs.jp -- 'jprs.jp'
>>> $ whois -h whois.jprs.jp -- 'jprs.jp /e'
>>> 
>>> The data labels are sometimes translated, sometimes not.  In the native
>>> language responses, there's often an English translation.  JPRS includes
>>> an English help/info block even for the native language response.  I don't
>>> know for sure if they collect the information in multiple languages, though
>>> I think they do - any JPRS or JPNIC operators on the list to confirm?
>>> Character set is ISO-2022-JP.
>>> 
>>> I don't know if there are other services with such a switch mechanism,
>>> either - we're all aware of how hard it is to find out what's actually done
>>> out there on port 43 :-) - but for another comparison whois.kisa.kr returns
>>> both native and english output, at least for "kisa.kr".  Character set is
>>> EUC-KR.
>>> 
>>>  Byron
>>> 
>>> 
>>> 
>>> _______________________________________________
>>> weirds mailing list
>>> weirds@ietf.org
>>> https://www.ietf.org/mailman/listinfo/weirds
>>> 
>> 
>> 
>> 
>> _______________________________________________
>> weirds mailing list
>> weirds@ietf.org
>> https://www.ietf.org/mailman/listinfo/weirds
>> 
> 
> _______________________________________________
> weirds mailing list
> weirds@ietf.org
> https://www.ietf.org/mailman/listinfo/weirds

-- 
Byron Ellacott                  email:           bje@apnic.net
Technical Director, APNIC       sip:        bje@voip.apnic.net
http://www.apnic.net            phone:         +61 7 3858 3100
________________________________________________________________________
 * Sent by email to save paper. Print only if necessary.