Re: [weirds] Internationalization Issues

MasaYUKI Okada <okadams@nic.ad.jp> Wed, 24 October 2012 05:09 UTC

Return-Path: <okadams@nic.ad.jp>
X-Original-To: weirds@ietfa.amsl.com
Delivered-To: weirds@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 55F2021F8CC1 for <weirds@ietfa.amsl.com>; Tue, 23 Oct 2012 22:09:47 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.598
X-Spam-Level:
X-Spam-Status: No, score=-2.598 tagged_above=-999 required=5 tests=[BAYES_00=-2.599, WEIRD_PORT=0.001]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id GefP03nAbXqo for <weirds@ietfa.amsl.com>; Tue, 23 Oct 2012 22:09:45 -0700 (PDT)
Received: from dist3.nic.ad.jp (dist3.nic.ad.jp [IPv6:2001:dc2:1000:2004::3]) by ietfa.amsl.com (Postfix) with ESMTP id B7BD921F8C9F for <weirds@ietf.org>; Tue, 23 Oct 2012 22:09:45 -0700 (PDT)
Received: from dist3.nic.ad.jp (localhost [127.0.0.1]) by localhost (Postfix) with SMTP id 3FAA81F00C4; Wed, 24 Oct 2012 14:09:43 +0900 (JST)
Received: from mbox2.nic.ad.jp (mbox2.nic.ad.jp [10.10.10.1]) by dist3.nic.ad.jp (Postfix) with ESMTP id 3655D1F00BB; Wed, 24 Oct 2012 14:09:43 +0900 (JST)
Received: from [10.10.11.157] (k11-pc157.nic.ad.jp [10.10.11.157]) by mbox2.nic.ad.jp (Postfix) with ESMTP id 0E84C158003F; Wed, 24 Oct 2012 14:09:43 +0900 (JST)
Message-ID: <50877817.9060805@nic.ad.jp>
Date: Wed, 24 Oct 2012 14:09:43 +0900
From: MasaYUKI Okada <okadams@nic.ad.jp>
User-Agent: Mozilla/5.0 (Windows NT 5.1; rv:16.0) Gecko/20121010 Thunderbird/16.0.1
MIME-Version: 1.0
To: Kentaro Mori <kentaro@jprs.co.jp>, weirds@ietf.org
References: <CCA6B194.DE13%andy@arin.net> <6C6109C6-0FA3-40A0-9562-A8F55F178003@apnic.net> <50875975.8090908@jprs.co.jp>
In-Reply-To: <50875975.8090908@jprs.co.jp>
X-Enigmail-Version: 1.4.5
Content-Type: text/plain; charset="ISO-8859-1"
Content-Transfer-Encoding: 7bit
Subject: Re: [weirds] Internationalization Issues
X-BeenThere: weirds@ietf.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: "WHOIS-based Extensible Internet Registration Data Service \(WEIRDS\)" <weirds.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/weirds>, <mailto:weirds-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/weirds>
List-Post: <mailto:weirds@ietf.org>
List-Help: <mailto:weirds-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/weirds>, <mailto:weirds-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 24 Oct 2012 05:09:47 -0000

Byron-san,

I'm Masayuki Okada at JPNIC.

Our WHOIS system(whois.nic.ad.jp:43) will response English(ASCII)
 if '/e' is put into the character of the last of an input. 

By default, it answers in Japanese(ISO-2022-JP). 

About main registration records, JPNIC are collecting both Japanese 
and English as an mandantory.

just for reference:
  - How to use JPNIC WHOIS - 
   http://www.nic.ad.jp/en/db/whois/

--
Masayuki Okada
JPNIC

(2012/10/24 11:59), Kentaro Mori wrote:
> Byron-san and folks,
> 
> I'm Kentaro Mori from JPRS.
> (sorry for late reply)
> 
> For Whois, JPRS collects English data as well as native (Japanese) one
> at the time registrant applies to .JP domain name registration.
> Additionally, the English data doesn't cover all of Japanese data,
> e.g., it is partial.
> ISO-2022-JP as character-set was a normal choice when JPRS (more
> correctly, JPNIC at that time) started Whois service, though we may have
> alternative choice such as UTF-8 now.
> 
> --Kentaro
> 
> (2012/10/22 9:56), Byron Ellacott wrote:
>>
>> On 19/10/2012, at 9:33 PM, Andy Newton <andy@arin.net> wrote:
>>
>>> On 10/18/12 8:40 PM, "Byron Ellacott" <bje@apnic.net> wrote:
>>>
>>>> Indicate to the end user that it's not a native language?
>>>> Auto-translate?
>>
>> Murray has the right sense of what I meant, for both of these.
>>
>>>> Negotiate for native language with Accepts-Language, if indicated as
>>>> possible via a Vary header?
>>>
>>> That's HTTP layer stuff. We're talking about embedding multiple language
>>> tags in the response.
>>
>> Are we?  I thought draft-sheng-weirds-icann-rws-dnrd-01 sect. 7.3
>> suggested a single language tag for the entire response, with "possible
>> considerations" of multiple language tags.
>>
>> But, with this point, what I'm suggesting is that the user of a particular
>> client likely has one or a few preferred languages, which they could
>> potentially indicate to the server, in the event that the server has multiple
>> translations.  This would be applicable for mixed language responses as
>> well as single language responses, since it only indicates a client preference,
>> not a strict requirement.
>>
>> My primary perspective on this entire subject is that whatever mechanisms
>> or systems indicate language or language preference need to be optional,
>> and should support reasonable use cases for current or likely future operators.
>> I think there's a use case for language preference indication, as per below,
>> and I think Ning is suggesting a use case for tagging the language of an
>> entire response, inline in the response.  What are your (collective "your")
>> thoughts on how reasonable these use cases are?
>>
>>>> Some RDAP services will not support multiple languages meaningfully, but
>>>> there are existing whois services that provide (non-standard, varying)
>>>> ways to indicate a preferred language on query, with multiple language
>>>> options available for many response fields.
>>>
>>> Can you provide an example of one of these services so we can query it?
>>> That would go a long way in helping shape this need, I would think. Are
>>> there registries that collect contact data in multiple languages?
>>
>> $ whois -h whois.nic.ad.jp -- 'NET 113.32.19.157'
>> $ whois -h whois.nic.ad.jp -- 'NET 113.32.19.157 /e'
>>
>> $ whois -h whois.jprs.jp -- 'jprs.jp'
>> $ whois -h whois.jprs.jp -- 'jprs.jp /e'
>>
>> The data labels are sometimes translated, sometimes not.  In the native
>> language responses, there's often an English translation.  JPRS includes
>> an English help/info block even for the native language response.  I don't
>> know for sure if they collect the information in multiple languages, though
>> I think they do - any JPRS or JPNIC operators on the list to confirm?
>> Character set is ISO-2022-JP.
>>
>> I don't know if there are other services with such a switch mechanism,
>> either - we're all aware of how hard it is to find out what's actually done
>> out there on port 43 :-) - but for another comparison whois.kisa.kr returns
>> both native and english output, at least for "kisa.kr".  Character set is
>> EUC-KR.
>>
>>   Byron
>>
>>
>>
>> _______________________________________________
>> weirds mailing list
>> weirds@ietf.org
>> https://www.ietf.org/mailman/listinfo/weirds
>>
> 
> 
> 
> _______________________________________________
> weirds mailing list
> weirds@ietf.org
> https://www.ietf.org/mailman/listinfo/weirds
>