Re: [MORG] Review of draft-ietf-morg-fuzzy-search-02.txt

Timo Sirainen <tss@iki.fi> Wed, 25 August 2010 17:51 UTC

Return-Path: <tss@iki.fi>
X-Original-To: morg@core3.amsl.com
Delivered-To: morg@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 244B63A687E for <morg@core3.amsl.com>; Wed, 25 Aug 2010 10:51:08 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -105.866
X-Spam-Level:
X-Spam-Status: No, score=-105.866 tagged_above=-999 required=5 tests=[AWL=0.733, BAYES_00=-2.599, RCVD_IN_DNSWL_MED=-4, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id AqsPnVMC+dYU for <morg@core3.amsl.com>; Wed, 25 Aug 2010 10:51:05 -0700 (PDT)
Received: from dovecot.org (dovecot.org [62.236.108.70]) by core3.amsl.com (Postfix) with ESMTP id C9F2C3A686B for <morg@ietf.org>; Wed, 25 Aug 2010 10:51:04 -0700 (PDT)
Received: from [10.134.132.86] (unknown [194.65.5.235]) by dovecot.org (Postfix) with ESMTP id 0B9A4FA88EF; Wed, 25 Aug 2010 20:51:35 +0300 (EEST)
From: Timo Sirainen <tss@iki.fi>
To: Alexey Melnikov <alexey.melnikov@isode.com>
In-Reply-To: <4C739609.5020101@isode.com>
References: <4C5021F0.5020002@isode.com> <AANLkTik3ayOVth5v5gVowi8ybtj=k99n=evgt7YZQYzw@mail.gmail.com> <4C5BDDD3.10405@isode.com> <1282328966.6489.20.camel@kurkku.sapo.corppt.com> <AANLkTi=L+xekVz67V4v84-7Gf-x4MGjAvXmAJFnxPw_k@mail.gmail.com> <4C739609.5020101@isode.com>
Content-Type: text/plain; charset="UTF-8"
Date: Wed, 25 Aug 2010 18:51:34 +0100
Message-ID: <1282758694.6489.372.camel@kurkku.sapo.corppt.com>
Mime-Version: 1.0
X-Mailer: Evolution 2.28.3
Content-Transfer-Encoding: 7bit
Cc: morg@ietf.org, barryleiba@computer.org
Subject: Re: [MORG] Review of draft-ietf-morg-fuzzy-search-02.txt
X-BeenThere: morg@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Messaging Organization <morg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/morg>, <mailto:morg-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/morg>
List-Post: <mailto:morg@ietf.org>
List-Help: <mailto:morg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/morg>, <mailto:morg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 25 Aug 2010 17:51:08 -0000

On Tue, 2010-08-24 at 10:51 +0100, Alexey Melnikov wrote:
> >>Invalid input may also be problematic. For example if the search engine
> >>takes UTF-8 stream as input, it might fail more or less badly when
> >>illegal UTF-8 sequences are fed to it from a message whose character set
> >>was claimed to be UTF-8. This could be avoided by validating all the
> >>input and replacing illegal UTF-8 sequences with the Unicode replacement
> >>character (U+FFFD).
> >>
> I am too. I am not convinced that mentioning Unicode replacement 
> character (U+FFFD) is necessary, but I suppose it is Ok.

So just "This could be avoided by validating all the input." would be
better? Or just remove the whole sentence.