Re: [Asrg] Iteration #3.

Alessandro Vesely <vesely@tana.it> Sun, 07 February 2010 09:32 UTC

Chris Lewis wrote:
>>> Astute readers will notice that (1) is a trivially simple MUA hack, and 
>>> that (2) isn't necessary for many installations wanting TiS info (for 
>>> filter tuning) and don't forward them anywhere.
>>
>> For filter training you also need "ham" type submissions,
>
> Only if you're doing server-based Bayes and the abuse handling mechanism 
> is separate from the inbound mail flow. You don't necessarily need 
> user-end ham submissions if you can get at the mailflow.

IME, FPs increase after spam training, and the only remedy I've been 
able to find is ham training with messages manually retrieved 
from spam folders. For POP3 users, you have no way to know which 
messages they discard, either manually or automatically, and which 
they consider ham. How do you tune the filter using just the mail flow?
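
A minimal sketch of the effect described above, assuming a toy Bayes filter that counts each token once per message. The TinyBayes class, its add-one smoothing, and the sample messages are all made up for illustration and do not reflect any real filter. It scores one legitimate message at three points: at baseline, after spam-only training, and after the same message has been rescued from the spam folder and trained as ham.

```python
import math
from collections import Counter


class TinyBayes:
    """Toy token-count Bayes filter (hypothetical, for illustration only)."""

    def __init__(self):
        self.tokens = {"spam": Counter(), "ham": Counter()}
        self.messages = {"spam": 0, "ham": 0}

    def train(self, label, text):
        # Count each distinct token once per message.
        self.tokens[label].update(set(text.lower().split()))
        self.messages[label] += 1

    def spam_probability(self, text):
        # Log-odds of spam vs. ham, with add-one smoothing on priors and tokens.
        n_spam, n_ham = self.messages["spam"], self.messages["ham"]
        log_odds = math.log((n_spam + 1) / (n_ham + 1))
        for tok in set(text.lower().split()):
            p_spam = (self.tokens["spam"][tok] + 1) / (n_spam + 2)
            p_ham = (self.tokens["ham"][tok] + 1) / (n_ham + 2)
            log_odds += math.log(p_spam / p_ham)
        return 1 / (1 + math.exp(-log_odds))


f = TinyBayes()
# Assumed balanced baseline corpus.
f.train("ham", "meeting notes attached")
f.train("ham", "your invoice for the project")
f.train("spam", "cheap pills now")
f.train("spam", "free watches now")

legit = "your order has shipped"
before = f.spam_probability(legit)

# Users report spam only; some of it shares vocabulary with legitimate mail.
f.train("spam", "your order of pills has shipped")
f.train("spam", "order cheap watches shipped free")
after_spam = f.spam_probability(legit)

# The legitimate message is retrieved from a spam folder and trained as ham.
f.train("ham", legit)
after_ham = f.spam_probability(legit)

print(f"baseline={before:.3f} after spam training={after_spam:.3f} "
      f"after ham training={after_ham:.3f}")
```

In this sketch the score of the legitimate message rises after the spam-only training and falls again once the message is trained as ham. The last step needs the message and the user's verdict on it, which is exactly what is missing for POP3 users.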