From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 25186 invoked by alias); 5 Aug 2007 18:45:44 -0000 Received: (qmail 25018 invoked by uid 22791); 5 Aug 2007 18:45:43 -0000 X-Spam-Check-By: sourceware.org Received: from pool-71-248-179-24.bstnma.fios.verizon.net (HELO ednor.cgf.cx) (71.248.179.24) by sourceware.org (qpsmtpd/0.31) with ESMTP; Sun, 05 Aug 2007 18:45:41 +0000 Received: by ednor.cgf.cx (Postfix, from userid 201) id 938082B352; Sun, 5 Aug 2007 14:46:04 -0400 (EDT) Date: Sun, 05 Aug 2007 18:45:00 -0000 From: Christopher Faylor To: overseers@sourceware.org Subject: Re: questions about blocking disclaimers Message-ID: <20070805184604.GA14471@ednor.casa.cgf.cx> Mail-Followup-To: overseers@sourceware.org References: <20070805053429.GA12910@ednor.casa.cgf.cx> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20070805053429.GA12910@ednor.casa.cgf.cx> User-Agent: Mutt/1.5.15 (2007-04-06) Mailing-List: contact overseers-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: , Sender: overseers-owner@sourceware.org X-SW-Source: 2007-q3/txt/msg00049.txt.bz2 On Sun, Aug 05, 2007 at 01:34:29AM -0400, Christopher Faylor wrote: >As many people know, I'm working on modifying the spam filter software >so that it will block email which contains unenforceable company >disclaimers. I've been surveying the archives for samples of >disclaimers to seed the spam blocking regex. I'd like to be able to >include the match that triggered the disclaimer bounce so that people >can see what is causing the problem but that may increase the size >of the bounce. Is that ok, though? > >Here's what I have for the bounce message right now: > > Sorry. Something in the body of your message triggered a pattern match > which indicates that you included a company disclaimer or privacy notice > in your message to a mailing list at this site. Inclusion of these types > of disclaimers is against site policy. We do not accept email of this form. > > See: > http://sourceware.org/lists.html > http://gcc.gnu.org/lists.html#policies > >Please tell me what works or doesn't work in the above language. Here are the regexes that I have now (these are perl regexes): \n\s+This email [a-z\s]+is for the exclusive use (?i:if (?i:the reader is|you are) not the intended recipient) \n\s+This e-?mail (?:is|may be) (?:contain\s+)?(?:privileged|confidential) read[a-z\s,]{20,}by the intended recipient CONFIDENTIALITY AND SECURITY NOTICE (?i:if you received this email in error) And here's another stab at the bounce message: Sorry. Something in the body of your message triggered a match indicating that your email contained a company disclaimer or privacy notice. Inclusion of these types of disclaimers in mail to mailing lists is against site policy. For more information see: http://sourceware.org/lists.html http://gcc.gnu.org/lists.html#policies Matched: "" I'm not going to go live with this until I get some feedback on the above so please let me know what works or doesn't work. cgf