[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Duplicate In-Reply-To entries in reply buffer

To: "Bashford, Donald" <Don.Bashford@stjude.org>
Subject: Re: Duplicate In-Reply-To entries in reply buffer
From: Kazuhiro Ito <kzhr@d1.dion.ne.jp>
Date: Tue, 24 Jul 2012 23:26:18 +0900
Cc: wl-en@ml.gentei.org
In-reply-to: <rfwy5mba3je.wl%Don.Bashford@stjude.org>
List-help: <mailto:wl-en-ctl@ml.gentei.org?body=help>
List-id: wl-en.ml.gentei.org
List-owner: <mailto:wl-en-admin@ml.gentei.org>
List-post: <mailto:wl-en@ml.gentei.org>
List-software: fml [fml 4.0 STABLE (20040215/4.0.4_BETA)]
List-unsubscribe: <mailto:wl-en-ctl@ml.gentei.org?body=unsubscribe>
References: <rfwhatmhqxt.wl%Don.Bashford@stjude.org> <20120706133200.058FC2C803A@msa105.auone-net.jp> <87bojhng9t.wl%dmaus@ictsoc.de> <20120716011649.786232C803A@msa105.auone-net.jp> <20120716080148.0FD6D34803A@msa103.auone-net.jp> <rfwliifuzbi.wl%Don.Bashford@stjude.org> <20120721081531.333A634803A@msa103.auone-net.jp> <87d33ob7bk.wl%dmaus@ictsoc.de> <rfwy5mba3je.wl%Don.Bashford@stjude.org>
Reply-to: wl-en@ml.gentei.org
User-agent: Wanderlust/2.15.9 (Almost Unreal) SEMI-EPG/1.14.7 (Harue) FLIM/1.14.9 (Gojō) APEL/10.8 EasyPG/1.0.0 Emacs/24.1.50 (i386-mingw-nt6.1.7601) MULE/6.0 (HANACHIRUSATO)

> 1) Aren't message-id header created by mail transfer agents rather
> than mail user agents? Doesn't this make crazy headers less likely?

Accodring to RFC2822, MUA should create Message-ID, (although I don't
so).  There may be the case that inappropriate settings result in
inappropriate Message-ID header.  But, the problem whould not be the
format of the header but such as uniqueness.  I don't think MUA or MTA
is more problematic than other at least about the format of the
header.

> 2) The "strict" regex we've been discussing shouldn't have that
> dollar sign near the end. It conflicts with the \' in the tests I've
> done.

Of cource, $ should be removed.  Thanks for pointing out.

> 3) Looking again at RFC 5322, I'm dismayed to see that comments (set
> off by parentheses) seem to be allowed in the message-id header field
> both before and after the actual message ID.

(snip)

> 4) the std11 module of flim seems to provide some machinery for
> handling this.

I know FLIM has lexical analyzers, but I didn't know about a comment
on Message-ID: header.  For example, the below code could extract
Message-ID more strictly.

(let ((string "<zzz@example.com>"))
  (let* ((tokens (std11-parse-msg-ids-string string))
	 (id (assq 'msg-id tokens)))
    (setq id
	  (unless (assq 'msg-id (delq id tokens))
	    (std11-addr-to-string (cdr id))))
    ;; Return nil when result is "".
    (when (> (length id) 0) id)))

But FLIM's lexical analyzer is really strict.  If string is invalid
Message-ID, e.g. "<zzz.@example.com>", nil is returned.  I think we
does not have to support invalid Message-ID, but more tolerant would
be better. Therefore, if we use FLIM's lexical analyzer, combination
with other extracting method would be better.

(let ((string "<zzz.@example.com>"))
  (or
   (let* ((tokens (std11-parse-msg-ids-string string))
	  (id (assq 'msg-id tokens)))
     (setq id
	   (unless (assq 'msg-id (delq id tokens))
	     (std11-addr-to-string (cdr id))))
     ;; Return nil when result is "".
     (when (> (length id) 0) id))
   (and (string-match "\\`[ \n\t]*\\(<.+>\\)[ \n\t]*\\'" string)
	(match-string 1 string))))

As you decribed, it is more costly method than current.  I will post
another message to ML about performance issue.

-- 
Kazuhiro Ito

References:
- Duplicate In-Reply-To entries in reply buffer
  - From: "Bashford, Donald" <Don.Bashford@stjude.org>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: Kazuhiro Ito <kzhr@d1.dion.ne.jp>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: David Maus <dmaus@ictsoc.de>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: Kazuhiro Ito <kzhr@d1.dion.ne.jp>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: Kazuhiro Ito <kzhr@d1.dion.ne.jp>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: "Bashford, Donald" <Don.Bashford@stjude.org>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: Kazuhiro Ito <kzhr@d1.dion.ne.jp>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: David Maus <dmaus@ictsoc.de>
- Re: Duplicate In-Reply-To entries in reply buffer
  - From: "Bashford, Donald" <Don.Bashford@stjude.org>

Prev by Date: Re: Duplicate In-Reply-To entries in reply buffer
Next by Date: elmo-msgdb-get-message-id-from-buffer's performance issue
Previous by thread: Re: Duplicate In-Reply-To entries in reply buffer
Next by thread: Re: Duplicate In-Reply-To entries in reply buffer
Index(es):
- Date
- Thread