[PD] Mailing list archive search: Remove attachment.htm results?

Peter P. peterparker at fastmail.com
Tue Apr 23 12:16:17 CEST 2024


* IOhannes m zmoelnig <zmoelnig at iem.at> [2024-04-23 12:06]:
> On 4/23/24 08:56, Peter P. wrote:
> > Hi,
> > 
> > The search function on https://lists.puredata.info/pipermail/pd-list//
> > is great, but a large number of search results are of the form
> > "/pipermail/pd-list/attachments/20160527/3e480100/attachment.html"
> > and are more or less unreadable.
> > 
> > I would like to suggest to exclude them from the indexing or from the
> > results if possible.
> 
> 
> what you are seeing is emails that come with both HTML and plaintext.
> 
> typically, a mail-client that composes HTML mail, will include the same
> information in a plaintext part of the email. but there's not really
> anything enforcing this (if you are usually reading plaintext emails, you
> will know this "you need an HTML-capable mail client to read this" message)
Thanks, indeed known to me.
 
> as such, i do not think there's anything wrong with including the HTML parts
> (even if they are rendered verbatim) in the search results, as they might
> contain information otherwise unavailable.
The main issue is, that I don't get the email's subject for the HTML
emails, do you?

> in any case, i'm afraid this is rather low priority.
I don't know. It renders about 50% of the results useless, at least to
me, and even more so to less experienced users.
> in the meantime you could try filtering out the HTML attachments by
> excluding some html tag (e.g. adding ` -div ` to the search terms helps for
> me; however, this will also exclude mails that discuss the use of [div]...)
Thanks, will try this bit of inside knowledge!

best,
Peter





More information about the Pd-list mailing list