From: Mark Wielaard <mark@klomp.org>
To: Overseers mailing list <overseers@sourceware.org>
Cc: Simon Marchi <simon.marchi@polymtl.ca>
Subject: Re: inbox.sourceware.org experiment
Date: Wed, 24 Aug 2022 23:06:06 +0200
Message-ID: <YwaSvnrccPud3IHQ@wildebeest.org>
In-Reply-To: <YwX3zyp8d6GsHlG+@wildebeest.org>

Hi,

On Wed, Aug 24, 2022 at 12:05:03PM +0200, Mark Wielaard via Overseers wrote:
> I noticed two issues some lists seem to have a bad/corrupt xapian
> database and generate an error while indexing (gcc-patches).

I tried reindexing and compacting the largest lists. This did not
help. But the compacting did reduce the disk size of the xapian
indexes by 10GB (!).

There is now a bit more logging in
/home/inbox/logs/public-inbox-mda.out.log

It looks like this error:

rollback ineffective with AutoCommit enabled at
/usr/share/perl5/vendor_perl/PublicInbox/V2Writable.pm line 621.
checkpoint: Exception: Error writing block 147232
shard close: Exception: Error writing block 147236

This only happens after importing a new gcc-patches message. The
message isn't fully indexed, but can be referenced normally. It won't
show up in full text searches though. I haven't figured out why. I'll
ask upstream how to better debug this.
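
To see how often this happens, the log can be scanned for the block
write errors. This is only a rough sketch: the regular expression is
guessed from the three log lines quoted above, and find_block_errors
is a made-up helper name, not part of public-inbox.

```python
import re

# Pattern guessed from the log excerpt above, e.g.
# "checkpoint: Exception: Error writing block 147232"
BLOCK_ERROR = re.compile(
    r"^(?P<stage>[\w ]+): Exception: Error writing block (?P<block>\d+)"
)


def find_block_errors(lines):
    """Return (stage, block number) for every Xapian block write error."""
    hits = []
    for line in lines:
        m = BLOCK_ERROR.match(line.strip())
        if m:
            hits.append((m.group("stage"), int(m.group("block"))))
    return hits


# The lines quoted above, used here as sample input.
sample = [
    "rollback ineffective with AutoCommit enabled at",
    "/usr/share/perl5/vendor_perl/PublicInbox/V2Writable.pm line 621.",
    "checkpoint: Exception: Error writing block 147232",
    "shard close: Exception: Error writing block 147236",
]

for stage, block in find_block_errors(sample):
    print(f"{stage}: block {block}")
```

Against the real log one would pass an open file handle for
/home/inbox/logs/public-inbox-mda.out.log instead of the sample list.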

> emails with slashes / in the Message-ID sometimes get wrongly
> escaped and appear to not be in the archive while they really are.
> e.g. the message I am replying to shows as:
> https://inbox.sourceware.org/overseers/YwVP8+LHvyLzUG%2F+@wildebeest.org/
> But should be:
> https://inbox.sourceware.org/overseers/YwVP8+LHvyLzUG/+@wildebeest.org/

This isn't a big deal except when the / is at the end of the
Message-ID. Unfortunately that happens for bugzilla emails, whose
Message-IDs end in @http.sourceware.org/bugzilla/. That last slash
seems to be a real problem. I don't know a workaround for that yet.

You can see that public-inbox does know about the Message-ID by
searching for:
https://inbox.sourceware.org/libabigail/bug-29464-9487@http.sourceware.org/bugzilla//
That will suggest the actual URL as a "partial match", but when
following that link the slashes get escaped again... I will ask
upstream if there is any solution for this.
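
To make the two URL forms concrete, here is a small sketch that builds
both from the same Message-ID. It only illustrates the percent-encoding
difference. It is not how public-inbox itself builds its links, and
inbox_url and its escape_slash parameter are names made up for this
example.

```python
from urllib.parse import quote

BASE = "https://inbox.sourceware.org"  # host taken from the URLs above


def inbox_url(inbox, message_id, escape_slash):
    """Build an archive URL for a Message-ID (without angle brackets).

    Hypothetical helper: the only thing it varies is whether "/" is
    percent-encoded as %2F or left as a literal path separator.
    """
    safe = "@+" + ("" if escape_slash else "/")
    return f"{BASE}/{inbox}/{quote(message_id, safe=safe)}/"


mid = "YwVP8+LHvyLzUG/+@wildebeest.org"
escaped = inbox_url("overseers", mid, escape_slash=True)
literal = inbox_url("overseers", mid, escape_slash=False)
print(escaped)
print(literal)

# A bugzilla Message-ID ends in "/", so the literal form ends in "//".
bz = "bug-29464-9487@http.sourceware.org/bugzilla/"
print(inbox_url("libabigail", bz, escape_slash=False))
```

The trailing-slash case is the awkward one: in the literal form the
slash that belongs to the Message-ID cannot be told apart from the
slash that ends the URL.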

Finally there are some lists that accept HTML emails (by stripping off
the HTML part). public-inbox however simply rejects those emails.

*** We only accept plain-text mail, No HTML ***

Again, we should ask upstream if there could be an option to accept
just the text/plain part of such emails.

Note that such emails do end up in the .public-inbox/emergency
mailbox, so in theory we could remove the text/html part and then
reinsert the message.
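
A rough sketch of what removing the text/html part could look like,
using Python's standard email package. This is an assumption about
how one might do it, not something that has been tried on the
emergency mailbox. plain_text_only is a made-up helper name, and note
that it keeps only the text/plain body, so any attachments would be
dropped as well.

```python
from email import message_from_bytes, policy


def plain_text_only(raw):
    """Return the message with its body reduced to the text/plain part.

    Hypothetical helper: headers such as Message-ID and Subject are
    kept, all Content-* headers and other MIME parts are discarded.
    """
    msg = message_from_bytes(raw, policy=policy.default)
    if not msg.is_multipart():
        return raw  # already a single-part message, leave untouched
    body = msg.get_body(preferencelist=("plain",))
    if body is None:
        return raw  # no text/plain part to fall back on
    text = body.get_content()
    msg.clear_content()    # drop Content-* headers and all MIME parts
    msg.set_content(text)  # re-add the body as a single text/plain part
    return msg.as_bytes()
```

The result could then be fed back through the normal delivery path.
How best to reinsert it is a separate question.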

So there are some issues, but in general I think it works just fine
now.

Cheers,

Mark
