public inbox for gcc@gcc.gnu.org
 help / color / mirror / Atom feed
From: Jeff Law <law@redhat.com>
To: "Eric S. Raymond" <esr@thyrsus.com>,
	GCC Development <gcc@gcc.gnu.org>,
	fallenpegasus@gmail.com
Subject: Re: Repo conversion troubles.
Date: Mon, 09 Jul 2018 19:40:00 -0000	[thread overview]
Message-ID: <ed09c23f-de47-3fbc-886b-e4e843e0c0b8@redhat.com> (raw)
In-Reply-To: <20180709191911.648443A4AA7@snark.thyrsus.com>

On 07/09/2018 01:19 PM, Eric S. Raymond wrote:
> Last time I did a comparison between SVN head and the git conversion
> tip they matched exactly.  This time I have mismatches in the following
> files.
> 
> libtool.m4
> libvtv/ChangeLog
> libvtv/configure
> libvtv/testsuite/lib/libvtv.exp
> ltmain.sh
> lto-plugin/ChangeLog
> lto-plugin/configure
> lto-plugin/lto-plugin.c
> MAINTAINERS
> maintainer-scripts/ChangeLog
> maintainer-scripts/crontab
> maintainer-scripts/gcc_release
> Makefile.def
> Makefile.in
> Makefile.tpl
> zlib/configure
> zlib/configure.ac
> 
> Now I'll explain what this means and why it's a serious problem.
[ ... ]
That's weird -- let's take maintainer-scripts/crontab as our victim.
That file (according to the git mirror) has only changed on the trunk 3
times in the last year.  They're all changes from Jakub and none look
unusual at all.  Just trivial looking updates.

libvtv.exp is another interesting file.  It changed twice in early May
of this year.  Prior to that it hadn't changed since 2015.


[ ... ]

> 
> There are brute-force ways to pin down such malformations, but none of
> them are practical at the huge scale of this repository.  The main
> problem here wouldn't reposurgeon itself but the fact that Subversion
> checkouts on a repo this large are very slow. I've seen a single one
> take 12 hours; an attempt at a whole bisection run to pin down the
> divergence point on trunk would therefore probably cost log2 of the
> commit length times that, or about 18 days.
I'm not aware of any such merges, but any that occurred most likely
happened after mid-April when the trunk was re-opened for development.

I'm assuming that it's only work that merges onto the trunk that's
potentially problematical here.

> 
> So...does that list of changed files look familar to anyone?  If we can
> identify the revision number of the bad commit, the odds of being able
> to unscramble this mess go way up.  They still aren't good, not when
> merely loading the repository for examination takes over four hours,
> but they would way better than if I were starting from zero.
They're familiar only in the sense that I know what those files are :-)

Jeff

  reply	other threads:[~2018-07-09 19:40 UTC|newest]

Thread overview: 20+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2018-07-09 19:19 Eric S. Raymond
2018-07-09 19:40 ` Jeff Law [this message]
2018-07-09 19:57   ` Eric S. Raymond
2018-07-09 20:01     ` Jeff Law
2018-07-09 20:06       ` Eric S. Raymond
2018-07-09 19:46 ` Bernd Schmidt
2018-07-09 19:59   ` Eric S. Raymond
2018-07-10  1:13     ` Alexandre Oliva
2018-07-20 21:48       ` Joseph Myers
2018-07-21  2:04         ` Eric S. Raymond
2018-07-10  8:20     ` Jonathan Wakely
2018-07-10  8:34       ` Jonathan Wakely
2018-07-10 10:48         ` Eric S. Raymond
2018-07-20 22:06       ` Joseph Myers
2018-07-09 20:04 ` Richard Biener
2018-07-09 20:20   ` Eric S. Raymond
2018-07-10  4:57     ` Richard Biener
2018-07-10 11:22     ` Philip Martin
2018-07-20 21:43     ` Joseph Myers
2018-07-20 23:48       ` Eric S. Raymond

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=ed09c23f-de47-3fbc-886b-e4e843e0c0b8@redhat.com \
    --to=law@redhat.com \
    --cc=esr@thyrsus.com \
    --cc=fallenpegasus@gmail.com \
    --cc=gcc@gcc.gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).