public inbox for gcc@gcc.gnu.org
* Git conversion: fixing email addresses from ChangeLog files
@ 2019-12-27 19:47 Richard Earnshaw (lists)
From: Richard Earnshaw (lists) @ 2019-12-27 19:47 UTC
  To: gcc

[-- Attachment #1: Type: text/plain, Size: 1038 bytes --]

Email addresses from the ChangeLog files are not validated during
commits, so a number of typos exist in the extracted data.  I've
extracted the 'Author:' entry from a prototype conversion and then piped
that through sort and uniq -c.  Subsequent analysis shows the following
addresses/names that are likely in need of some resolution.
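
For anyone wanting to reproduce the counts without the shell pipeline, the
step amounts to roughly the following.  This is only a sketch: it assumes
the log of the prototype conversion is available as lines of text with one
'Author:' line per commit, and the name count_authors is made up for
illustration.

```python
from collections import Counter


def count_authors(log_lines):
    """Count identical 'Author:' lines, much like `sort | uniq -c`."""
    counts = Counter()
    for line in log_lines:
        line = line.strip()
        # Ignore everything in the log that is not an author entry.
        if line.startswith("Author:"):
            counts[line] += 1
    # Sort by entry text so that near-duplicates end up next to each other.
    return sorted(counts.items())
```

Each item in the result is an (entry, count) pair, which can be printed in
the same "count, then entry" layout as the attached listing.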

Several of the names differ only in case and a case-insensitive sort
would ignore these.  But some clearly show typing errors.
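
The case-only variants can be separated from the real typos mechanically.
A minimal sketch, assuming the distinct author strings have already been
collected into a list (case_variants is a hypothetical helper, not part of
any existing tool):

```python
from collections import defaultdict


def case_variants(entries):
    """Return groups of entries that are equal once case is ignored."""
    groups = defaultdict(set)
    for entry in entries:
        # casefold() gives a case-insensitive comparison key.
        groups[entry.casefold()].add(entry)
    # Only groups with more than one distinct spelling are of interest.
    return [sorted(group) for group in groups.values() if len(group) > 1]
```

Anything that still looks duplicated after this grouping differs by more
than case and needs a human decision.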

I've marked with a '*' those entries that I believe should be canonical,
but if you think I've made a mistake somewhere, please let me know.
It's easier to fix this before I write a script to do the changes than
afterwards.
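
Since each group in the attached listing carries a starred entry, a
replacement table could in principle be generated from the listing itself.
The following is a rough sketch only, not the script mentioned above: the
name fixups_from_listing and the regular expression are assumptions based
on the layout of the attachment (an optional '*', a count, then
'Author: ...', with groups separated by blank lines).

```python
import re

# Optional '*', a count, then the author string after 'Author:'.
ENTRY = re.compile(r"^(\*?)\s*(\d+)\s+Author:\s*(.*\S)\s*$")


def fixups_from_listing(text):
    """Map every unstarred author in a group to that group's starred one."""
    fixups = {}
    for block in re.split(r"\n\s*\n", text):
        canonical, others = None, []
        for line in block.splitlines():
            match = ENTRY.match(line)
            if not match:
                continue
            star, _count, author = match.groups()
            if star:
                canonical = author
            else:
                others.append(author)
        # Groups without a starred entry are left alone.
        if canonical is not None:
            for author in others:
                fixups[author] = canonical
    return fixups
```

Groups where the starred choice is disputed would still need editing by
hand before the table is used.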

There are many entries that look like unix machine names rather than
genuine email addresses for a domain.  I have *not* attempted to
canonicalize those, but if some folk would like that doing, please
supply entries in the form

'name to match <email@to-match>': 'name to use <email@to-use>',

and I'll paste those into the conversion script.
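
The quoted form reads like an entry in a Python dict literal, though the
message does not say what language the conversion script uses.  Assuming
that, applying such a table might look like the sketch below.  FIXUPS and
canonical_author are hypothetical names, and the two sample entries are
taken from the attached listing.

```python
# Hypothetical table in the 'name to match': 'name to use' form.
FIXUPS = {
    "Adam Nemet <adambnmet@gmail.com>": "Adam Nemet <adambnemet@gmail.com>",
    "Alexandre Oilva <aoliva@redhat.com>": "Alexandre Oliva <aoliva@redhat.com>",
}


def canonical_author(author, fixups=FIXUPS):
    """Return the replacement for a known-bad author, else the input."""
    # Exact string match only; unknown authors pass through unchanged.
    return fixups.get(author, author)
```

An exact-match lookup keeps the behaviour predictable: an author string is
rewritten only if someone has explicitly listed it.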

[-- Attachment #2: x --]
[-- Type: text/plain, Size: 35880 bytes --]

*      3 Author: Aaron W. LaFramboise <awlaframboise@aol.com>
      1 Author: Aaron W LaFramboise <AWLaFramboise@aol.com>

*      1 Author: Adam Nemet <adambnemet@gmail.com>
      2 Author: Adam Nemet <adambnmet@gmail.com>

      1 Author: Alan Modra <alan@spri.levels.unisa.edu.au>
*      3 Author: Alan Modra <alan@SPRI.Levels.UniSA.Edu.Au>

      1 Author: Aldy Hernandez <aldyh@quesejoda.com>
*      5 Author: Aldy Hernandez <aldy@quesejoda.com>

      1 Author: Alexander Kabaev <kan@freebsd.org>
*      5 Author: Alexander Kabaev <kan@FreeBSD.ORG>

      1 Author: Alexandre Oilva <aoliva@redhat.com>
*   1618 Author: Alexandre Oliva <aoliva@redhat.com>

*    191 Author: Alexandre Petit-Bianco <apbianco@cygnus.com>
      1 Author: Alexandre Petit-Bianco <apbianco@dcygnus.com>

      1 Author: Alex Velenko <alex.velenko@arm.com>
*     37 Author: Alex Velenko <Alex.Velenko@arm.com>

*    228 Author: Andreas Krebbel <krebbel1@de.ibm.com>
      1 Author: Andreas Krebbel <krebbel@de.ibm.com>

      1 Author: Andre Vieira <andre.simoesdiasvieira@arm>
*    110 Author: Andre Vieira <andre.simoesdiasvieira@arm.com>
      1 Author: Andre Vieira <Andre.SimoesDiasVieira@arm.com>

      2 Author: Andrew Haley <aph@ALL6500>
*      1 Author: Andrew Haley <aph@all6500.kos.to>

*      1 Author: Andrew Haley <aph@littlepinkcloud.com>
      1 Author: Andrew Haley <aph.redhat.com@littlepinkcloud.com>

      1 Author: Andrew Macleod <amacleod@redat.com>
     20 Author: Andrew Macleod <amacleod@redhat.com>
      2 Author: Andrew MacLeod <amacleod@redhat,com>
*    309 Author: Andrew MacLeod <amacleod@redhat.com>
      1 Author: Andrew MacLeod <amacloeod@redhat.com>

      1 Author: Andrew Pinsi <apinski@cavium.com>
*    150 Author: Andrew Pinski <apinski@cavium.com>
      1 Author: Andrew PInski <apinski@cavium.com>

      1 Author: Andrew Pinsiki <pinskia@gmail.com>
*     87 Author: Andrew Pinski <pinskia@gmail.com>
      1 Author: Andrew Pinski <pinski@gmail.com>

*      2 Author: Andrew Pinski <andrew_pinski@caviumnetworks.com>
      1 Author: Andrew Pinski <andrew.pinski@caviumnetworks.com>

      1 Author: Andrew Pinski <andrew_pinskia@playstation.sony.com>
*    218 Author: Andrew Pinski <andrew_pinski@playstation.sony.com>
      2 Author: Andrew Pinski <Andrew_Pinski@playstation.sony.com>

      1 Author: Andrew Pinski <pinskia@ohysics.uc.edu>
*    700 Author: Andrew Pinski <pinskia@physics.uc.edu>
      2 Author: Andrew PInski <pinskia@physics.uc.edu>

      2 Author: Andrey Belevanstev <abel@ispras.ru>
*    191 Author: Andrey Belevantsev <abel@ispras.ru>

      1 Author: Andy Hutchinson <hutchinsonandy@netscape.net>
*      5 Author: Andy Hutchinson <HutchinsonAndy@netscape.net>

*     51 Author: Arnaud Charlet <charlet@act-europe.fr>
      1 Author: Arnaud Charlet <charlet@ACT-Europe.FR>

      3 Author: Artjoms Sinkarovs <artyom.shinakroff@gmail.com>
*      5 Author: Artjoms Sinkarovs <artyom.shinkaroff@gmail.com>

      1 Author: Balaji V Iyer <balaji.v.iyer@intel.com>
*    239 Author: Balaji V. Iyer <balaji.v.iyer@intel.com>

      1 Author: Ben Elliston <bje@au1.ibm.com>
      1 Author: Ben Elliston <bje@au.ibm.co>
*    670 Author: Ben Elliston <bje@au.ibm.com>

      1 Author: Benjamin Kosnik <bkoz@purist.redhat.soma.com>
*     59 Author: Benjamin Kosnik <bkoz@purist.soma.redhat.com>

      1 Author: Benjamin Kosnik <bkoz@redat.com>
*   1275 Author: Benjamin Kosnik <bkoz@redhat.com>
      2 Author: Benjamin Kosnik <bkoz@rehat.com>
      1 Author: Benjamin Kosnik <bkoz@kredhat.com>

      1 Author: bernardo innocenti <bernie@develer.com>
*     80 Author: Bernardo Innocenti <bernie@develer.com>

*     39 Author: Bernd Schmidt <crux@pool.informatik.rwth-aachen.de>
     16 Author: Bernd Schmidt <crux@Pool.Informatik.RWTH-Aachen.DE>

      1 Author: Bernhard Reuther-Fischer <aldot@gcc.gnu.org>
*     55 Author: Bernhard Reutner-Fischer <aldot@gcc.gnu.org>

      1 Author: "Billinghurst, David (CRTS)" <David.Billinghurst@riotinto.com>
*      1 Author: Billinghurst, David <David.Billinghurst@riotinto.com>

*     83 Author: Bill Schmidt <wschmidt@linux.ibm.com>
      2 Author: Bill Schmidt <wshmidt@linux.ibm.com>

*    542 Author: Bill Schmidt <wschmidt@linux.vnet.ibm.com>
      1 Author: Bill Schmidt <wschmidt@Linux.vnet.ibm.com>

      2 Author: bin cheng <bin.cheng@arm.com>
*    354 Author: Bin Cheng <bin.cheng@arm.com>
      2 Author: chengbin <bin.cheng@arm.com>

*     24 Author: Bin Cheng <bin.cheng@linux.alibaba.com>
      4 Author: Bin Cheng <bin.linux@linux.alibaba.com>

*    220 Author: Bob Wilson <bob.wilson@acm.org>
      1 Author: Bob Wilson <bob.wilson@asm.org>

      1 Author: Bradley Lucier <lucier@math.purdue.edu>
*     21 Author: Brad Lucier <lucier@math.purdue.edu>

      1 Author: Brooks Moses <brooks.mose@codesourcery.com>
*    156 Author: Brooks Moses <brooks.moses@codesourcery.com>

      1 Author: Bruce Korb <bkorb@gcc.org>
*    124 Author: Bruce Korb <bkorb@gnu.org>

*     18 Author: Bruce Korb <ddsinc09@ix.netcom.com>
      1 Author: BruceKorb <ddsinc09@ix.netcom.com>

      2 Author: Bruce Korb <korbb@datadesign.com>
*      4 Author: Bruce Korb <korb@datadesign.com>

      1 Author: Bryce McKinlay <bryce@abatross.co.nz>
*    205 Author: Bryce McKinlay <bryce@albatross.co.nz>

      1 Author: Carlos O'Donell <carlos@codesoucery.com>
*     73 Author: Carlos O'Donell <carlos@codesourcery.com>

*    121 Author: Chao-ying Fu <fu@mips.com>
      1 Author: Chao-Ying Fu <fu@mips.com>
      1 Author: Chao-ying Fur <fu@mips.com>

      1 Author: Charles Bayis <charles.baylis@linaro.org>
*     43 Author: Charles Baylis <charles.baylis@linaro.org>

      3 Author: Chen liqin <liqin@sunnorth.com.cn>
*     11 Author: Chen Liqin <liqin@sunnorth.com.cn>

*     96 Author: Christian Bruel <christian.bruel@st.com>
      1 Author: Christian Bruel <Christian.Bruel@st.com>

*    191 Author: Claudiu Zissulescu <claziss@synopsys.com>
      1 Author: Claudiu Zissulescu <claziss@sysnopsys.com>

      1 Author: Dale Johanensen <dalej@apple.com>
      2 Author: Dale Johannese <dalej@apple.com>
      2 Author: Dale Johannesen <dale@apple.com>
      1 Author: dale Johannesen <dalej@apple.com>
      1 Author: Dale Johannesen <dalej.@apple.com>
*    353 Author: Dale Johannesen <dalej@apple.com>

      1 Author: Damian Rouson <damain@sourceryinstitue.org>
*      2 Author: Damian Rouson <damian@sourceryinstitute.org>

      7 Author: Daniel Berlin <dan@dberlin.org>
*    493 Author: Daniel Berlin <dberlin@dberlin.org>
      2 Author: Daniel Berlin <dberlin@dbrelin.org>
      5 Author: Danny Berlin <dberlin@dberlin.org>

*      6 Author: Daniel Kruegler <daniel.kruegler@gmail.com>
      3 Author: Daniel Krugler <daniel.kruegler@gmail.com>

      1 Author: Danny Smith <dannysmith@sourceforge.users.net>
      1 Author: Danny Smith <dannysmith@users.net>
      1 Author: Danny Smith <dannysmith@user.sourceforge.net>
      1 Author: Danny Smith <dannysmith@users.souceforge.net>
      1 Author: Danny Smith <dannysmith@users.sourceforeg.net>
      1 Author: Danny Smith <dannysmith@users,sourceforge.net>
      3 Author: Danny Smith <dannysmith@users.source.forge.net>
*    193 Author: Danny Smith <dannysmith@users.sourceforge.net>
      1 Author: Danny Smith <dannysmith@users.sourcforge.net>
      1 Author: Danny Smith <dannysmith@users.sourforge.net>
      1 Author: Danny Smith <dannysmith@usrs.sourceforge.net>
      1 Author: Danny Smith <dannysmit@users.sourceforge.net>
      2 Author: Danny Smith <danyssmith@users.sourceforge.net>

*    114 Author: David Billinghurst <David.Billinghurst@riotinto.com>
     16 Author: David.Billinghurst <David.Billinghurst@riotinto.com>
      1 Author: David Billinghurst <Davod.Billinghurst@riotinto.com>

      3 Author: David Daney <daney@avtrex.com>
*    145 Author: David Daney <ddaney@avtrex.com>

      1 Author: David Edelshn <dje.gcc@gmail.com>
*    326 Author: David Edelsohn <dje.gcc@gmail.com>
      1 Author: David Edelson <dje.gcc@gmail.com>

      1 Author: David Edelsohn <edelsohn@gnu.og>
      1 Author: David Edelsohn <edelsohn@gnu.ogr>
*    735 Author: David Edelsohn <edelsohn@gnu.org>
      1 Author: David Edelsohn <edesohn@gnu.org>

*     62 Author: David Jee <djee@redhat.com>
      1 Author: David Jee  <djee@redhat.com>

      2 Author: "David O'Brien" <obrien@NUXI.com>
*      1 Author: David O'Brien <obrien@NUXI.com>

*     19 Author: David Ung <davidu@mips.com>
      1 Author: David Ung <davidung@mips.com>

      1 Author: Devang patel <dpatel@apple.com>
*    317 Author: Devang Patel <dpatel@apple.com>
      1 Author: Devang Patel <dpatel@appple.com>

      2 Author: Dhananjay Deshpande <dhananjayd@kpit.com>
      1 Author: "Dhananjay R. Deshpande" <dhananjayd@kpit.com>
*      2 Author: Dhananjay R. Deshpande <dhananjayd@kpit.com>

*    131 Author: Dominique d'Humieres <dominiq@lps.ens.fr>
     11 Author: Dominique Dhumieres <dominiq@lps.ens.fr>
      1 Author: Dominique d'Humières <dominiq@lps.ens.fr>
      1 Author: Dominique d'Humières <dominiq@lps.ens.fr>

      1 Author: Dorit Naishlos <dorit@il.ib.com>
*     92 Author: Dorit Naishlos <dorit@il.ibm.com>
      3 Author: Dorit Naishlos <DORIT@il.ibm.com>

      1 Author: Dorit Nuzman <doirit@il.ibm.com>
*    135 Author: Dorit Nuzman <dorit@il.ibm.com>

*      7 Author: Doug Gregor <dgregor@apple.com>
      1 Author: Doug Gregor <dgregor@apple.com.>

      1 Author: Douglas B. <rupp@gnat.com>
*     84 Author: Douglas B Rupp <rupp@gnat.com>
     35 Author: Douglas B. Rupp <rupp@gnat.com>
      5 Author: Douglas Rupp <rupp@gnat.com>
      2 Author: Douglass B. Rupp <rupp@gnat.com>
      1 Author: Doug Rupp <rupp@gnat.com>

      3 Author: Douglas B Rupp <rupp@adacore.com>
      2 Author: Douglas Rupp <rupp@adacore.com>
*     33 Author: Doug Rupp <rupp@adacore.com>

      5 Author: Ed Schonberg <schonberg@gnat.com>
*     33 Author: Ed Schonberg <schonber@gnat.com>

      3 Author: Eric Botcazou <botcazou@adacore.com>
*   3176 Author: Eric Botcazou <ebotcazou@adacore.com>


      1 Author:  Eric Botcazou <ebotcazou@libertysurf.fr>
*    725 Author: Eric Botcazou <ebotcazou@libertysurf.fr>

      1 Author: Eric Christopher <ecechristo@redhat.com>
*    112 Author: Eric Christopher <echristo@redhat.com>

      1 Author: Erik Edelman <eedelman@acclab.helsinki.fi>
*      2 Author: Erik Edelmann <eedelman@acclab.helsinki.fi>

      2 Author: Fabien Chene <fabien.chene@gmail.com>
*      5 Author: Fabien Chêne <fabien.chene@gmail.com>

*     27 Author: Fabien Chêne <fabien@gcc.gnu.org>
      1 Author: Fabien Chêne <fabien@gcc.gnu.org>

*    359 Author: Francois-Xavier Coudert <fxcoudert@gcc.gnu.org>
      1 Author: François-Xavier Coudert <fxcoudert@gcc.gnu.org>
      1 Author: François-Xavier Coudett <fxcoudert@gcc.gnu.org>

      7 Author: Franz Sirl <franz.sirl-kernel@lauterbach.com>
*    192 Author: Franz Sirl <Franz.Sirl-kernel@lauterbach.com>

      1 Author: Gabriel Dos Reis <gdr@codesourcery>
*    138 Author: Gabriel Dos Reis <gdr@codesourcery.com>
      1 Author: Gabriel Dos Reis <gdr@codesourcery.coom>

      1 Author: Gabriel Dos Reis <gdr@integrable-solution.net>
*    308 Author: Gabriel Dos Reis <gdr@integrable-solutions.net>
      1 Author: Gabriel Dos Reis <gdr@integrablesolutions.net>
      3 Author: Gabriel Dos Reis <gcc@integrable-solutions.net>
      1 Author: Garbiel Dos Reis <gcc@integrable-solutions.net>

      5 Author: Gabriel Dos Reis <gdr@cse.tamu.edu>
*     10 Author: Gabriel Dos Reis <gdr@cs.tamu.edu>

*     23 Author: Gavin Koch <gavin@cygnus.com>
      2 Author: Gavin Kock <gavin@cygnus.com>

      2 Author: Geert bosch <bosch@gnat.com>
*     43 Author: Geert Bosch <bosch@gnat.com>
      1 Author: Geert Bosh <bosch@gnat.com>

      5 Author: Geoff Keating <geoffk@apple.com>
*    598 Author: Geoffrey Keating <geoffk@apple.com>

*    107 Author: Geoff Keating <geoffk@cygnus.com>
     50 Author: Geoffrey Keating <geoffk@cygnus.com>

      5 Author: Geoff Keating <geoffk@geoffk.org>
*      9 Author: Geoffrey Keating <geoffk@geoffk.org>

     20 Author: Geoff Keating <geoffk@redhat.com>
      2 Author: Geoff Keating <geoff@redhat.com>
*    205 Author: Geoffrey Keating <geoffk@redhat.com>

*      5 Author: Geoff Keating <geoffk@ozemail.com.au>
      1 Author: Geoffrey Keating <geoffk@ozemail.com.au>
      2 Author: Geoffrey KEATING <geoffk@ozemail.com.au>

*      2 Author: Graham Stott <grahams@rcp.co.uk>
      2 Author: Graham Stott <GrahamS@RCP.co.uk>

*     22 Author: Graham Stott <graham.stott@btinternet.com>
      1 Author: Graham Stott <graham.stott@btinternet.con>

*     12 Author: Hale Wang <hale.wang@arm.com>
      2 Author: Hale Wang <Hale.Wang@arm.com>

      6 Author: Hans Boehm <hans_boehm@hp.com>
*     26 Author: Hans Boehm <Hans_Boehm@hp.com>
     20 Author: Hans Boehm <Hans.Boehm@hp.com>
      1 Author: Hans Bohem <hans_boehm@hp.com>
      1 Author: Hans-J. Boehm <Hans_Boehm@hp.com>

*     64 Author: Hartmut Penner <hpenner@de.ibm.com>
      1 Author: Hartmut Penner <HPENNER@de.ibm.com>

      1 Author: Hongtao Liu <hongtao.liu@inte.com>
*     13 Author: Hongtao Liu <hongtao.liu@intel.com>

*      5 Author: Huang Chun <chunhuang73@hotmail.com>
      1 Author: Huang Chun <chunhuang73@hotmal.com>

      1 Author: Jakub Jeilnek <jakub@redhat.com>
*   9972 Author: Jakub Jelinek <jakub@redhat.com>
      2 Author: Jakub Jelinek <jakuB@redhat.com>

      1 Author: James A. Mmorrison <ja2morri@uwaterloo.ca>
      1 Author: James A Morrison <ja2morri@uwaterloo.ca>
*     11 Author: James A. Morrison <ja2morri@uwaterloo.ca>

*      3 Author: James A Morrison <ja2morri@student.math.uwaterloo.ca>
      1 Author: James A. Morrison <ja2morri@student.math.uwaterloo.ca>

*     96 Author: James A. Morrison <phython@gcc.gnu.org>
      1 Author: James A. Morrison <phython@gc.gnu.org>
      3 Author: James A. Morrrison <phython@gcc.gnu.org>
      1 Author: James Morrison <phython@gcc.gnu.org>

     40 Author: James E Wilson <wilson@specifix.com>
     15 Author: James E. Wilson <wilson@specifix.com>
*    129 Author: James E Wilson <wilson@specifixinc.com>
     19 Author: James E. Wilson <wilson@specifixinc.com>

*     20 Author: James E Wilson <wilson@tuliptree.org>
     11 Author: James E. Wilson <wilson@tuliptree.org>

      1 Author: James Greenahlgh <james.greenhalgh@arm.com>
      1 Author: James Greenhalgh <jame.greenhalgh@arm.com>
*    397 Author: James Greenhalgh <james.greenhalgh@arm.com>
      1 Author: James Greenhalgh <james.greenhalghj@arm.com>
      1 Author: James Grennahlgh <james.greenhalgh@arm.com>

      1 Author: Jan-benedict Glaw <jbglaw@lug-owl.de>
*     60 Author: Jan-Benedict Glaw <jbglaw@lug-owl.de>

      2 Author: Jan Hubicka <hubcika@ucw.cz>
      2 Author: Jan hubicka <hubicka@ucw.cz>
*   1023 Author: Jan Hubicka <hubicka@ucw.cz>
      5 Author: Jan HUbicka <hubicka@ucw.cz>

*   2049 Author: Jan Hubicka <jh@suse.cz>
      2 Author: Jan Hubicka <jH@suse.cz>
      3 Author: Jan HUbicka <jh@suse.cz>
      3 Author: Jan Hubicka <jh@susue.cz>
      1 Author: Jan Hubicka <j@suse.cz>

*    632 Author: Janis Johnson <janis187@us.ibm.com>
      2 Author: Janis Johnson <janis287@us.ibm.com>

      5 Author: Jeff Law <jeff@redhat.com>
      1 Author: Jeff Law <law@redha.com>
      4 Author: Jeff law <law@redhat.com>
*   1218 Author: Jeff Law <law@redhat.com>
     40 Author: Jeffrey A Law <law@redhat.com>
     14 Author: Jeffrey Law <law@redhat.com>

      3 Author: Jeff Law <law@cygnus.com>
*   1628 Author: Jeffrey A Law <law@cygnus.com>

*    110 Author: Jeff Sturm <jsturm@one-point.com>
      1 Author: Jeff Sturm <jsturm@one-pont.com>

      3 Author: jeremie Salvucci <jeremie.salvucci@free.fr>
      2 Author: Jeremie salvucci <jeremie.salvucci@free.fr>
*      7 Author: Jeremie Salvucci <jeremie.salvucci@free.fr>

*     35 Author: Jerome Lambourg <lambourg@adacore.com>
      1 Author: Jérôme Lambourg <lambourg@adacore.com>

     38 Author: Jerry DeLisle <jvdelisle@gcc.gnu>
*   1069 Author: Jerry DeLisle <jvdelisle@gcc.gnu.org>
      8 Author: Jerry DeLisle <jvdelisle@gcc.ngu.org>

      2 Author: "Jerry Quinn" <jquinn@nortelnetworks.com>
*      7 Author: Jerry Quinn <jquinn@nortelnetworks.com>

      1 Author: Joel Brobecker <brobecke@gnat.com>
*     11 Author: Joel Brobecker <brobecker@gnat.com>

     11 Author: Joel Sherrill <joel@oarcorp.com>
*     25 Author: Joel Sherrill <joel@OARcorp.com>

*     50 Author: Joel Sherrill <joel.sherrill@oarcorp.com>
      3 Author: Joel Sherrill <joel.sherrill@OARcorp.com>

*     49 Author: Joey Ye <joey.ye@arm.com>
      1 Author: Joey Ye <Joey.Ye@arm.com>

      3 Author: John Dave Anglin <danglin@gcc.gnu.org>
*    416 Author: John David Anglin <danglin@gcc.gnu.org>
      3 Author: John David Anglin <danglin@gcc.gnu.orig>
      1 Author: John David Anglin <danlgin@gcc.gnu.org>

      4 Author: John David Anglin <dava.anglin@nrc-cnrc.gc.ca>
      2 Author: John David Anglin <dave.anflin@nrc-cnrc.gc.ca>
      1 Author: John David Anglin <dave.anglin@gcc-cnrc.gc.ca>
      3 Author: John David Anglin <dave.anglin@nrc-cnrc.ca>
      4 Author: John David Anglin <dave.anglin@nrc-cnrc.gc>
      2 Author: John David Anglin <dave.anglin.@nrc-cnrc.gc.ca>
*    834 Author: John David Anglin <dave.anglin@nrc-cnrc.gc.ca>
      1 Author: John David Anglin <dave.anglin@nrc.cnrc.gc.ca>
      1 Author: John David Anglin <dave.anglin@nrc-crc.gc.ca>
     12 Author: John David Anglin <dave.anglin@nrc-crnc.gc.ca>
      3 Author: John David Anglin <dave.anglin@nrc.gc.ca>
      2 Author: John David Anglin <dave.anglin@nrc-gnrc.gc.ca>
      2 Author: John David Anglin <dave.danglin@nrc-cnrc.gc.ca>
      2 Author: John Davind Anglin <dave.anglin@nrc-cnrc.gc.ca>

      1 Author: John David Anglin <dave@hiauly1>
      1 Author: John David Anglin <dave@hiauly1.hia.nrc>
*    221 Author: John David Anglin <dave@hiauly1.hia.nrc.ca>
      1 Author: John David Anglin <dave@hiauly.hia.nrc.ca>
      1 Author: John David Anglin <dave@hiuly1.hia.nrc.ca>

      2 Author: John F Carr <jfc@mit.edu>
*      9 Author: John F. Carr <jfc@mit.edu>

*      5 Author: Joost VandeVondele <vondele@gcc.gnu.org>
      4 Author: Joost VandeVondele <vondele@gnu.gcc.org>

*    290 Author: J"orn Rennecke <amylaar@cygnus.co.uk>
      1 Author: Jorn Rennecke <amylaar@cygnus.co.uk>

      1 Author: Joseph Myers <joseph@codesourcery.cmo>
*   1868 Author: Joseph Myers <joseph@codesourcery.com>
    598 Author: Joseph S. Myers <joseph@codesourcery.com>

      1 Author: Joseph Myers <jsm@polyomino.org.uk>
*    304 Author: Joseph S. Myers <jsm@polyomino.org.uk>

      1 Author: Jun Ma <junma@linux.alibaba.com>
*      3 Author: Jun Ma <JunMa@linux.alibaba.com>

      1 Author: Kai Tietz <kai,tietz@onevision.com>
*    221 Author: Kai Tietz <kai.tietz@onevision.com>
      1 Author: Kai Tietz <Kai.Tietz@onevision.com>
      3 Author: Kai TIetz <kai.tietz@onevision.com>
      1 Author: Kai Tietz <kai.tietz@onevison.com>
      2 Author: Kai Tietz <kai.tietz@onvision.com>

      2 Author: Kaz kojima <kkojima@gcc.gnu.org>
*    444 Author: Kaz Kojima <kkojima@gcc.gnu.org>

*      8 Author: kaz Kojima <kkojima@rr.iij4u.or.jp>
      4 Author: Kaz Kojima <kkojima@rr.iij4u.or.jp>

      2 Author: Kazuhiro Inaoka <inaoka.kazuhiro@reneasas.com>
*     42 Author: Kazuhiro Inaoka <inaoka.kazuhiro@renesas.com>
      1 Author: Kazuhiro Inaoka <inaoka.kazuhiro@renesas.comn>
      1 Author: Kazuhiro Inaoka <inaoka.lazuhiro@renesas.com>
      3 Author: Kazuhiro Inaoka <kazuhiro.inaoka.ud@renesas.com>

*     26 Author: Kelley Cook <kelleycook@wideopenwest.com>
      2 Author: Kelley Cook <kelleycoook@wideopenwest.com>

*     43 Author: Kim Ho <kho@redhat.com>
      1 Author: Kim Ho  <kho@redhat.com>
      2 Author: Kim Ho  <kho@redhat.com>

*      3 Author: Kirill Yukhin <kirill.yukhin@gmail.com>
      2 Author: Kirill Yukhin <kirill.yukhin@gmaile.com>

      1 Author: Kugan Vivekanandarajah <kuganvi@linaro.org>
*    106 Author: Kugan Vivekanandarajah <kuganv@linaro.org>

      1 Author: Larin Hennessey <larin@science.oregonstate.edu>
*      4 Author: Larin Hennessy <larin@science.oregonstate.edu>

      8 Author: lars brinkhoff <lars@nocrew.org>
*     16 Author: Lars Brinkhoff <lars@nocrew.org>

      1 Author: Laurent Aflonsi <laurent.alfonsi@st.com>
*      2 Author: Laurent Alfonsi <laurent.alfonsi@st.com>

      7 Author: Laurent Guerby <laurent@guerby.net>
*     60 Author: Laurent GUERBY <laurent@guerby.net>

     30 Author: Loren James Rittle <ljrittle@acm.org>
*    270 Author: Loren J. Rittle <ljrittle@acm.org>
     24 Author: Loren Rittle <ljrittle@acm.org>

      2 Author: Maciej Rozycki <macro@wdc.com>
*     14 Author: Maciej W. Rozycki <macro@wdc.com>

      1 Author: Manfred Schwarbi <manfred99@gmx.ch>
*     10 Author: Manfred Schwarb <manfred99@gmx.ch>

*      5 Author: Mantas Mikaitis <mantas.mikaitis@arm.com>
      1 Author: Mantas Mikaitis <Mantas.Mikaitis@arm.com>

    150 Author: Manuel Lopez-Ibanez <manu@gcc.gnu.org>
*    175 Author: Manuel López-Ibáñez <manu@gcc.gnu.org>

      1 Author: Marcel BÃhme <boehme.marcel@gmail.com>
*      4 Author: Marcel Böhme <boehme.marcel@gmail.com>

      1 Author: Marcin Baczynski <marbacz@gmail.com>
*      1 Author: Marcin Baczyński <marbacz@gmail.com>

*     16 Author: Mark Eggleston <mark.eggleston@codethink.com>
      1 Author: Mark Eggleston <markeggleston@codethink.com>

      2 Author: Mark <mark@codesourcery.com>
*   2971 Author: Mark Mitchell <mark@codesourcery.com>
     20 Author: Mark P Mitchell <mark@codesourcery.com>
     18 Author: Mark P. Mitchell <mark@codesourcery.com>

*     89 Author: Mark Mitchell <mmitchell@usa.net>
      1 Author: Mark P. Mitchell <mmitchell@usa.net>

*    313 Author: Mark Wielaard <mark@klomp.org>
      1 Author: Mark WIelaard <mark@klomp.org>

*    870 Author: Martin Jambor <mjambor@suse.cz>
      1 Author: Martin Jambor <mjmabor@suse.cz>

      1 Author: Martin liska <mliska@suse.cz>
*   1721 Author: Martin Liska <mliska@suse.cz>

      2 Author: Martin v. Loewis <loewis@informatik.hu-berlin.de>
*     46 Author: Martin v. Löwis <loewis@informatik.hu-berlin.de>
     13 Author: Martin von Loewis <loewis@informatik.hu-berlin.de>
     34 Author: Martin von Löwis <loewis@informatik.hu-berlin.de>
      2 Author: Martin von Lvwis <loewis@informatik.hu-berlin.de>

*      4 Author: Martin v. Loewis <martin@mira.isdn.cs.tu-berlin.de>
      1 Author: Martin von Loewis <martin@mira.isdn.cs.tu-berlin.de>

      2 Author: matthew green <mrg@eterna.com.au>
*      1 Author: Matthew Green <mrg@eterna.com.au>

*     93 Author: Matthew Wahab <matthew.wahab@arm.com>
      1 Author: Matthew Wawhab <matthew.wahab@arm.com>

*     53 Author: Matt Kraai <kraai@alumni.cmu.edu>
      2 Author: Matt Kraai <kraii@alumni.cmu.edu>
      1 Author: Matt Kraii <kraii@alumni.cmu.edu>

     17 Author: Maxim Ostapenko <m.ostapenko@partner.samsung.com>
*     19 Author: Max Ostapenko <m.ostapenko@partner.samsung.com>

*   1059 Author: Michael Koch <konqueror@gmx.de>
      2 Author: Michael Koch <koqnueror@gmx.de>
      2 Author: Michael <konqueror@gmx.de>
      1 Author: Micheal Koch <konqueror@gmx.de>

      4 Author: Nathan sidwell <nathan@acm.org>
*   2104 Author: Nathan Sidwell <nathan@acm.org>

*     85 Author: Neil Booth <neilb@earthling.net>
     54 Author: Neil Booth <NeilB@earthling.net>

*      1 Author: NightStrike <nightstrike@gmail.com>
      1 Author: NightStrike <NightStrike@gmail.com>

*      2 Author: Nitin Yewale <nitiny@kpitcummins.com>
      2 Author: Nitin Yewale <NitinY@KPITCummins.com>

*     19 Author: Olga Golovanevsky <olga@il.ibm.com>
      1 Author: Olga Golovanevsky <OLGA@il.ibm.com>
      2 Author: Olga Golovonevsky <olga@il.ibm.com>

*     58 Author: Olga Rodimina <rodimina@redhat.com>
      1 Author: Olga Rodimina <rodimina@redhat.coom>

*    231 Author: Olivier Hainque <hainque@adacore.com>
      1 Author: OLivier Hainque <hainque@adacore.com>
      1 Author: Olivier Hainque <hainqueu@adacore.com>

*      7 Author: Ondřej Bílka <neleai@seznam.cz>
      3 Author: Ondřej Bílka <neleai@seznam.cz>

*      1 Author: Pádraig Brady <p@draigbrady.com>
      1 Author: Pádraig Brady <P@draigBrady.com>

*    643 Author: Paolo Bonzini <bonzini@gnu.org>
      1 Author: Paolo Bonzini <bonzinI@gnu.org>
      1 Author: Paolo Bonzini <bonzinI@gnu.rg>

*      2 Author: Paolo Bonzini <bonzini@gcc.gnu.org>
      1 Author: Paolo Bonzini <gcc@gcc.gnu.org>

      1 Author: Paolo Carlini <paolo.carlini@oracle>
*   1973 Author: Paolo Carlini <paolo.carlini@oracle.com>

      2 Author: Paolo Carlini <pcaelini@suse.de>
*   1338 Author: Paolo Carlini <pcarlini@suse.de>

      1 Author: Paul Brook <pau@codesourcery.com>
      2 Author: Paul Brook <paul@codeosurcery.com>
      2 Author: Paul brook <paul@codesourcery.com>
*    589 Author: Paul Brook <paul@codesourcery.com>
      2 Author: Paul Brook <paul@coodesourcery.com>
      2 Author: Paul Brook <paul@coudesourcery.com>
      1 Author: Paul Brook <paul@cpodesourcery.com>

*      5 Author: Paulo Matos <paulo@matos-sorge.com>
      1 Author: Paulo J. Matos <paulo@matos-sorge.com>

*      9 Author: Peter O'Gorman <peter@pogma.com>
      1 Author: Peter O'Gorman <peter@pogman.com>

      1 Author: Philip Blundell <bp@nexus.co.uk>
*     23 Author: Philip Blundell <pb@nexus.co.uk>

*     77 Author: Pierre-Marie de Rodat <derodat@adacore.com>
      1 Author: Pierre-Marie Derodat <derodat@adacore.com>

*     25 Author: Pierre Vittet <piervit@pvittet.com>
      1 Author: Pierre Vittet <piervit@vpittet.com>

*     13 Author: Pitchumani Sivanupandi <pitchumani.s@atmel.com>
      3 Author: Pitchumani Sivanupandi <Pitchumani.S@atmel.com>

      1 Author: Po Chang <pchang9@cs.wisc.edu>
*      3 Author: Po-Chun Chang <pchang9@cs.wisc.edu>

*      2 Author: Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
      1 Author: Przemyslaw Wirkus <przemyslaw.wirkus@arm.com\>

      4 Author: qing zhao <qing.zhao@oracle.com>
*     12 Author: Qing Zhao <qing.zhao@oracle.com>

*    161 Author: Rafael Avila de Espindola <espindola@google.com>
      1 Author: Rafael Ávila de Espíndola <espindola@google.com>
    126 Author: Rafael Espindola <espindola@google.com>

      2 Author: Rafael Avila de Espindola <rafael.espindola@gmail.com>
      2 Author: Rafael Ávila de Espíndola <rafael.espindola@gmail.com>
      1 Author: Rafael ?vila de Esp?ndola <rafael.espindola@gmail.com>
*      5 Author: Rafael Ávila de Espíndola <rafael.espindola@gmail.com>

      1 Author: Ralf Corsepius <corepiu@faw.uni-ulm.de>
*     15 Author: Ralf Corsepius <corsepiu@faw.uni-ulm.de>

*     55 Author: Ralf Corsepius <ralf.corsepius@rtems.org>
     36 Author: Ralf Corsépius <ralf.corsepius@rtems.org>
      1 Author: Ralf Corsépius <ralf.corsepius@rtems.org.>

      1 Author: Ralf Wildenhalf Wildenhues <Ralf.Wildenhues@gmx.de>
*    247 Author: Ralf Wildenhues <Ralf.Wildenhues@gmx.de>


*    159 Author: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com>
      1 Author: Ramana Radhakrishnan <Ramana.Radhakrishnan@arm.com>
      1 Author: Ramana Radhakrishnan <ramrad01@arm.com>

      2 Author: Rask Ingemann Lamberisen <rask@sygehus.dk>
*     48 Author: Rask Ingemann Lambertsen <rask@sygehus.dk>

*     41 Author: Razya Ladelsky <razya@il.ibm.com>
      3 Author: Razya Ladklesky <razya@il.ibm.com>

*     74 Author: Renlin Li <renlin.li@arm.com>
     16 Author: Renlin Li <Renlin.Li@arm.com>
      1 Author: Renlin <renlin.li@arm.com>

      1 Author: Revital1 Eres <ERES@il.ibm.com>
*     49 Author: Revital Eres <eres@il.ibm.com>
      1 Author: Revital Eres <ERES@il.ibm.com>

*   3467 Author: Richard Biener <rguenther@suse.de>
      2 Author: Richard Biener <rguenthe@suse.de>
      1 Author: Richard Biener <rguenth@suse.de>

*      1 Author: Richard B. Kreckel <richard.kreckel@framatome-anp.com>
      1 Author: Richard B. Kreckel <Richard.Kreckel@Framatome-ANP.com>

      1 Author: Richard Guenther <rguenther@gcc.gnu.org>
*     48 Author: Richard Guenther <rguenth@gcc.gnu.org>

*   3287 Author: Richard Guenther <rguenther@suse.de>
      1 Author: Richard GUenther <rguenther@suse.de>
      1 Author: Richrad Guenther <rguenther@suse.de>

      1 Author: Richard Hendersion <rth@redhat.com>
      1 Author: Richard Henderson <rth@redehat.com>
*   3717 Author: Richard Henderson <rth@redhat.com>
      3 Author: Richard Hendeson <rth@redhat.com>
      1 Author: Richard Henerson <rth@redhat.com>

*   1034 Author: Richard Henderson <rth@cygnus.com>
      1 Author: Richard Hendeson <rth@cygnus.com>

*    443 Author: Richard Sandiford <richard@codesourcery.com>
      1 Author: Richard SAndiford <richard@codesourcery.com>

      1 Author: Richard Sandiford <richard.sandifird@linaro.org>
*    751 Author: Richard Sandiford <richard.sandiford@linaro.org>

*    629 Author: Richard Sandiford <rsandifo@redhat.com>
      2 Author: Richard SAndiford <rsandifo@redhat.com>
      1 Author: Richard Sandiford <rsandif@redhat.com>

*     39 Author: Robert Lipe <robertlipe@usa.net>
      1 Author: Robert Lipe <RobertLipe@usa.net>

*      9 Author: Robert Schuster <thebohemian@gmx.net>
      3 Author: Robert Schuster <theBohemian@gmx.net>

*     29 Author: Robert Suchanek <robert.suchanek@imgtec.com>
      2 Author: Robert Suchanek <Robert.Suchanek@imgtec.com>

      1 Author: Rodney Brown <rodneybrown@pmsc.com>
*      8 Author: Rodney Brown <RodneyBrown@pmsc.com>
      3 Author: RodneyBrown <RodneyBrown@pmsc.com>

*     10 Author: Rodrigo Rivas Costa <rodrigorivascosta@gmail.com>
      1 Author: Rodrigo Rivas <rodrigorivascosta@gmail.com>

*      2 Author: Rohit Arul Raj <rohitarulraj@freescale.com>
      1 Author: Rohit Arul Raj <rohitrulraj@freescale.com>
      5 Author: Rohit <rohitarulraj@freescale.com>

      7 Author: scott snyder <snyder@fnal.gov>
*     12 Author: Scott Snyder <snyder@fnal.gov>

      1 Author: Segher Boessenkool <segher@kernel,crashing.org>
*    730 Author: Segher Boessenkool <segher@kernel.crashing.org>
      2 Author: Segher Boesssenkool <segher@kernel.crashing.org>

      2 Author: Sergey Shalnov <sergey.shalnov@intel.com>
*      7 Author: Sergey Shalnov <Sergey.Shalnov@intel.com>

      3 Author: Sherrill <joel@oarcorp.com>
*     32 Author: Sherrill <joel@OARcorp.com>

*     69 Author: Sriraman Tallam <tmsriram@google.com>
      1 Author: Sriraman Tallan <tmsriram@google.com>

      1 Author: Stamatis Markianos-Wright <stam.markianos-wright@arm.com>
*      2 Author: Stam Markianos-Wright <stam.markianos-wright@arm.com>

      2 Author: (Stephen L Moshier) <moshier@world.std.com>
*      2 Author: Stephen L Moshier <moshier@world.std.com>

      1 Author: Stephen M. Webb <stephen@bregmasoft..com>
*      7 Author: Stephen M. Webb <stephen@bregmasoft.com>

*     20 Author: Steven Bosscher <s.bosscher@student.tudelft.nl>
      2 Author: Steven Bosscher <S.Bosscher@student.tudelft.nl>

*     37 Author: Steven Bosscher <stevenb.gcc@gmail.com>
      1 Author: Steven Bosscher <steven.gcc@gmail.com>

      1 Author: Steven Bosscher <steven.bosscher@usafa.af.mil>
*      2 Author: Steven Bosscher <Steven.Bosscher@usafa.af.mil>

      1 Author: Steevn Bosscher <stevenb@suse.de>
*    214 Author: Steven Bosscher <stevenb@suse.de>
      1 Author: Steven Bosscher <steven@suse.de>

*    446 Author: Steven Bosscher <steven@gcc.gnu.org>
      1 Author: Steven Bosscher  <steven@gcc.gnu.org>

      2 Author: Steven G. Kargl <kargl@gccc.gnu.org>
      5 Author: Steven g. Kargl <kargl@gcc.gnu.org>
      3 Author: Steven G . Kargl <kargl@gcc.gnu.org>
*    694 Author: Steven G. Kargl <kargl@gcc.gnu.org>
      2 Author: Steven G. Kargl <kargl@gcc.ngu.org>
      3 Author: Steven G. Kargl <kargl@gc.gnu.org>
      1 Author: Steven G. Kargl <kargl@gcvc.gnu.org>
      3 Author: Steven G. Kargl <kargl@kgcc.gnu.org>
      3 Author: Steven G. Kargl <kargls@gcc.gnu.org>
      1 Author: Steven G. Kargll <kargl@gcc.gnu.org>
      1 Author: Steven K. kargl <kargl@gcc.gnu.org>

      1 Author: Steven G. Kargl <kargls@comcast>
*    163 Author: Steven G. Kargl <kargls@comcast.net>
      2 Author: Steven G. Kargl <kargls@comcast.nt>
      1 Author: Steven G. Kargl <kargls@comcat.net>
      1 Author: Steven G. Kargl <kargsl@comcast.net>

*      2 Author: Steve Pribyl <steve@netfuel.com>
      1 Author: Steve Pribyl <steve@netfuel.com.>

*    119 Author: Stuart Hastings <stuart@apple.com>
      1 Author: Stuart Hastings <stuarth@apple.com>

*    159 Author: Tamar Christina <tamar.christina@arm.com>
      1 Author: Tamar Christina  <tamar.christina@arm.com>

*      4 Author: Thierry Lafage <thierry.lafage@inria.fr>
      2 Author: Thierry LAFAGE <thierry.lafage@inria.fr>

*    213 Author: Thomas Fitzsimmons <fitzsim@redhat.com>
      1 Author: Thomas Fitzsimmons <fitzsim@redhat.com.h>

      3 Author: Thomas Koenig <tkoeng@gcc.gnu.org>
*    635 Author: Thomas Koenig <tkoenig@gcc.gnu.org>
     23 Author: Thomas König <tkoenig@gcc.gnu.org>

      1 Author: Tobias Burnus <burnus@net-b.>
*   1218 Author: Tobias Burnus <burnus@net-b.de>
      1 Author: Tobias Burnus <burnus@net.b.de>
      1 Author: Tobias Burnus <burnus@net-de>

      1 Author: Tobias Grosser <grosser@fim.uni-passau>
*    164 Author: Tobias Grosser <grosser@fim.uni-passau.de>
      5 Author: Tobias Grosser <grosser@fmi.uni-passau.de>

      7 Author: Tobias Schlüter <tobias.schlueter@physik.uni-muenchen.de>
     10 Author: Tobias Schlüter <tobias.schlueter@physik.uni-muenchen.de>
*    167 Author: Tobias Schlueter <tobias.schlueter@physik.uni-muenchen.de>
      3 Author: Tobias Schlueter <tobias.shclueter@physik.uni-muenchen.de>
      2 Author: Tobias Schlueter <tobis.schlueter@physik.uni-muenchen.de>
     94 Author: Tobias Schl"uter <tobias.schlueter@physik.uni-muenchen.de>

*     39 Author: Tobias Schlüter <tobi@gcc.gnu.org>
      1 Author: Tobias Schlueter <tobi@gcc.gnu.org>
      1 Author: Tobias Schlüuter <tobi@gcc.gnu.org>
      4 Author: Tobias Schlüter <tobi@gcc.gnu.org>

      1 Author: trevor Saunders <tbsaunde@tbsaunde.org>
*      5 Author: Trevor Saunders <tbsaunde@tbsaunde.org>

      1 Author: trevor Saunders <tsaunders@mozilla.com>
      1 Author: Trevor saunders <tsaunders@mozilla.com>
*     74 Author: Trevor Saunders <tsaunders@mozilla.com>

     12 Author: Trevor Smigiel <trevor_smigiel@playstation.sony.com>
*     12 Author: Trevor Smigiel <Trevor_Smigiel@playstation.sony.com>

      1 Author: Tristan Gingold <gindold@adacore.com>
*    171 Author: Tristan Gingold <gingold@adacore.com>

*   1753 Author: Uros Bizjak <ubizjak@gmail.com>
    114 Author: Uroš Bizjak <ubizjak@gmail.com>

      1 Author: Venkataramanan kumar <venkataramanan.kumar@amd.com>
*     25 Author: Venkataramanan Kumar <venkataramanan.kumar@amd.com>
      7 Author: Venkataramanan Kumar <Venkataramanan.kumar@amd.com>
      2 Author: Venkataramanan Kumar <Venkataramanan.Kumar@amd.com>

      1 Author: venkataramanan kumar <venkataramanan.kumar@linaro.org>
*     11 Author: Venkataramanan Kumar <venkataramanan.kumar@linaro.org>
      1 Author: Venkataramanan.Kumar <venkataramanan.kumar@linaro.org>

      1 Author: Volker Reichelt <reichelt@igpm.rwth-aaachen.de>
*    413 Author: Volker Reichelt <reichelt@igpm.rwth-aachen.de>

*    227 Author: Wilco Dijkstra <wdijkstr@arm.com>
     10 Author: Wilco Dijkstra <wilco.dijkstra@arm.com>

      1 Author: will wray <wjwray@gmail.com>
*      1 Author: Will Wray <wjwray@gmail.com>

*    131 Author: Xinliang David Li <davidxl@google.com>
      2 Author: Xinliang Davidl Li <davidxl@google.com>

*      8 Author: Yaakov Selkowitz <yselkowi@redhat.com>
      1 Author: Yaakov Selkowitz <yselkowitz@redhat.com>

*     88 Author: Zack Weinberg <zackw@stanford.edu>
      2 Author: Zack Weinberg <zackw@Stanford.EDU>

      1 Author: Zhenqiang Chen <zhenqiang.chen@linaor.org>
*     35 Author: Zhenqiang Chen <zhenqiang.chen@linaro.org>

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-27 19:47 Git conversion: fixing email addresses from ChangeLog files Richard Earnshaw (lists)
@ 2019-12-28 11:37 ` Segher Boessenkool
  2019-12-28 13:05   ` Joseph Myers
  2019-12-28 12:04 ` Jakub Jelinek
  2019-12-28 23:20 ` Richard Earnshaw (lists)
  2 siblings, 1 reply; 26+ messages in thread
From: Segher Boessenkool @ 2019-12-28 11:37 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: gcc

On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
>       1 Author: Segher Boessenkool <segher@kernel,crashing.org>
> *    730 Author: Segher Boessenkool <segher@kernel.crashing.org>
>       2 Author: Segher Boesssenkool <segher@kernel.crashing.org>

The first and third are only in changelogs.  The second even happened
only once, afaics?

These errors only happen in the reposurgeon conversion.


Segher

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-27 19:47 Git conversion: fixing email addresses from ChangeLog files Richard Earnshaw (lists)
  2019-12-28 11:37 ` Segher Boessenkool
@ 2019-12-28 12:04 ` Jakub Jelinek
  2019-12-28 16:55   ` Richard Earnshaw (lists)
  2019-12-28 23:20 ` Richard Earnshaw (lists)
  2 siblings, 1 reply; 26+ messages in thread
From: Jakub Jelinek @ 2019-12-28 12:04 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: gcc

On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
> Email addresses from the ChangeLog files are not validated during
> commits, so a number of typos exist in the extracted data.  I've
> extracted the 'Author:' entry from a prototype conversion and then piped
> that through sort and uniq -c.  Subsequent analysis shows the following
> addresses/names that are likely in need of some resolution.
> 
> Several of the names differ only in case and a case-insensitive sort
> would ignore these.  But some clearly show typing errors.

Looks like quite a mixed bag of various issues.  Some are caused by some level
of dysgraphia of lots of us and the right choice is quite obvious, others
are just that the same name can have different forms and all of them are ok
(Bradley vs. Brad, Jeffrey vs. Jeff), others whether middle name is used or
not (and whether . appears after the first letter of middle name or not),
and another category is whether non-accented name is used, or accented one.
If we need to choose just one between the various correct forms of
name/email, I'd say it is important that we at least don't choose forms with
encoding issues.
Should we give people the choice of accented vs. non-accented form?  E.g.
Uroš started using the accented form recently, having used the non-accented
form in the past, probably because the encoding of the ChangeLog files used
to be untrustworthy.  I remember several times when some people committed a
ChangeLog change that recoded everything in there as if e.g. the input was
in ISO-8859-1 and saved it as UTF-8 (which did quite some harm to already
UTF-8 encoded names).
E.g. in
      7 Author: Tobias Schl??ter <tobias.schlueter@physik.uni-muenchen.de>
     10 Author: Tobias Schl?ter <tobias.schlueter@physik.uni-muenchen.de>
*    167 Author: Tobias Schlueter <tobias.schlueter@physik.uni-muenchen.de>
       3 Author: Tobias Schlueter <tobias.shclueter@physik.uni-muenchen.de>
       2 Author: Tobias Schlueter <tobis.schlueter@physik.uni-muenchen.de>
      94 Author: Tobias Schl"uter <tobias.schlueter@physik.uni-muenchen.de>
 
*     39 Author: Tobias Schl?ter <tobi@gcc.gnu.org>
       1 Author: Tobias Schlueter <tobi@gcc.gnu.org>
       1 Author: Tobias Schl?uter <tobi@gcc.gnu.org>
       4 Author: Tobias Schlüter <tobi@gcc.gnu.org>
the second * certainly doesn't look right even when it is most common,
Tobias' name is surely Tobias Schlüter with possible transliteration
Tobias Schlueter.
*      7 Author: OndÅ?ej Bílka <neleai@seznam.cz>
       3 Author: Ond?ej Bílka <neleai@seznam.cz>
Neither of these is correct and the first one looks like an example of the
ISO-8859-1 to UTF-8 recodings, the latter like ISO-8859-2 encoded name,
the correct name is Ondřej Bílka in UTF-8 accented and Ondrej Bilka if
non-accented.
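
To illustrate the kind of damage described above, here is a minimal Python
sketch.  It is not taken from any conversion script; the helper names
misrecode_as_latin1 and undo_misrecode are made up for this example.  It
shows what happens when bytes that are already UTF-8 are read as ISO-8859-1
and saved as UTF-8 again, and that this particular mistake is reversible as
long as nothing else touched the text.

```python
def misrecode_as_latin1(utf8_bytes: bytes) -> bytes:
    """Read the bytes as ISO-8859-1 and write them back as UTF-8.

    Applied to text that was already UTF-8, this double-encodes every
    non-ASCII character (the harmful recoding described above).
    """
    return utf8_bytes.decode("iso-8859-1").encode("utf-8")


def undo_misrecode(damaged: bytes) -> bytes:
    """Reverse one round of the mistaken recoding."""
    return damaged.decode("utf-8").encode("iso-8859-1")


original = "Ondřej Bílka".encode("utf-8")
damaged = misrecode_as_latin1(original)

# The damaged form starts with "OndÅ", much like the entry in the list.
print(damaged.decode("utf-8"))
print(undo_misrecode(damaged).decode("utf-8"))
```

The round trip only works because ISO-8859-1 maps every byte value to a
code point; it says nothing about names that were damaged in other ways.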

	Jakub

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 11:37 ` Segher Boessenkool
@ 2019-12-28 13:05   ` Joseph Myers
  2019-12-28 14:54     ` Segher Boessenkool
  0 siblings, 1 reply; 26+ messages in thread
From: Joseph Myers @ 2019-12-28 13:05 UTC (permalink / raw)
  To: Segher Boessenkool; +Cc: Richard Earnshaw (lists), gcc

On Sat, 28 Dec 2019, Segher Boessenkool wrote:

> On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
> >       1 Author: Segher Boessenkool <segher@kernel,crashing.org>
> > *    730 Author: Segher Boessenkool <segher@kernel.crashing.org>
> >       2 Author: Segher Boesssenkool <segher@kernel.crashing.org>
> 
> The first and third are only in changelogs.  The second even happened
> only once, afaics?
> 
> These errors only happen in the reposurgeon conversion.

This is about extracting attributions from changelogs when unambiguous 
there, and then correcting mistakes or otherwise making minor variants 
more uniform.
  
The last one comes from r235585 and r245880.

-- 
Joseph S. Myers
jsm@polyomino.org.uk

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 13:05   ` Joseph Myers
@ 2019-12-28 14:54     ` Segher Boessenkool
  2019-12-28 16:34       ` Richard Earnshaw (lists)
  2019-12-28 16:36       ` Joseph Myers
  0 siblings, 2 replies; 26+ messages in thread
From: Segher Boessenkool @ 2019-12-28 14:54 UTC (permalink / raw)
  To: Joseph Myers; +Cc: Richard Earnshaw (lists), gcc

On Sat, Dec 28, 2019 at 01:05:13PM +0000, Joseph Myers wrote:
> On Sat, 28 Dec 2019, Segher Boessenkool wrote:
> 
> > On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
> > >       1 Author: Segher Boessenkool <segher@kernel,crashing.org>
> > > *    730 Author: Segher Boessenkool <segher@kernel.crashing.org>
> > >       2 Author: Segher Boesssenkool <segher@kernel.crashing.org>
> > 
> > The first and third are only in changelogs.  The second even happened
> > only once, afaics?
> > 
> > These errors only happen in the reposurgeon conversion.
> 
> This is about extracting attributions from changelogs when unambiguous 
> there, and then correcting mistakes or otherwise making minor variants 
> more uniform.

Yes, and I'm saying you probably shouldn't do that.

Note that these errors did not exist in the changelog in the commit
message, for example.

Since people very often typo their own name (as the evidence shows), the
heuristic for deriving it should be robust against that.

> The last one comes from r235585 and r245880.

Ah, thanks.


Segher

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 14:54     ` Segher Boessenkool
@ 2019-12-28 16:34       ` Richard Earnshaw (lists)
  2019-12-28 20:11         ` Segher Boessenkool
  2019-12-28 16:36       ` Joseph Myers
  1 sibling, 1 reply; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-28 16:34 UTC (permalink / raw)
  To: Segher Boessenkool, Joseph Myers; +Cc: gcc

On 28/12/2019 14:54, Segher Boessenkool wrote:
> On Sat, Dec 28, 2019 at 01:05:13PM +0000, Joseph Myers wrote:
>> On Sat, 28 Dec 2019, Segher Boessenkool wrote:
>>
>>> On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
>>>>       1 Author: Segher Boessenkool <segher@kernel,crashing.org>
>>>> *    730 Author: Segher Boessenkool <segher@kernel.crashing.org>
>>>>       2 Author: Segher Boesssenkool <segher@kernel.crashing.org>
>>>
>>> The first and third are only in changelogs.  The second even happened
>>> only once, afaics?
>>>
>>> These errors only happen in the reposurgeon conversion.
>>
>> This is about extracting attributions from changelogs when unambiguous 
>> there, and then correcting mistakes or otherwise making minor variants 
>> more uniform.
> 
> Yes, and I'm saying you probably shouldn't do that.

Why, for heaven's sake?  Even Maxim's conversion is doing this.

> 
> Note that these errors did not exist in the changelog in the commit
> message, for example.

Yes, they did.  Or at least, they did at the time of the original commit.

> 
> Since people very often typo their own name (as the evidence shows), the
> heuristic for deriving it should be robust against that.
> 

And the statistics show that it's not hard to identify the odd cases and
fix them up.  Only committers with just a single commit are really hard
to spot since we don't have data to compare against other entries.
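
A rough Python sketch of that idea follows.  It is an illustration only,
not the script used for the conversion, and the function name
propose_fixups is an assumption.  It groups the extracted 'Name <email>'
strings by lower-cased email address and proposes the most frequent
variant in each group as canonical.

```python
from collections import Counter, defaultdict


def propose_fixups(authors):
    """Map each minority 'Name <email>' variant to the most frequent
    variant sharing the same email address (compared case-insensitively)."""
    counts = Counter(authors)
    by_email = defaultdict(list)
    for author, n in counts.items():
        # Assumes every entry has the form 'Name <email>'.
        email = author[author.index("<") + 1:author.rindex(">")]
        by_email[email.lower()].append((n, author))

    fixups = {}
    for variants in by_email.values():
        # Highest count wins; ties fall back to string ordering.
        canonical = max(variants)[1]
        for _, author in variants:
            if author != canonical:
                fixups[author] = canonical
    return fixups


good = "Segher Boessenkool <segher@kernel.crashing.org>"
bad = "Segher Boesssenkool <segher@kernel.crashing.org>"
print(propose_fixups([good] * 730 + [bad] * 2))
```

Grouping by address only catches variants of the name or of the address's
case.  A typo inside the address itself would need a second pass keyed on
the name, and the proposed choice still needs human review.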

R.

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 14:54     ` Segher Boessenkool
  2019-12-28 16:34       ` Richard Earnshaw (lists)
@ 2019-12-28 16:36       ` Joseph Myers
  1 sibling, 0 replies; 26+ messages in thread
From: Joseph Myers @ 2019-12-28 16:36 UTC (permalink / raw)
  To: Segher Boessenkool; +Cc: Richard Earnshaw (lists), gcc

On Sat, 28 Dec 2019, Segher Boessenkool wrote:

> > This is about extracting attributions from changelogs when unambiguous 
> > there, and then correcting mistakes or otherwise making minor variants 
> > more uniform.
> 
> Yes, and I'm saying you probably shouldn't do that.

Extracting attributions from ChangeLog files is clearly useful to 
attribute changes committed by someone other than their author, and to 
provide more precise attributions in the cases where someone was 
committing under multiple email addresses to distinguish the affiliation 
relevant for each commit.

> Note that these errors did not exist in the changelog in the commit
> message, for example.

Mostly the changelogs in the commit message don't give the author email at 
all (or if they do, it's because they've cut-and-pasted the complete 
ChangeLog entry from one place to another, so with the same typos).

> Since people very often typo their own name (as the evidence shows), the
> heuristic for deriving it should be robust against that.

I think Richard's heuristic for finding cases of typos and proposing fixes 
for them (then to be applied automatically, after adjusting the preferred 
version in cases where comments such as Jakub's indicate something other 
than the most common version is to be preferred) looks robust to me.

-- 
Joseph S. Myers
jsm@polyomino.org.uk

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 12:04 ` Jakub Jelinek
@ 2019-12-28 16:55   ` Richard Earnshaw (lists)
  2019-12-28 17:14     ` Joseph Myers
  2019-12-28 18:37     ` Andreas Schwab
  0 siblings, 2 replies; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-28 16:55 UTC (permalink / raw)
  To: Jakub Jelinek; +Cc: gcc

On 28/12/2019 12:04, Jakub Jelinek wrote:
> On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
>> Email addresses from the ChangeLog files are not validated during
>> commits, so a number of typos exist in the extracted data.  I've
>> extracted the 'Author:' entry from a prototype conversion and then piped
>> that through sort and uniq -c.  Subsequent analysis shows the following
>> addresses/names that are likely in need of some resolution.
>>
>> Several of the names differ only in case and a case-insensitive sort
>> would ignore these.  But some clearly show typing errors.
> 
> Looks like quite mixed bag of various issues.  Some are caused by some level
> of dysgraphia of lots of us and the right choice is quite obvious, others
> are just that the same name can have different forms and all of them are ok
> (Bradley vs. Brad, Jeffrey vs. Jeff), others whether middle name is used or
> not (and whether . appears after the first letter of middle name or not),
> and another category is whether non-accented name is used, or accented one.
> If we need to choose just one between the various correct forms of
> name/email, I'd say it is important that we at least don't choose forms with
> encoding issues.

I don't know whether tools that analyse git repos to generate statistics
about users' contributions care about canonicalization of names; they may
just key off email addresses.  I'm not going to try to fix those
up, unless specifically asked.  For example, some users have

<first.last@domain>

and

<userid@domain>


> Should we give people the choice of accented vs. non-accented form, e.g.
> Uroš started using the accented form recently, using non-accented form in
> the past probably because the encoding of the ChangeLog files in the past used
> to be non-trustable, I remember several times when some people committed a
> ChangeLog change that recoded everything in there as if e.g. the input was
> in ISO-8859-1 and saved it as UTF-8 (which did quite some harm to already
> UTF-8 encoded names).
> E.g. in
>       7 Author: Tobias Schl??ter <tobias.schlueter@physik.uni-muenchen.de>
>      10 Author: Tobias Schl?ter <tobias.schlueter@physik.uni-muenchen.de>
> *    167 Author: Tobias Schlueter <tobias.schlueter@physik.uni-muenchen.de>
>        3 Author: Tobias Schlueter <tobias.shclueter@physik.uni-muenchen.de>
>        2 Author: Tobias Schlueter <tobis.schlueter@physik.uni-muenchen.de>
>       94 Author: Tobias Schl"uter <tobias.schlueter@physik.uni-muenchen.de>
>  
> *     39 Author: Tobias Schl?ter <tobi@gcc.gnu.org>
>        1 Author: Tobias Schlueter <tobi@gcc.gnu.org>
>        1 Author: Tobias Schl?uter <tobi@gcc.gnu.org>
>        4 Author: Tobias Schlüter <tobi@gcc.gnu.org>
> the second * certainly doesn't look right even when it is most common,
> Tobias' name is surely Tobias Schlüter with possible transliteration
> Tobias Schlueter.
> *      7 Author: OndÅ?ej Bílka <neleai@seznam.cz>
>        3 Author: Ond?ej Bílka <neleai@seznam.cz>
> Neither of these is correct and the first one looks like an example of the
> ISO-8859-1 to UTF-8 recodings, the latter like ISO-8859-2 encoded name,
> the correct name is Ondřej Bílka in UTF-8 accented and Ondrej Bilka if
> non-accented.
> 
> 	Jakub
> 

Yes, accents are tricky.  Some people care massively, others not at all.
 I think the above renderings come from the fact that the file I posted
contains more than one encoding style and that breaks any tools that try
to automatically resolve the encoding.  When I grep the email for Ondřej
directly it displays correctly for the second one in a UTF-8 locale.
When I load the attachment I posted yesterday into emacs it doesn't even
try to render the extended characters and simply displays them as hex
values with a leading \.

My suggestion would be that we try to canonicalize all the author
entries to UTF-8 as that avoids the limitations of ISO-8859-1, but that
would probably need further fixups to detect the additional names that
need rewriting.

R.

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 16:55   ` Richard Earnshaw (lists)
@ 2019-12-28 17:14     ` Joseph Myers
  2019-12-28 17:23       ` Richard Earnshaw (lists)
  2019-12-28 18:37     ` Andreas Schwab
  1 sibling, 1 reply; 26+ messages in thread
From: Joseph Myers @ 2019-12-28 17:14 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: Jakub Jelinek, gcc

On Sat, 28 Dec 2019, Richard Earnshaw (lists) wrote:

> My suggestion would be that we try to canonicalize all the author
> entries to UTF-8 as that avoids the limitations of ISO-8859-1, but that
> would probably need further fixups to detect the additional names that
> need rewriting.

What I've implemented in bugdb.py already includes converting ISO-8859-1 
to UTF-8 (in any case where the author name is not valid UTF-8 - a general 
property of text encodings is that if something is valid UTF-8, it almost 
certainly is already encoded in ASCII or UTF-8), with special 
handling of NBSP and with fixups for all the cases where the results of 
converting ISO-8859-1 to UTF-8 looked wrong (i.e. where it looked like the 
name in the original ChangeLog was not in fact UTF-8).
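
As a sketch of the fallback described above (this is not the bugdb.py
code; the function name to_utf8 and the choice to turn NBSP into an
ordinary space are assumptions made for illustration), the decoding step
could look like this in Python:

```python
def to_utf8(raw: bytes) -> str:
    """Decode an author name, preferring UTF-8 and falling back to
    ISO-8859-1 only when the bytes are not valid UTF-8."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        text = raw.decode("iso-8859-1")
        # Assumed NBSP handling: replace U+00A0 with a plain space.
        return text.replace("\u00a0", " ")


print(to_utf8("Tobias Schlüter".encode("iso-8859-1")))
print(to_utf8("Uroš Bizjak".encode("utf-8")))
```

Names in other encodings (ISO-8859-2, for instance) will decode without
error here but come out wrong, which is why manual fixups are still needed
for those.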

I've also now made bugdb.py check the list of fixups both before and after 
recoding (which may help in some cases where e.g. a fixup is putting a 
name in canonical form, meaning such a fixup doesn't need to be given in 
forms with both UTF-8 and ISO-8859-1 encodings even if the name appears 
with both those encodings in the history).

Because the author extraction is based on the ChangeLog entry included in 
the original commit, any subsequent commits that (wrongly or correctly) 
recoded ChangeLog entries are not relevant.

-- 
Joseph S. Myers
jsm@polyomino.org.uk

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 17:14     ` Joseph Myers
@ 2019-12-28 17:23       ` Richard Earnshaw (lists)
  2019-12-28 17:34         ` Joseph Myers
  0 siblings, 1 reply; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-28 17:23 UTC (permalink / raw)
  To: Joseph Myers; +Cc: Jakub Jelinek, gcc

On 28/12/2019 17:14, Joseph Myers wrote:
> On Sat, 28 Dec 2019, Richard Earnshaw (lists) wrote:
> 
>> My suggestion would be that we try to canonicalize all the author
>> entries to UTF-8 as that avoids the limitations of ISO-8859-1, but that
>> would probably need further fixups to detect the additional names that
>> need rewriting.
> 
> What I've implemented in bugdb.py already includes converting ISO-8859-1 
> to UTF-8 (in any case where the author name is not valid UTF-8 - a general 
> property of text encodings is that if something is valid UTF-8, it almost 
> certainly is already encoded in ASCII or UTF-8), with special 
> handling of NBSP and with fixups for all the cases where the results of 
> converting ISO-8859-1 to UTF-8 looked wrong (i.e. where it looked like the 
> name in the original ChangeLog was not in fact UTF-8).
> 
> I've also now made bugdb.py check the list of fixups both before and after 
> recoding (which may help in some cases where e.g. a fixup is putting a 
> name in canonical form, meaning such a fixup doesn't need to be given in 
> forms with both UTF-8 and ISO-8859-1 encodings even if the name appears 
> with both those encodings in the history).
> 
> Because the author extraction is based on the ChangeLog entry included in 
> the original commit, any subsequent commits that (wrongly or correctly) 
> recoded ChangeLog entries are not relevant.
> 

I've added the list of emails that I posted yesterday to the conversion
scripts.  I've not written anything to reprocess that yet.  I want to
leave that until we've completed the general review of the preferred
changes we want.  Auto-generating that data from the list will probably
be easier than maintaining it inside bugdb.py for now.

R.

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 17:23       ` Richard Earnshaw (lists)
@ 2019-12-28 17:34         ` Joseph Myers
  2019-12-28 18:54           ` Joseph Myers
  0 siblings, 1 reply; 26+ messages in thread
From: Joseph Myers @ 2019-12-28 17:34 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: Jakub Jelinek, gcc

On Sat, 28 Dec 2019, Richard Earnshaw (lists) wrote:

> I've added the list of emails that I posted yesterday to the conversion
> scripts.  I've not written anything to reprocess that yet.  I want to
> leave that until we've completed the general review of the preferred
> changes we want.  Auto-generating that data from the list will probably
> be easier than maintaining it inside bugdb.py for now.

I've now pushed a change to automate removing "" or () around names.  
Together with the automatic conversion of ISO-8859-1 names to UTF-8 that 
should slightly reduce the number of cases needing handling from that 
list.
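
For illustration only (this is not the change that was pushed, and the
name strip_wrappers is hypothetical), removing a surrounding pair of ""
or () from a name might be sketched in Python as:

```python
import re

# Matches a name wholly wrapped in double quotes or in parentheses.
_WRAPPED = re.compile(r'^\s*(?:"(.*)"|\((.*)\))\s*$')


def strip_wrappers(name: str) -> str:
    """Remove one pair of "" or () around a name, if present."""
    m = _WRAPPED.match(name)
    if m is None:
        return name.strip()
    inner = m.group(1) if m.group(1) is not None else m.group(2)
    return inner.strip()


print(strip_wrappers("(Stephen L Moshier)"))
```

With this, the two Stephen L Moshier entries in the posted list would
collapse into one without needing an explicit fixup.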

-- 
Joseph S. Myers
jsm@polyomino.org.uk

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 16:55   ` Richard Earnshaw (lists)
  2019-12-28 17:14     ` Joseph Myers
@ 2019-12-28 18:37     ` Andreas Schwab
  1 sibling, 0 replies; 26+ messages in thread
From: Andreas Schwab @ 2019-12-28 18:37 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: Jakub Jelinek, gcc

On Dec 28 2019, Richard Earnshaw (lists) wrote:

> I don't know whether tools that analyse git repos to generate statistics
> about users contributions care about canonicalization of names; they may
> just key off email addresses.

git shortlog supports that via .mailmap.
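
A small Python sketch of how a table of fixups could be turned into
.mailmap entries is shown here.  The function name mailmap_lines and the
shape of the input dictionary are assumptions for illustration; the line
layout 'Proper Name <proper@email> Commit Name <commit@email>' is the
standard .mailmap form.

```python
def mailmap_lines(fixups):
    """Render {'Variant <email>': 'Canonical <email>'} as .mailmap lines.

    Each line reads 'Proper Name <proper@email> Commit Name <commit@email>'.
    """
    return [f"{canonical} {variant}"
            for variant, canonical in sorted(fixups.items())]


example = {
    "Uroš Bizjak <ubizjak@gmail.com>": "Uros Bizjak <ubizjak@gmail.com>",
}
print("\n".join(mailmap_lines(example)))
```

A .mailmap changes only how names are reported; the commits themselves
keep whatever Author: value the conversion wrote.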

Andreas.

-- 
Andreas Schwab, schwab@linux-m68k.org
GPG Key fingerprint = 7578 EB47 D4E5 4D69 2510  2552 DF73 E780 A9DA AEC1
"And now for something completely different."

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 17:34         ` Joseph Myers
@ 2019-12-28 18:54           ` Joseph Myers
  2019-12-28 19:18             ` Eric S. Raymond
  2019-12-29 17:44             ` Joseph Myers
  0 siblings, 2 replies; 26+ messages in thread
From: Joseph Myers @ 2019-12-28 18:54 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: Jakub Jelinek, gcc

On Sat, 28 Dec 2019, Joseph Myers wrote:

> On Sat, 28 Dec 2019, Richard Earnshaw (lists) wrote:
> 
> > I've added the list of emails that I posted yesterday to the conversion
> > scripts.  I've not written anything to reprocess that yet.  I want to
> > leave that until we've completed the general review of the preferred
> > changes we want.  Auto-generating that data from the list will probably
> > be easier than maintaining it inside bugdb.py for now.
> 
> I've now pushed a change to automate removing "" or () around names.  
> Together with the automatic conversion of ISO-8859-1 names to UTF-8 that 
> should slightly reduce the number of cases needing handling from that 
> list.

Concretely, what I'd suggest is: convert ISO-8859-1 entries in the 
checked-in list to UTF-8, removing anything that thereby becomes a 
duplicate or unnecessary; handle anything whose encoding isn't simply 
ISO-8859-1 or UTF-8 via a hardcoded entry in bugdb.py using hex escapes 
like the existing such entries there.  Once the checked-in list is pure 
UTF-8 it's easier for people to review and edit.  Where the issue is only 
presence of ISO-8859 NBSP, or "" or () around the names, remove that in 
the checked-in list and again remove duplicates.  That way the list can be 
limited to non-encoding variations.

-- 
Joseph S. Myers
jsm@polyomino.org.uk

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 18:54           ` Joseph Myers
@ 2019-12-28 19:18             ` Eric S. Raymond
  2019-12-29 17:44             ` Joseph Myers
  1 sibling, 0 replies; 26+ messages in thread
From: Eric S. Raymond @ 2019-12-28 19:18 UTC (permalink / raw)
  To: Joseph Myers; +Cc: Richard Earnshaw (lists), Jakub Jelinek, gcc

Joseph Myers <jsm@polyomino.org.uk>:
> Concretely, what I'd suggest is: convert ISO-8859-1 entries in the 
> checked-in list to UTF-8, removing anything that thereby becomes a 
> duplicate or unnecessary; handle anything whose encoding isn't simply 
> ISO-8859-1 or UTF-8 via a hardcoded entry in bugdb.py using hex escapes 
> like the existing such entries there.  Once the checked-in list is pure 
> UTF-8 it's easier for people to review and edit.  Where the issue is only 
> presence of ISO-8859 NBSP, or "" or () around the names, remove that in 
> the checked-in list and again remove duplicates.  That way the list can be 
> limited to non-encoding variations.

Be aware that reposurgeon has a "transcode" command for moving
a specified set of objects to UTF-8 from a specified encoding.
-- 
		<a href="http://www.catb.org/~esr/">Eric S. Raymond</a>


* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 16:34       ` Richard Earnshaw (lists)
@ 2019-12-28 20:11         ` Segher Boessenkool
  2019-12-28 23:09           ` Richard Earnshaw (lists)
  0 siblings, 1 reply; 26+ messages in thread
From: Segher Boessenkool @ 2019-12-28 20:11 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: Joseph Myers, gcc

On Sat, Dec 28, 2019 at 04:34:20PM +0000, Richard Earnshaw (lists) wrote:
> On 28/12/2019 14:54, Segher Boessenkool wrote:
> > On Sat, Dec 28, 2019 at 01:05:13PM +0000, Joseph Myers wrote:
> >> On Sat, 28 Dec 2019, Segher Boessenkool wrote:
> >>
> >>> On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
> >>>>       1 Author: Segher Boessenkool <segher@kernel,crashing.org>
> >>>> *    730 Author: Segher Boessenkool <segher@kernel.crashing.org>
> >>>>       2 Author: Segher Boesssenkool <segher@kernel.crashing.org>
> >>>
> >>> The first and third are only in changelogs.  The second even happened
> >>> only once, afaics?
> >>>
> >>> These errors only happen in the reposurgeon conversion.
> >>
> >> This is about extracting attributions from changelogs when unambiguous 
> >> there, and then correcting mistakes or otherwise making minor variants 
> >> more uniform.
> > 
> > Yes, and I'm saying you probably shouldn't do that.
> 
> > Why, for heaven's sake?  Even Maxim's conversion is doing this.

No, it doesn't.  If people sometimes mispel their own name in a changelog
it does not put that mispeling as Author: in the git commit.

> > Note that these errors did not exist in the changelog in the commit
> > message, for example.
> 
> Yes, they did.  Or at least, they did at the time of the original commit.

No, they never did.  I always cut off the date/name/email line from the
changelog in the commit message.

> > Since people very often typo their own name (as the evidence shows), the
> > heuristic for deriving it should be robust against that.
> 
> And the statistics show that it's not hard to identify the odd cases and
> fix them up.  Only committers with just a single commit are really hard
> to spot since we don't have data to compare against other entries.

Sure, so do that?  :-)


Segher

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 20:11         ` Segher Boessenkool
@ 2019-12-28 23:09           ` Richard Earnshaw (lists)
  0 siblings, 0 replies; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-28 23:09 UTC (permalink / raw)
  To: Segher Boessenkool; +Cc: Joseph Myers, gcc

On 28/12/2019 20:11, Segher Boessenkool wrote:
> On Sat, Dec 28, 2019 at 04:34:20PM +0000, Richard Earnshaw (lists) wrote:
>> On 28/12/2019 14:54, Segher Boessenkool wrote:
>>> On Sat, Dec 28, 2019 at 01:05:13PM +0000, Joseph Myers wrote:
>>>> On Sat, 28 Dec 2019, Segher Boessenkool wrote:
>>>>
>>>>> On Fri, Dec 27, 2019 at 07:47:02PM +0000, Richard Earnshaw (lists) wrote:
>>>>>>       1 Author: Segher Boessenkool <segher@kernel,crashing.org>
>>>>>> *    730 Author: Segher Boessenkool <segher@kernel.crashing.org>
>>>>>>       2 Author: Segher Boesssenkool <segher@kernel.crashing.org>
>>>>>
>>>>> The first and third are only in changelogs.  The second even happened
>>>>> only once, afaics?
>>>>>
>>>>> These errors only happen in the reposurgeon conversion.
>>>>
>>>> This is about extracting attributions from changelogs when unambiguous 
>>>> there, and then correcting mistakes or otherwise making minor variants 
>>>> more uniform.
>>>
>>> Yes, and I'm saying you probably shouldn't do that.
>>
>> Why, for heaven's sake?  Even Maxim's conversion is doing this.
> 
> No, it doesn't.  If people sometimes misspell their own name in a changelog
> it does not put that misspelling as Author: in the git commit.

Then either it's psychic, or Maxim is already doing what I suggest.  The
information must come from *somewhere*.

> 
>>> Note that these errors did not exist in the changelog in the commit
>>> message, for example.
>>
>> Yes, they did.  Or at least, they did at the time of the original commit.
> 
> No, they never did.  I always cut off the date/name/email line from the
> changelog in the commit message.

The changelogs command does not extract the data from the commit
message.  I never suggested that it did.

> 
>>> Since people very often typo their own name (as the evidence shows), the
>>> heuristic for deriving it should be robust against that.
>>
>> And the statistics show that it's not hard to identify the odd cases and
>> fix them up.  Only committers with just a single commit are really hard
>> to spot since we don't have data to compare against other entries.
> 
> Sure, so do that?  :-)
> 

Which is the very purpose of this email thread ;-)

R.
> 
> Segher
> 

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-27 19:47 Git conversion: fixing email addresses from ChangeLog files Richard Earnshaw (lists)
  2019-12-28 11:37 ` Segher Boessenkool
  2019-12-28 12:04 ` Jakub Jelinek
@ 2019-12-28 23:20 ` Richard Earnshaw (lists)
  2019-12-29 12:32   ` Eric S. Raymond
  2 siblings, 1 reply; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-28 23:20 UTC (permalink / raw)
  To: gcc

[-- Attachment #1: Type: text/plain, Size: 1676 bytes --]

On 27/12/2019 19:47, Richard Earnshaw (lists) wrote:
> Email addresses from the ChangeLog files are not validated during
> commits, so a number of typos exist in the extracted data.  I've
> extracted the 'Author:' entry from a prototype conversion and then piped
> that through sort and uniq -c.  Subsequent analysis shows the following
> addresses/names that are likely in need of some resolution.
> 
> Several of the names differ only in case and a case-insensitive sort
> would ignore these.  But some clearly show typing errors.
> 
> I've marked with a '*' those entries that I believe should be canonical,
> but if you think I've made a mistake somewhere, please let me know.
> It's easier to fix this before I write a script to do the changes than
> afterwards.
> 
> There are many entries that look like unix machine names rather than
> genuine email addresses for a domain.  I have *not* attempted to
> canonicalize those, but if some folk would like that doing, please
> supply entries in the form
> 
> 'name to match <email@to-match>': 'name to use <email@to-use>',
> 
> form and I'll paste those into the conversion script.
> 
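As a rough illustration of the analysis quoted above (counting the
'Author:' entries, then looking for near-duplicates), here is a minimal
Python sketch.  It is not the script used for the conversion; the
function names are made up for this example, and it assumes each input
line has already been reduced to the 'Name <email>' form.

```python
from collections import Counter, defaultdict


def count_authors(lines):
    """Count identical author strings, like `sort | uniq -c`."""
    return Counter(line.strip() for line in lines if line.strip())


def suspicious_groups(counts):
    """Group author strings that share an email address (ignoring case).

    Any group with more than one member differs only in the name part or
    in letter case, so it is a candidate for manual review.
    """
    groups = defaultdict(list)
    for author, n in counts.items():
        start, end = author.rfind("<"), author.rfind(">")
        if start == -1 or end < start:
            continue  # no recognisable <email> part; skip it
        email = author[start + 1:end].lower()
        groups[email].append((n, author))
    # Most frequent spelling first: usually the one to treat as canonical.
    return {e: sorted(v, reverse=True) for e, v in groups.items() if len(v) > 1}


def canonicalize(author, fixups):
    """Apply a 'name to match <email>': 'name to use <email>' mapping."""
    return fixups.get(author, author)
```

Grouping by email only catches variants in the name or in letter case.
A typo inside the address itself (such as a ',' in place of a '.') ends
up in a group of its own and still needs to be spotted by eye.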

Another source of missed authorship is where the changelogs command was
unable to parse the email address from the changelog due to syntactic
errors.  The list below is from my last run a couple of days back.  I've
added fixups for these commits as shown in the list.  There are only
two committers for whom I've been unable to trace the email address, so
I've just commented those out for now; if anybody knows the correct
addresses, please let me know.  Also, there's one joint list that I've
not attempted to fix at this time.
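
For readers who want to experiment with a list in this format, the
following Python sketch shows one way such fixups could be loaded and
applied.  It is only an illustration: it assumes the quoted numbers are
revision numbers, that lines starting with '#' are entries held back,
and that a fixup is used only when the changelog parser found no author.
The names load_fixups and author_for are hypothetical.

```python
import re

# Matches lines such as:   "17168": "Nick Clifton <nickc@cygnus.com>",
ENTRY = re.compile(r'^\s*"(\d+)":\s*"(.*)",\s*$')


def load_fixups(text):
    """Parse the attachment text into {revision: 'Name <email>'}."""
    fixups = {}
    for line in text.splitlines():
        if line.lstrip().startswith("#"):
            continue  # commented-out entry: address or name still unknown
        match = ENTRY.match(line)
        if match:
            fixups[int(match.group(1))] = match.group(2)
    return fixups


def author_for(revision, parsed_author, fixups):
    """Prefer the parsed author; fall back to the fixup, else None."""
    return parsed_author or fixups.get(revision)
```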

R.



[-- Attachment #2: author_add.list --]
[-- Type: text/plain, Size: 11576 bytes --]

  "17168": "Nick Clifton <nickc@cygnus.com>",
  "21115": "Henning Petersen <Henning.Petersen@t-online.de>",
  "22681": "Robert Lipe <robertl@dgii.com>",
  "24509": "Mark Elbrecht <snowball3@usa.net>",
  "24510": "Mark Elbrecht <snowball3@usa.net>",
  "25835": "Mark Elbrecht <snowball3@usa.net>",
  "25958": "Mark Elbrecht <snowball3@usa.net>",
#  "28488": "Jim Kingdon <http://developer.redhat.com>",
#  "28921": "Jim Kingdon <http://developer.redhat.com>",
#  "29010": "Jim Kingdon <http://developer.redhat.com>",
#  "29588": "Jim Kingdon <http://developer.redhat.com>",
#  "29773": "Jim Kingdon <http://developer.redhat.com>",
#  "30174": "Jim Kingdon <http://developer.redhat.com/>",
#  "30175": "Jim Kingdon <http://developer.redhat.com/>",
  "30613": "Bruce Korb <autogen@linuxbox.com>",
  "30614": "Bruce Korb <autogen@linuxbox.com>",
  "31356": "Zack Weinberg <zack@wolery.cumb.org>",
  "33449": "Rodney Brown <RodneyBrown@pmsc.com>",
  "33454": "Neil Booth <NeilB@earthling.net>",
  "33603": "Vadim Egorov <egorovv@mailandnews.com>",
  "33712": "David Billinghurst <David.Billinghurst@riotinto.com.au>",
  "34383": "David Billinghurst <David.Billinghurst@riotinto.com.au>",
  "34432": "David Billinghurst <David.Billinghurst@riotinto.com.au>",
  "34836": "Zack Weinberg <zack@wolery.cumb.org>",
  "34879": "David Billinghurst <David.Billinghurst@riotinto.com.au>",
  "34992": "David Billinghurst <David.Billinghurst@riotinto.com.au>",
  "38507": "Alan Lehotsky <lehotsky@tiac.net>",
  "38625": "Richard Kenner <kenner@vlsi1.ultra.nyu.edu>",
  "38629": "Richard Kenner <kenner@vlsi1.ultra.nyu.edu>",
  "38848": "Peter Schmid <schmid@snake.iap.physik.tu-darmstadt.de>",
  "41676": "Toomas Rosin <toomas@ns.tklabor.ee>",
  "41866": "Jakub Jelinek <jakub@redhat.com>",
  "43021": "Jakub Jelinek <jakub@redhat.com>",
  "43836": "Jeff Sturm <jsturm@one-point.com>",
  "44574": "David Billinghurst <David.Billinghurst@riotinto.com.au>",
  "46448": "Richard Kenner <kenner@vlsi1.ultra.nyu.edu>",
#  "47044": "<bjornw@planetarion.com>",
  "47560": "Robert Dewar <dewar@gnat.com>",
  "49681": "Adam Megacz <adam@xwt.org>",
#  "49710": "Naveen Sharma,Nitin Gupta <naveens@noida.hcltech.com,niting@noida.hcltech.com>",
  "52180": "Richard Sandiford <rsandifo@redhat.com>",
  "52181": "Richard Sandiford <rsandifo@redhat.com>",
  "54251": "Jeff Law <law@redhat.com>",
  "54358": "Jeff Law <law@redhat.com>",
  "54509": "Jeffrey Law <law@redhat.com>",
  "57749": "Matt Austern <austern@apple.com>",
  "58824": "John David Anglin <dave@hiauly1.hia.nrc.ca>",
  "59204": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59274": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59276": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59302": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59303": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59361": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59387": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59389": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59400": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59409": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59412": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59415": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59438": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59439": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59440": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "59441": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "61282": "Andrew Haley <aph@host240.cambridge.redhat.com>",
  "62786": "Jeff Law <law@redhat.com>",
  "63601": "Mark Mitchell <mark@codesourcery.com>",
  "64405": "Gabriel Dos Reis <gcc@integrable-solutions.net>",
  "64615": "Kazu Hirata <kazu@cs.umass.edu>",
  "64727": "DJ Delorie <dj@redhat.com>",
  "65513": "Paolo Carlini <pcarlini@unitus.it>",
  "65665": "Benjamin Kosnik <bkoz@redhat.com>",
  "66375": "Gerald Pfeifer <pfeifer@dbai.tuwien.ac.at>",
  "66377": "Gerald Pfeifer <pfeifer@dbai.tuwien.ac.at>",
  "66533": "Neil Booth <neil@cat.daikokuya.co.uk>",
  "67286": "Daniel Berlin <dberlin@dberlin.org>",
  "67614": "Eric Botcazou <ebotcazou@libertysurf.fr>",
  "69900": "Hans-Peter Nilsson <hp@bitrange.com>",
  "71613": "Anthony Green <green@redhat.com>",
  "73559": "Jeff Law <law@redhat.com>",
  "74416": "Arnaud Charlet <charlet@act-europe.fr>",
  "75815": "Devang Patel <dpatel@apple.com>",
  "76025": "J. Brobecker <brobecker@gnat.com>",
  "78329": "Matt Austern <austern@apple.com>",
  "80747": "Kazuhiro Inaoka <inaoka.kazuhiro@renesas.com>",
  "80885": "Anil Paranjpe <anilp1@kpitcummins.com>",
  "81578": "Caroline Tice <ctice@apple.com>",
  "82073": "Fariborz Jahanian <fjahanian@apple.com>",
  "82388": "Janis Johnson <janis187@us.ibm.com>",
  "83455": "Tobias Schlueter <tobias.schlueter@physik.uni-muenchen.de>",
  "83663": "Stuart Hastings <stuart@apple.com>",
  "85189": "Hans-Peter Nilsson <hp@bitrange.com>",
  "85771": "Caroline Tice <ctice@apple.com>",
  "85904": "Janis Johnson <janis187@us.ibm.com>",
  "85907": "Janis Johnson <janis187@us.ibm.com>",
  "87394": "Richard Henderson <rth@redhat.com>",
  "87395": "Richard Henderson <rth@redhat.com>",
  "88896": "Andrew Pinski <pinskia@physics.uc.edu>",
  "93713": "Tobias Schl\"uter <tobias.schlueter@physik.uni-muenchen.de>",
  "94575": "Richard Henderson <rth@redhat.com>",
  "95687": "Geert Bosch <bosch@gnat.com>",
  "95725": "John David Anglin <dave.anglin@nrc-cnrc.gc.ca>",
  "95729": "John David Anglin <dave.anglin@nrc-cnrc.gc.ca>",
  "95730": "John David Anglin <dave.anglin@nrc-cnrc.gc.ca>",
  "96325": "Devang Patel <dpatel@apple.com>",
  "98148": "Janis Johnson <janis187@us.ibm.com>",
  "104276": "Jerry DeLisle <jvdelisle@verizon.net>",
  "107252": "Graham Stott <graham.stott@btinternet.com>",
  "110670": "Janis Johnson <janis187@us.ibm.com>",
  "117215": "Janis Johnson <janis187@us.ibm.com>",
  "119176": "Andrew Pinski <andrew_pinski@playstation.sony.com>",
  "120776": "John David Anglin <dave.anglin@nrc-cnrc.gc.ca>",
  "123492": "Danny Smith <dannysmith@users.sourceforge.net>",
  "124149": "Danny Smith <dannysmith@users.sourceforge.net>",
  "125178": "David Daney <ddaney@avtrex.com>",
  "125778": "Danny Smith <dannysmith@users.sourceforge.net>",
  "125818": "David Daney <ddaney@avtrex.com>",
  "125824": "David Daney <ddaney@avtrex.com>",
  "125852": "David Daney <ddaney@avtrex.com>",
  "128396": "Steve Kenton <skenton@ou.edu>",
  "128399": "Janis Johnson <janis187@us.ibm.com>",
  "128449": "Janis Johnson <janis187@us.ibm.com>",
  "128654": "Bernhard Fischer <aldot@gcc.gnu.org>",
  "129999": "Bernhard Fischer <aldot@gcc.gnu.org>",
  "130953": "Bernhard Fischer <aldot@gcc.gnu.org>",
  "133469": "Thomas Koenig <tkoenig@gcc.gnu.org>",
  "135427": "Tobias Burnus <burnus@net-b.de>",
  "136580": "Arnaud Charlet <charlet@adacore.com>",
  "136749": "Tobias Grosser <grosser@fim.uni-passau.de>",
  "136779": "Tobias Grosser <grosser@fim.uni-passau.de>",
  "136794": "Tobias Grosser <grosser@fim.uni-passau.de>",
  "136828": "Tobias Grosser <grosser@fim.uni-passau.de>",
  "136829": "Tobias Grosser <grosser@fim.uni-passau.de>",
  "136866": "Bernhard Fischer <aldot@gcc.gnu.org>",
  "137336": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "137337": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "140576": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "140683": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "140684": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "140995": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "140997": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141016": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141101": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141102": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141207": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141227": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141228": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141231": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141317": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "141421": "Jerry DeLisle <jvdelisle@gcc.gnu.org>",
  "144410": "Michael Eager <eager@eagercon.com>",
  "146520": "Andrey Belevantsev <abel@ispras.ru>",
  "148151": "Sebastian Pop <sebastian.pop@amd.com>",
  "151203": "Kai Tietz <kai.tietz@onevision.com>",
  "151204": "Kai Tietz <kai.tietz@onevision.com>",
  "155732": "Tobias Burnus <burnus@net-b.de>",
  "157383": "Tobias Burnus <burnus@net-b.de>",
  "158004": "Jan Hubicka <jh@suse.cz>",
  "158429": "Kaushik Phatak <kaushik.phatak@kpitcummins.com>",
  "158566": "Anatoly Sokolov <aesok@post.ru>",
  "159922": "Jon Beniston <jon@beniston.com>",
  "161255": "Arnaud Charlet <charlet@adacore.com>",
  "161313": "Nick Clifton <nickc@redhat.com>",
  "164616": "Steven G. Kargl <kargl@gcc.gnu.org>",
  "165699": "Francois-Xavier Coudert <fxcoudert@gcc.gnu.org>",
  "166058": "Nick Clifton <nickc@redhat.com>",
  "166205": "Steven G. Kargl <kargl@gcc.gnu.org>",
  "166210": "Paolo Carlini <paolo.carlini@oracle.com>",
  "167038": "Paolo Bonzini <bonzini@gnu.org>",
  "172203": "Basile Starynkevitch <basile@starynkevitch.net>",
  "175396": "Basile Starynkevitch <basile@starynkevitch.net>",
  "179066": "Benjamin Kosnik <bkoz@redhat.com>",
  "179221": "Benjamin Kosnik <bkoz@redhat.com>",
  "179580": "Benjamin Kosnik <bkoz@redhat.com>",
  "187826": "Jim MacArthur <jim.macarthur@arm.com>",
  "188059": "Diego Novillo <dnovillo@google.com>",
  "191401": "Ben Cheng <bccheng@google.com>",
  "192830": "Wei Mi <wmi@google.com>",
  "195967": "Sriraman Tallam <tmsriram@google.com>",
  "197963": "Andrew Hsieh <andrewhsieh@google.com>",
  "199910": "Michael Meissner <meissner@linux.vnet.ibm.com>",
  "199912": "Michael Meissner <meissner@linux.vnet.ibm.com>",
  "199913": "Michael Meissner <meissner@linux.vnet.ibm.com>",
  "200002": "Basile Starynkevitch <basile@starynkevitch.net>",
  "200005": "Basile Starynkevitch <basile@starynkevitch.net>",
  "200007": "Basile Starynkevitch <basile@starynkevitch.net>",
  "200232": "Sharad Singhai <singhai@google.com>",
  "201247": "Mark Kettenis <kettenis@openbsd.org>",
  "201248": "Mark Kettenis <kettenis@openbsd.org>",
  "202250": "Sandeep Kumar Singh <Sandeep.Singh2@kpitcummins.com>",
  "205603": "Sterling Augustine <saugustine@google.com>",
  "216075": "Kyrylo Tkachov <kyrylo.tkachov@arm.com>",
  "220367": "Dodji Seketeli <dodji@redhat.com>",
  "226476": "Michael Collison <michael.collison@linaro.org>",
  "227040": "Gleb Fotengauer-Malinovskiy <glebfm@altlinux.org>",
  "227536": "Alan Lawrence <alan.lawrence@arm.com>",
  "227538": "Alan Lawrence <alan.lawrence@arm.com>",
  "227925": "Michael Meissner <meissner@vnet.ibm.com>",
  "229449": "Caroline Tice <cmtice@google.com>",
  "229603": "Martin Liska <mliska@suse.cz>",
  "229895": "Michael Collison <michael.collison@linaro.org>",
  "230606": "Nathan Sidwell <nathan@acm.org>",
  "231284": "Segher Boessenkool <segher@kernel.crashing.org>",
  "246243": "Palmer Dabbelt <palmer@dabbelt.com>",
  "246733": "Thomas Preud'homme <thomas.preudhomme@arm.com>",
  "247214": "Arnaud Charlet <charlet@adacore.com>",
  "247220": "Arnaud Charlet <charlet@adacore.com>",
  "258108": "Thomas Preud'homme <thomas.preudhomme@arm.com>",
  "265618": "Sameera Deshpande <sameera.deshpande@linaro.org>",
  "272168": "Matthew Beliveau <mbelivea@redhat.com>",
  "275028": "Jakub Jelinek <jakub@redhat.com>",
  "275029": "Jakub Jelinek <jakub@redhat.com>",
  "277531": "Paul Thomas <pault@gcc.gnu.org>",
  "277948": "Tobias Burnus <tobias@codesourcery.com>",
  "277957": "Tobias Burnus <tobias@codesourcery.com>",
  "278689": "Tobias Burnus <tobias@codesourcery.com>",
  "279697": "Paul Thomas <pault@gcc.gnu.org>",

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 23:20 ` Richard Earnshaw (lists)
@ 2019-12-29 12:32   ` Eric S. Raymond
  2019-12-29 16:25     ` Richard Earnshaw (lists)
  2019-12-29 17:22     ` Jeff Law
  0 siblings, 2 replies; 26+ messages in thread
From: Eric S. Raymond @ 2019-12-29 12:32 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: gcc

Richard Earnshaw (lists) <Richard.Earnshaw@arm.com>:
> I've just commented those out for now; if anybody knows the correct
> addresses, please let me know.  Also, there's one joint list that I've
> not attempted to fix at this time.

> #  "28488": "Jim Kingdon <http://developer.redhat.com>",

That's Jim Kingdon the former CVS dev - I think he was involved in Subversion early too.

He's kingdon@cyclic.com or kingdon@panix.com, according to my back
mail, but since I think I remember that he did work at RedHat in the
late '90s kingdon@redhat.com would be a good bet too.
-- 
		<a href="http://www.catb.org/~esr/">Eric S. Raymond</a>


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-29 12:32   ` Eric S. Raymond
@ 2019-12-29 16:25     ` Richard Earnshaw (lists)
  2019-12-29 22:25       ` Eric S. Raymond
  2019-12-29 17:22     ` Jeff Law
  1 sibling, 1 reply; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-29 16:25 UTC (permalink / raw)
  To: esr; +Cc: gcc

On 29/12/2019 12:32, Eric S. Raymond wrote:
> Richard Earnshaw (lists) <Richard.Earnshaw@arm.com>:
>> I've just commented those out for now; if anybody knows the correct
>> addresses, please let me know.  Also, there's one joint list that I've
>> not attempted to fix at this time.
> 
>> #  "28488": "Jim Kingdon <http://developer.redhat.com>",
> 
> That's Jim Kingdon the former CVS dev - I think he was involved in
> Subversion early too.
> 
> He's kingdon@cyclic.com or kingdon@panix.com, according to my back
> mail, but since I think I remember that he did work at RedHat in the
> late '90s kingdon@redhat.com would be a good bet too.
> -- 
>                 <a href="http://www.catb.org/~esr/">Eric S. Raymond</a>
> 
> 

Based on https://gcc.gnu.org/ml/gcc-patches/2000-02/msg00179.html and
some other patches from his redhat address, I'm going to go with that.

Also, for this one:

#  "47044": "<bjornw@planetarion.com>",

There's some (relatively weak) evidence that this is Bjørn Wennberg (eg
https://groups.google.com/forum/#!msg/comp.databases.sybase/Uz8ICef9Qr8/uPwanH6is60J),
but in the absence of stronger evidence, I'm going to just put bjornw as
the name.

For the final one:

#  "49710": "Naveen Sharma,Nitin Gupta
<naveens@noida.hcltech.com,niting@noida.hcltech.com>",

The list emails show Naveen as the driver of the contribution during the
submission phase, so I'll use that name for the primary author, again
barring any stronger evidence that this is incorrect.

R.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-29 12:32   ` Eric S. Raymond
  2019-12-29 16:25     ` Richard Earnshaw (lists)
@ 2019-12-29 17:22     ` Jeff Law
  1 sibling, 0 replies; 26+ messages in thread
From: Jeff Law @ 2019-12-29 17:22 UTC (permalink / raw)
  To: esr, Richard Earnshaw (lists); +Cc: gcc

On Sun, 2019-12-29 at 07:32 -0500, Eric S. Raymond wrote:
> Richard Earnshaw (lists) <Richard.Earnshaw@arm.com>:
> > I've just commented those out for now; if anybody knows the correct
> > addresses, please let me know.  Also, there's one joint list that I've
> > not attempted to fix at this time.
> > #  "28488": "Jim Kingdon <http://developer.redhat.com>",
> 
> That's Jim Kingdon the former CVS dev - I think he was involved in Subversion early too.
And gdb eons ago.

> 
> He's kingdon@cyclic.com or kingdon@panix.com, according to my back
> mail, but since I think I remember that he did work at RedHat in the
> late '90s kingdon@redhat.com would be a good bet too.
Yea and @cygnus.com before that.  I haven't seen much, if anything,
from him in 15+ years.  He's not with Red Hat anymore.

jeff

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-28 18:54           ` Joseph Myers
  2019-12-28 19:18             ` Eric S. Raymond
@ 2019-12-29 17:44             ` Joseph Myers
  2019-12-29 22:52               ` Joseph Myers
  1 sibling, 1 reply; 26+ messages in thread
From: Joseph Myers @ 2019-12-29 17:44 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: Jakub Jelinek, gcc

On Sat, 28 Dec 2019, Joseph Myers wrote:

> Concretely, what I'd suggest is: convert ISO-8859-1 entries in the 
> checked-in list to UTF-8, removing anything that thereby becomes a 
> duplicate or unnecessary; handle anything whose encoding isn't simply 
> ISO-8859-1 or UTF-8 via a hardcoded entry in bugdb.py using hex escapes 
> like the existing such entries there.  Once the checked-in list is pure 
> UTF-8 it's easier for people to review and edit.  Where the issue is only 
> presence of ISO-8859 NBSP, or "" or () around the names, remove that in 
> the checked-in list and again remove duplicates.  That way the list can be 
> limited to non-encoding variations.

I've now made those changes to the checked-in list so it's pure UTF-8, and 
thus easier to review and edit.  We still need to implement code in 
bugdb.py to use that list to pick the preferred form from each list of 
variants (and people may wish to change the preferred forms in some 
cases).
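
To make the clean-up described above concrete, here is a small Python
sketch of that kind of normalization.  It is not the code in bugdb.py;
the function names are invented for this example, and it assumes every
input is either UTF-8 or ISO-8859-1 (anything else would still need a
hardcoded entry, as suggested in the quoted text).

```python
def to_text(raw):
    """Decode bytes as UTF-8, falling back to ISO-8859-1."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Every byte sequence is valid ISO-8859-1, so this cannot fail.
        return raw.decode("iso-8859-1")


def tidy(name):
    """Replace NBSP, collapse whitespace, drop wrapping "" or ()."""
    name = " ".join(name.replace("\u00a0", " ").split())
    if len(name) >= 2 and (name[0], name[-1]) in {('"', '"'), ("(", ")")}:
        name = name[1:-1].strip()
    return name


def normalize_all(raw_names):
    """Normalize each variant and drop those that become duplicates."""
    seen = []
    for raw in raw_names:
        name = tidy(to_text(raw))
        if name not in seen:
            seen.append(name)
    return seen
```

Trying UTF-8 first is a heuristic: an ISO-8859-1 string whose bytes also
happen to form valid UTF-8 would be decoded wrongly, though that is rare
for personal names.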

-- 
Joseph S. Myers
jsm@polyomino.org.uk

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-29 16:25     ` Richard Earnshaw (lists)
@ 2019-12-29 22:25       ` Eric S. Raymond
  2019-12-29 22:31         ` Richard Earnshaw (lists)
  0 siblings, 1 reply; 26+ messages in thread
From: Eric S. Raymond @ 2019-12-29 22:25 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: gcc

Richard Earnshaw (lists) <Richard.Earnshaw@arm.com>:
> Also, for this one:
> 
> #  "47044": "<bjornw@planetarion.com>",
> 
> There's some (relatively weak) evidence that this is Bjørn Wennberg (eg
> https://groups.google.com/forum/#!msg/comp.databases.sybase/Uz8ICef9Qr8/uPwanH6is60J),
> but in the absence of stronger evidence, I'm going to just put bjornw as
> the name.

What's weak about that?  The full email address matches.  Unless you
think there are two hackers named Bjorn, with a last initial of W,
running around using the same email address, I think we have a winner.
-- 
		<a href="http://www.catb.org/~esr/">Eric S. Raymond</a>


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-29 22:25       ` Eric S. Raymond
@ 2019-12-29 22:31         ` Richard Earnshaw (lists)
  2019-12-29 22:56           ` Eric S. Raymond
  0 siblings, 1 reply; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-29 22:31 UTC (permalink / raw)
  To: esr; +Cc: gcc

On 29/12/2019 22:24, Eric S. Raymond wrote:
> Richard Earnshaw (lists) <Richard.Earnshaw@arm.com>:
>> Also, for this one:
>>
>> #  "47044": "<bjornw@planetarion.com>",
>>
>> There's some (relatively weak) evidence that this is Bjørn Wennberg (eg
>> https://groups.google.com/forum/#!msg/comp.databases.sybase/Uz8ICef9Qr8/uPwanH6is60J),
>> but in the absence of stronger evidence, I'm going to just put bjornw as
>> the name.
> 
> What's weak about that?  The full email address matches.  Unless you
> think there are two hackers named Bjorn, with a last initial of W,
> running around using the same email address, I think we have a winner.
> 

Weak in the sense that it isn't proof given that the user name is
partially redacted.  There's nothing in the gcc archives that gives a
full name either, unfortunately.

Yes, it's the most likely match, but there's still an element of doubt.

R.

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-29 17:44             ` Joseph Myers
@ 2019-12-29 22:52               ` Joseph Myers
  0 siblings, 0 replies; 26+ messages in thread
From: Joseph Myers @ 2019-12-29 22:52 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: Jakub Jelinek, gcc

On Sun, 29 Dec 2019, Joseph Myers wrote:

> I've now made those changes to the checked-in list so it's pure UTF-8, and 
> thus easier to review and edit.  We still need to implement code in 
> bugdb.py to use that list to pick the preferred form from each list of 
> variants (and people may wish to change the preferred forms in some 
> cases).

I've now implemented that code in bugdb.py.

Given those fixes, I'm planning to compare author names from the 
reposurgeon conversion and Maxim's conversion, as I think cases where they 
find different authors (not just different email addresses) are good cases 
for manual review (we already have various such manual author fixups for 
individual commits in bugdb.py).  In fact that manual review may show up 
*other* commits that should be reattributed.  One example Maxim gave of a 
missing author was Aymeric Vincent.  That was a commit on 
premerge-fsf-branch where the reposurgeon heuristic "don't use 
attributions from ChangeLog for a ChangeLog-only commit" applied.  But 
whether or not the commit just adding the ChangeLog entry should be 
reattributed to the person named in that ChangeLog entry, the real changes 
that ChangeLog entry relates to are two previous commits (each file 
committed separately), so it shows up that those two previous commits 
ought to be reattributed.
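
The comparison described above could be sketched roughly as follows.
This is only an illustration in Python, not the actual tooling: it
assumes each conversion has been reduced to a mapping from revision
number to a 'Name <email>' string, and the function names are made up.

```python
def split_author(author):
    """Split 'Name <email>' into (name, lower-cased email)."""
    name, _, rest = author.partition("<")
    return name.strip(), rest.rstrip(">").strip().lower()


def name_mismatches(conv_a, conv_b):
    """List revisions where the two conversions disagree on the name.

    Differences in email address alone are ignored, since only a
    different author name is treated as needing manual review.
    """
    mismatches = []
    for rev in sorted(conv_a.keys() & conv_b.keys()):
        name_a, _ = split_author(conv_a[rev])
        name_b, _ = split_author(conv_b[rev])
        if name_a.casefold() != name_b.casefold():
            mismatches.append((rev, conv_a[rev], conv_b[rev]))
    return mismatches
```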

-- 
Joseph S. Myers
joseph@codesourcery.com

^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-29 22:31         ` Richard Earnshaw (lists)
@ 2019-12-29 22:56           ` Eric S. Raymond
  2019-12-30 15:38             ` Richard Earnshaw (lists)
  0 siblings, 1 reply; 26+ messages in thread
From: Eric S. Raymond @ 2019-12-29 22:56 UTC (permalink / raw)
  To: Richard Earnshaw (lists); +Cc: gcc

Richard Earnshaw (lists) <Richard.Earnshaw@arm.com>:
> Weak in the sense that it isn't proof given that the user name is
> partially redacted.  There's nothing in the gcc archives that gives a
> full name either, unfortunately.
> 
> Yes, it's the most likely match, but there's still an element of doubt.
> 
> R.

https://groups.google.com/forum/#!msg/comp.databases.sybase/Uz8ICef9Qr8/uPwanH6is60

If you open his message to Michel Peppler, you'll see a sig block that
says:

 bjornw@planetarion.com  Bjørn Wennberg, Fifth Season AS

It's him, yep.  Be sure to get the ø right when you fill in the name. :-)
-- 
		<a href="http://www.catb.org/~esr/">Eric S. Raymond</a>


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: Git conversion: fixing email addresses from ChangeLog files
  2019-12-29 22:56           ` Eric S. Raymond
@ 2019-12-30 15:38             ` Richard Earnshaw (lists)
  0 siblings, 0 replies; 26+ messages in thread
From: Richard Earnshaw (lists) @ 2019-12-30 15:38 UTC (permalink / raw)
  To: esr; +Cc: gcc

On 29/12/2019 22:56, Eric S. Raymond wrote:
> Richard Earnshaw (lists) <Richard.Earnshaw@arm.com>:
>> Weak in the sense that it isn't proof given that the user name is
>> partially redacted.  There's nothing in the gcc archives that gives a
>> full name either, unfortunately.
>>
>> Yes, it's the most likely match, but there's still an element of doubt.
>>
>> R.
> 
> https://groups.google.com/forum/#!msg/comp.databases.sybase/Uz8ICef9Qr8/uPwanH6is60
> 
> If you open his message to Michel Peppler, you'll see a sig block that
> says:
> 
>  bjornw@planetarion.com  Bjørn Wennberg, Fifth Season AS
> 
> It's him, yep.  Be sure to get the ø right when you fill in the name. :-)
> 

Excellent.  Then as you say, we have a match.

R.

^ permalink raw reply	[flat|nested] 26+ messages in thread

end of thread, other threads:[~2019-12-30 15:38 UTC | newest]

Thread overview: 26+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-12-27 19:47 Git conversion: fixing email addresses from ChangeLog files Richard Earnshaw (lists)
2019-12-28 11:37 ` Segher Boessenkool
2019-12-28 13:05   ` Joseph Myers
2019-12-28 14:54     ` Segher Boessenkool
2019-12-28 16:34       ` Richard Earnshaw (lists)
2019-12-28 20:11         ` Segher Boessenkool
2019-12-28 23:09           ` Richard Earnshaw (lists)
2019-12-28 16:36       ` Joseph Myers
2019-12-28 12:04 ` Jakub Jelinek
2019-12-28 16:55   ` Richard Earnshaw (lists)
2019-12-28 17:14     ` Joseph Myers
2019-12-28 17:23       ` Richard Earnshaw (lists)
2019-12-28 17:34         ` Joseph Myers
2019-12-28 18:54           ` Joseph Myers
2019-12-28 19:18             ` Eric S. Raymond
2019-12-29 17:44             ` Joseph Myers
2019-12-29 22:52               ` Joseph Myers
2019-12-28 18:37     ` Andreas Schwab
2019-12-28 23:20 ` Richard Earnshaw (lists)
2019-12-29 12:32   ` Eric S. Raymond
2019-12-29 16:25     ` Richard Earnshaw (lists)
2019-12-29 22:25       ` Eric S. Raymond
2019-12-29 22:31         ` Richard Earnshaw (lists)
2019-12-29 22:56           ` Eric S. Raymond
2019-12-30 15:38             ` Richard Earnshaw (lists)
2019-12-29 17:22     ` Jeff Law
