On 27/12/2019 19:47, Richard Earnshaw (lists) wrote: > Email addresses from the ChangeLog files are not validated during > commits, so a number of typos exist in the extracted data. I've > extracted the 'Author:' entry from a prototype conversion and then piped > that through sort and uniq -c. Subsequent analysis shows the following > addresses/names that are likely in need of some resolution. > > Several of the names differ only in case and a case-insensitive sort > would ignore these. But some clearly show typing errors. > > I've marked with a '*' those entries that I believe should be canonical, > but if you think I've made a mistake somewhere, please let me know. > It's easier to fix this before I write a script to do the changes than > afterwards. > > There are many entries that look like unix machine names rather than > genuine email addresses for a domain. I have *not* attempted to > canonicalize those, but if some folk would like that doing, please > supply entries in the form > > 'name to match ': 'name to use ', > > form and I'll paste those into the conversion script. > Another source of missed authorship is where the changelogs file was unable to parse the email address from the changelog due to syntactic errors. The list below is from my last run a couple of days back. I've added fixups for these commits as shown in the the list. There's only two committers for whom I've been unable to trace the email address, so I've just commented that one out for now; if anybody knows the correct addresses, please let me know. Also, there's one joint list that I've not attempted to fix at this time. R.