Message-ID: <3d642e058b5fde27ec3dd955183a3b2de934174c.camel@redhat.com>
Subject: Re: Error when accessing git read-only archive
From: David Malcolm
To: Jonathan Wakely, Thomas Koenig
Cc: gcc mailing list
Date: Wed, 15 Sep 2021 09:21:04 -0400
References: <8fe140f6-804b-0180-e4d3-5293ee11905c@netcologne.de>

On Mon, 2021-09-13 at 14:03 +0100, Jonathan Wakely via Gcc wrote:
> On Mon, 13 Sept 2021 at 14:01, Jonathan Wakely wrote:
> >
> > On Mon, 13 Sept 2021 at 13:53, Thomas Koenig via Gcc <gcc@gcc.gnu.org> wrote:
> > >
> > > Hi,
> > >
> > > I just got an error when accessing the gcc git pages at
> > > https://gcc.gnu.org/git/gitweb.cgi?p=gcc.git , it is:
> > >
> > > This page contains the following errors:
> > > error on line 91 at column 6: XML declaration allowed only at the start
> > > of the document
> > > Below is a rendering of the page up to the first error.
> >
> > The web server seems to restart the page in the middle of the HTML,
> > the content contains:
> >
> > Content-type: text/html
> >
> > "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
>
> Ah, the "second" page it's trying to display (in the middle of the
> first) is an error:
>
> 500 - Internal Server Error
> Wide character in subroutine entry at /var/www/git/gitweb.cgi line 2208.

Summarizing some notes from IRC:

The last commit it manages to print successfully in that log seems to
be:
  c012297c9d5dfb177adf1423bdd05e5f4b87e5ec
so it appears that:
  f42e95a830ab48e59389065ce79a013a519646f1
is triggering the issue, and indeed
  https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=f42e95a830ab48e59389065ce79a013a519646f1
fails in a similar way, whereas other commits work.

It appears to be due to the "ł" character in the email address of the
Author, in that:

  commit c012297c9d5dfb177adf1423bdd05e5f4b87e5ec
  Author: Jan-Benedict Glaw

works, whereas:

  commit f42e95a830ab48e59389065ce79a013a519646f1
  Author: Jan-Benedict Glaw

doesn't.

  git show f42e95a830ab48e59389065ce79a013a519646f1 | hexdump -C

shows:

00000030  41 75 74 68 6f 72 3a 20  4a 61 6e 2d 42 65 6e 65  |Author: Jan-Bene|
00000040  64 69 63 74 20 47 6c 61  77 20 3c 6a 62 67 6c 61  |dict Glaw <jbgla|
00000050  [...]                                             |[...].D|
00000060  61 74 65 3a 20 20 20 4d  6f 6e 20 53 65 70 20 31  |ate:   Mon Sep 1|

i.e. we have the two bytes 0xc5 0x82, which is the UTF-8 encoding of
"ł".

$ git format-patch c012297c9d5dfb177adf1423bdd05e5f4b87e5ec^^..c012297c9d5dfb177adf1423bdd05e5f4b87e5ec
0001-Fix-multi-statment-macro.patch
0002-cr16-elf-is-now-obsoleted.patch

$ file *.patch
0001-Fix-multi-statment-macro.patch:  unified diff output, UTF-8 Unicode text
0002-cr16-elf-is-now-obsoleted.patch: unified diff output, ASCII text

Hope this is helpful
Dave
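
For anyone wanting to double-check the byte-level claim above without a
hexdump, here is a minimal Python sketch. It only confirms that 0xc5 0x82
is the UTF-8 encoding of "ł" and shows one way to flag non-ASCII
characters in a header value. The helper names and the example.org
addresses are made up for illustration; this is not how gitweb handles
the data, and it says nothing about where line 2208 goes wrong.

```python
# Sketch: verify that the bytes 0xc5 0x82 are the UTF-8 encoding of "ł",
# and flag non-ASCII characters in a header value.
# utf8_hex and non_ascii_chars are hypothetical helper names.

def utf8_hex(text):
    """Return the UTF-8 encoding of text as space-separated hex bytes."""
    return " ".join(f"{b:02x}" for b in text.encode("utf-8"))


def non_ascii_chars(value):
    """Return the characters in value that fall outside the ASCII range."""
    return [ch for ch in value if ord(ch) > 127]


if __name__ == "__main__":
    print(utf8_hex("ł"))                          # c5 82
    print(bytes([0xC5, 0x82]).decode("utf-8"))    # ł
    # Made-up addresses, for illustration only.
    for addr in ("user@example.org", "user@exampłe.org"):
        print(addr, non_ascii_chars(addr))
```

Running a check like non_ascii_chars over the author fields of a log
would be one way to look for other commits that might trip the same
error, assuming the non-ASCII address really is the trigger.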