From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id 905E13858D28; Sat, 25 Mar 2023 01:01:31 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 905E13858D28 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1679706091; bh=Tyj99DifWEVPGb2fZ50dwGHNccwsDP9xiUVfdeQ3GZs=; h=From:To:Subject:Date:In-Reply-To:References:From; b=m81xmbo2ntozO/wNza/B2sPcNVzMiXYJpwCYFqM9pH8FHidJ2h1r/MtUykml8SNSe QfakarobnSMI1wfhi/t4yl1MEVoDWMssCTmu6py3vAHxtN+VZVCvDYVg4WAOQohnkM VJunnRa27t3XvbGb5BlKW72BaVN5vFYhObalvTCE= From: "dmalcolm at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug analyzer/109098] Encoding errors on SARIF output for non-UTF-8 source files Date: Sat, 25 Mar 2023 01:01:27 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: analyzer X-Bugzilla-Version: 13.0 X-Bugzilla-Keywords: diagnostic X-Bugzilla-Severity: normal X-Bugzilla-Who: dmalcolm at gcc dot gnu.org X-Bugzilla-Status: RESOLVED X-Bugzilla-Resolution: FIXED X-Bugzilla-Priority: P3 X-Bugzilla-Assigned-To: dmalcolm at gcc dot gnu.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: bug_status resolution Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 List-Id: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D109098 David Malcolm changed: What |Removed |Added ---------------------------------------------------------------------------- Status|ASSIGNED |RESOLVED Resolution|--- |FIXED --- Comment #7 from David Malcolm --- Should be fixed on trunk by r13-6861-gd495ea2b232f3e: https://gcc.gnu.org/git/?p=3Dgcc.git;a=3Dcommit;h=3Dd495ea2b232f3eb50155d7c= 7362c09a744766746 https://gcc.gnu.org/git/?p=3Dgcc.git;a=3Dcommitdiff_plain;h=3Dd495ea2b232f3= eb50155d7c7362c09a744766746 The invalid UTF-8 in the patch seems to have broken the server-side script: Enumerating objects: 51, done. Counting objects: 100% (51/51), done. Delta compression using up to 64 threads Compressing objects: 100% (29/29), done. Writing objects: 100% (29/29), 7.74 KiB | 1.29 MiB/s, done. Total 29 (delta 22), reused 0 (delta 0), pack-reused 0 remote: Traceback (most recent call last): remote: File "hooks/post_receive.py", line 118, in remote: post_receive(refs_data, args.submitter_email) remote: File "hooks/post_receive.py", line 65, in post_receive remote: submitter_email) remote: File "hooks/post_receive.py", line 47, in post_receive_one remote: update.send_email_notifications() remote: File "/sourceware1/projects/src-home/git-hooks/hooks/updates/__init__.py", line = 189, in send_email_notifications remote: self.__email_new_commits() remote: File "/sourceware1/projects/src-home/git-hooks/hooks/updates/__init__.py", line 1031, in __email_new_commits remote: commit, self.get_standard_commit_email(commit)) remote: File "/sourceware1/projects/src-home/git-hooks/hooks/updates/__init__.py", line 1011, in __send_commit_email remote: default_diff=3Demail.diff) remote: File "/sourceware1/projects/src-home/git-hooks/hooks/updates/__init__.py", line = 946, in __maybe_get_email_custom_contents remote: hook_input=3Djson.dumps(hooks_data), remote: File "/usr/lib64/python2.7/json/__init__.py", line 244, in dumps remote: return _default_encoder.encode(obj) remote: File "/usr/lib64/python2.7/json/encoder.py", line 207, in encode remote: chunks =3D self.iterencode(o, _one_shot=3DTrue) remote: File "/usr/lib64/python2.7/json/encoder.py", line 270, in iterenc= ode remote: return _iterencode(o, 0) remote: UnicodeDecodeError: 'utf8' codec can't decode byte 0x80 in position 13147: invalid start byte To git+ssh://gcc.gnu.org/git/gcc.git 13ec81eb4c3..d495ea2b232 master -> master=