From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: by sourceware.org (Postfix, from userid 48) id 895213858C50; Thu, 30 Mar 2023 15:16:01 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 895213858C50 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org; s=default; t=1680189361; bh=AKkmEReGmgbra691WOCFMP7/hm9Y7kxMs05y3UziuAk=; h=From:To:Subject:Date:In-Reply-To:References:From; b=FO04pg62uuP0xQwSAJJ/UYVBOV5BMfXczuRuiZv328OMSF13EGLyzrpv9rUK604xw Hpl1YKZO274DiyStYG9fY1db1BftoBdnc/jZiuN0fomrPAJL2PzCPEMX/ea1OSOqwi BYwAyWAH65U6ZVkj1W0uw/hHtjoTsIGp1o1T9jEE= From: "hubicka at gcc dot gnu.org" To: gcc-bugs@gcc.gnu.org Subject: [Bug target/109137] [12 regression] Compiling ffmpeg with -m32 on x86_64-pc-linux-gnu hangs on libavcodec/h264_cabac.c since r12-9086-g489c81db7d4f75 Date: Thu, 30 Mar 2023 15:16:00 +0000 X-Bugzilla-Reason: CC X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: gcc X-Bugzilla-Component: target X-Bugzilla-Version: 13.0 X-Bugzilla-Keywords: compile-time-hog, ice-on-invalid-code, inline-asm X-Bugzilla-Severity: normal X-Bugzilla-Who: hubicka at gcc dot gnu.org X-Bugzilla-Status: REOPENED X-Bugzilla-Resolution: X-Bugzilla-Priority: P1 X-Bugzilla-Assigned-To: unassigned at gcc dot gnu.org X-Bugzilla-Target-Milestone: 12.3 X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: http://gcc.gnu.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 List-Id: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=3D109137 --- Comment #21 from Jan Hubicka --- Zen 1-3 changes were intentional in the original tuning patch (it is also briefly mentioned in the commit message). By allowing 256 bit AVX moves instead of 64bit integer moves (or 128bit) we can move bigger blocks of mem= ory without loops and it was faster in micro-benchmarks I made on all zens, eve= n on znver1. We also automatically go for 128bit moves when ISA allows that. We could revert that part of backport, but won't we get same hangs with -march=3Dznver4 or core512 which also enables avx512 moves? (I am rebuilding gcc12 to see what happens)=