From mboxrd@z Thu Jan 1 00:00:00 1970
From: Uros Bizjak
Date: Fri, 12 May 2023 18:43:50 +0200
Subject: [PATCH] i386: Remove mulv2si emulated sequence for TARGET_SSE2 [PR109797]
To: "gcc-patches@gcc.gnu.org"
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="000000000000f76ef105fb81cf50"

--000000000000f76ef105fb81cf50
Content-Type: text/plain; charset="UTF-8"

Remove the mulv2si emulated sequence for TARGET_SSE2 and enable only
the native PMULLD instruction for TARGET_SSE4_1.

Ideally, the vectorization for TARGET_SSE2 should depend on more
precise cost estimation (the PR contains a patch for
ix86_multiplication_cost), but even with the patched cost function the
runtime regression was not fixed.

    PR target/109797

gcc/ChangeLog:

    * config/i386/mmx.md (mulv2si3): Remove expander.
    (mulv2si3): Rename insn pattern from *mulv2si.

Bootstrapped and regression tested on x86_64-linux-gnu {,-m32}.

Pushed to master.

Uros.

--000000000000f76ef105fb81cf50
Content-Type: text/plain; charset="US-ASCII"; name="p.diff.txt"
Content-Disposition: attachment; filename="p.diff.txt"
Content-Transfer-Encoding: 7bit
X-Attachment-Id: f_lhksfjlj0

diff --git a/gcc/config/i386/mmx.md b/gcc/config/i386/mmx.md
index e7ca921dd2b..b2954fff8ae 100644
--- a/gcc/config/i386/mmx.md
+++ b/gcc/config/i386/mmx.md
@@ -2092,39 +2092,7 @@ (define_insn "*<insn><mode>3"
    (set_attr "type" "sseadd")
    (set_attr "mode" "TI")])
 
-(define_expand "mulv2si3"
-  [(set (match_operand:V2SI 0 "register_operand")
-	(mult:V2SI
-	  (match_operand:V2SI 1 "register_operand")
-	  (match_operand:V2SI 2 "register_operand")))]
-  "TARGET_MMX_WITH_SSE"
-{
-  if (!TARGET_SSE4_1)
-    {
-      rtx op1 = lowpart_subreg (V4SImode, force_reg (V2SImode, operands[1]),
-				V2SImode);
-      rtx op2 = lowpart_subreg (V4SImode, force_reg (V2SImode, operands[2]),
-				V2SImode);
-
-      rtx tmp1 = gen_reg_rtx (V4SImode);
-      emit_insn (gen_vec_interleave_lowv4si (tmp1, op1, op1));
-      rtx tmp2 = gen_reg_rtx (V4SImode);
-      emit_insn (gen_vec_interleave_lowv4si (tmp2, op2, op2));
-
-      rtx res = gen_reg_rtx (V2DImode);
-      emit_insn (gen_vec_widen_umult_even_v4si (res, tmp1, tmp2));
-
-      rtx op0 = gen_reg_rtx (V4SImode);
-      emit_insn (gen_sse2_pshufd_1 (op0, gen_lowpart (V4SImode, res),
-				    const0_rtx, const2_rtx,
-				    const0_rtx, const2_rtx));
-
-      emit_move_insn (operands[0], lowpart_subreg (V2SImode, op0, V4SImode));
-      DONE;
-    }
-})
-
-(define_insn "*mulv2si3"
+(define_insn "mulv2si3"
   [(set (match_operand:V2SI 0 "register_operand" "=Yr,*x,v")
 	(mult:V2SI
 	  (match_operand:V2SI 1 "register_operand" "%0,0,v")
--000000000000f76ef105fb81cf50--
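
For readers who want to see what the removed SSE2 fallback computed, here
is a rough lane-by-lane model in portable C. It is only a sketch and not
GCC code: the helper names (interleave_low, widen_umult_even, pshufd_0202,
mulv2si_emulated) are hypothetical, registers are modeled as plain arrays,
and the upper two lanes of the widened operands, which are don't-care in
the real sequence, are simply set to zero here. The steps follow the order
of the expander removed by this patch: interleave each operand with
itself, do a widening multiply of the even lanes, then shuffle the low
halves of the two products back into lanes 0 and 1.

```c
#include <stdint.h>

/* Hypothetical scalar model, not GCC code.  A V4SI register is modeled
   as four uint32_t lanes and a V2DI register as two uint64_t lanes.  */

/* Models punpckldq (vec_interleave_lowv4si): {x0, y0, x1, y1}.  */
static void
interleave_low (const uint32_t x[4], const uint32_t y[4], uint32_t out[4])
{
  uint32_t x0 = x[0], x1 = x[1], y0 = y[0], y1 = y[1];
  out[0] = x0; out[1] = y0; out[2] = x1; out[3] = y1;
}

/* Models pmuludq (vec_widen_umult_even_v4si): full 64-bit products of
   the even lanes 0 and 2.  */
static void
widen_umult_even (const uint32_t a[4], const uint32_t b[4], uint64_t out[2])
{
  out[0] = (uint64_t) a[0] * b[0];
  out[1] = (uint64_t) a[2] * b[2];
}

/* Models pshufd with selectors {0, 2, 0, 2} on the V4SI view of a V2DI
   value: lane 2k is the low half of product k, lane 2k+1 the high half.  */
static void
pshufd_0202 (const uint64_t in[2], uint32_t out[4])
{
  uint32_t v[4] = { (uint32_t) in[0], (uint32_t) (in[0] >> 32),
                    (uint32_t) in[1], (uint32_t) (in[1] >> 32) };
  out[0] = v[0]; out[1] = v[2]; out[2] = v[0]; out[3] = v[2];
}

/* Emulated V2SI multiply: r[i] = low 32 bits of a[i] * b[i].  */
void
mulv2si_emulated (const uint32_t a[2], const uint32_t b[2], uint32_t r[2])
{
  /* Widen to V4SI; the upper lanes are don't-care, zeroed in this model.  */
  uint32_t op1[4] = { a[0], a[1], 0, 0 };
  uint32_t op2[4] = { b[0], b[1], 0, 0 };
  uint32_t tmp1[4], tmp2[4], op0[4];
  uint64_t res[2];

  interleave_low (op1, op1, tmp1);     /* {a0, a0, a1, a1} */
  interleave_low (op2, op2, tmp2);     /* {b0, b0, b1, b1} */
  widen_umult_even (tmp1, tmp2, res);  /* {a0*b0, a1*b1} as 64-bit */
  pshufd_0202 (res, op0);              /* low halves into lanes 0 and 1 */
  r[0] = op0[0];
  r[1] = op0[1];
}
```

The model uses four steps where a TARGET_SSE4_1 target needs a single
PMULLD; after this patch only that native instruction is used for
mulv2si3.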