From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from cc-smtpout1.netcologne.de (cc-smtpout1.netcologne.de [IPv6:2001:4dd0:100:1062:25:2:0:1]) by sourceware.org (Postfix) with ESMTPS id 9EC4838618E2 for ; Thu, 18 Mar 2021 18:24:26 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 9EC4838618E2 Received: from cc-smtpin1.netcologne.de (cc-smtpin1.netcologne.de [89.1.8.201]) by cc-smtpout1.netcologne.de (Postfix) with ESMTP id D84591361B; Thu, 18 Mar 2021 19:24:24 +0100 (CET) Received: from localhost (localhost [127.0.0.1]) by cc-smtpin1.netcologne.de (Postfix) with ESMTP id D4A5811F23; Thu, 18 Mar 2021 19:24:24 +0100 (CET) Received: from [2001:4dd6:22d3:0:d9ee:f94a:6429:461d] (helo=cc-smtpin1.netcologne.de) by localhost with ESMTP (eXpurgate 4.19.0) (envelope-from ) id 60539ad8-7ac5-7f0000012729-7f000001d43a-1 for ; Thu, 18 Mar 2021 19:24:24 +0100 Received: from linux-p51k.fritz.box (2001-4dd6-22d3-0-d9ee-f94a-6429-461d.ipv6dyn.netcologne.de [IPv6:2001:4dd6:22d3:0:d9ee:f94a:6429:461d]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by cc-smtpin1.netcologne.de (Postfix) with ESMTPSA; Thu, 18 Mar 2021 19:24:21 +0100 (CET) Subject: Re: MATMUL broken with frontend optimization. To: Steve Kargl , Richard Biener Cc: Tobias Burnus , "fortran@gcc.gnu.org" References: <20210318074849.GA22541@troutmask.apl.washington.edu> <563cee48-fbcc-09bc-0cd1-f05082e4feb3@codesourcery.com> <20210318161347.GA24201@troutmask.apl.washington.edu> <1780c473-3523-316f-c372-52824d062a01@netcologne.de> From: Thomas Koenig Message-ID: Date: Thu, 18 Mar 2021 19:24:21 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.7.0 MIME-Version: 1.0 In-Reply-To: <1780c473-3523-316f-c372-52824d062a01@netcologne.de> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: de-DE Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-3.8 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, NICE_REPLY_A, RCVD_IN_DNSWL_LOW, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: fortran@gcc.gnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Fortran mailing list List-Unsubscribe: , List-Archive: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 18 Mar 2021 18:24:27 -0000 I didn't finish the previous mail before hitting "send", so here is the postscript... > OK, so I've had a bit of time to look at the actual test case.  I > missed one very important detail before:  This is a vector-matrix > operation. > > For this, we do not have a good library routine (Harald just > removed it because of a bug in buffering), and -fexternal-blas > does not work because we do not handle calls to anything but > *GEMM. A vector-matrix multiplicatin would be a call to *GEMV, a worthy goal, but out of scope so close to a release. > The idea is that, for a vector-matrix-multiplication, the > compiler should have enough information about the information about how to optimize for the relevant architecture, especially if the user compilers with the right flags. So, the current idea is that, if we optimize, we can inline. What would a better heuristic be? Best regards Thomas