From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (qmail 28842 invoked by alias); 14 Feb 2017 11:25:40 -0000 Mailing-List: contact gcc-patches-help@gcc.gnu.org; run by ezmlm Precedence: bulk List-Id: List-Archive: List-Post: List-Help: Sender: gcc-patches-owner@gcc.gnu.org Received: (qmail 28832 invoked by uid 89); 14 Feb 2017 11:25:39 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=0.5 required=5.0 tests=AWL,BAYES_50,RCVD_IN_DNSWL_NONE,SPF_PASS,URIBL_RED autolearn=ham version=3.3.2 spammy=D*mentor.com, gr, tang, openacc X-HELO: relay1.mentorg.com Received: from relay1.mentorg.com (HELO relay1.mentorg.com) (192.94.38.131) by sourceware.org (qpsmtpd/0.93/v0.84-503-g423c35a) with ESMTP; Tue, 14 Feb 2017 11:25:29 +0000 Received: from nat-ies.mentorg.com ([192.94.31.2] helo=svr-ies-mbx-01.mgc.mentorg.com) by relay1.mentorg.com with esmtp id 1cdbEc-0007SE-Be from Thomas_Schwinge@mentor.com for gcc-patches@gcc.gnu.org; Tue, 14 Feb 2017 03:25:26 -0800 Received: from hertz.schwinge.homeip.net (137.202.0.87) by svr-ies-mbx-01.mgc.mentorg.com (139.181.222.1) with Microsoft SMTP Server (TLS) id 15.0.1210.3; Tue, 14 Feb 2017 11:25:23 +0000 From: Thomas Schwinge To: Chung-Lin Tang CC: gcc-patches Subject: Re: [gomp4] Async related additions to OpenACC runtime library In-Reply-To: <044c4fdb-e659-6029-1da1-4f6bfc05ca9c@mentor.com> References: <044c4fdb-e659-6029-1da1-4f6bfc05ca9c@mentor.com> User-Agent: Notmuch/0.9-101-g81dad07 (http://notmuchmail.org) Emacs/24.5.1 (x86_64-pc-linux-gnu) Date: Tue, 14 Feb 2017 11:29:00 -0000 Message-ID: <87fujhqajk.fsf@hertz.schwinge.homeip.net> MIME-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable X-ClientProxiedBy: svr-ies-mbx-01.mgc.mentorg.com (139.181.222.1) To svr-ies-mbx-01.mgc.mentorg.com (139.181.222.1) X-SW-Source: 2017-02/txt/msg00898.txt.bz2 Hi Chung-Lin! On Mon, 13 Feb 2017 18:13:42 +0800, Chung-Lin Tang wrote: > Tested and committed to gomp-4_0-branch. Thanks! (Not yet reviewed.) Testing this, I saw a lot of regressions, and in r245427 just committed the following to gomp-4_0-branch to address OCthese. Did you simply forget to commit your changes to libgomp/libgomp.map, or why did this work for you? Please verify: commit bd5613600754bd7a1fe85990eb3b7b6b5f2e1543 Author: tschwinge Date: Tue Feb 14 11:20:31 2017 +0000 Update libgomp/libgomp.map for OpenACC async functions =20=20=20=20 libgomp/ * libgomp.map: Add OACC_2.5 version, and add acc_copyin_async, acc_copyin_async_32_h_, acc_copyin_async_64_h_, acc_copyin_async_array_h_, acc_copyout_async, acc_copyout_async_32_h_, acc_copyout_async_64_h_, acc_copyout_async_array_h_, acc_create_async, acc_create_async_32_h_, acc_create_async_64_h_, acc_create_async_array_h_, acc_delete_async, acc_delete_async_32_h_, acc_delete_async_64_h_, acc_delete_async_array_h_, acc_get_default_async, acc_get_default_async_h_, acc_memcpy_from_device_async, acc_memcpy_to_device_async, acc_set_default_async, acc_set_default_async_h_, acc_update_device_async, acc_update_device_async_32_h_, acc_update_device_async_64_h_, acc_update_device_async_array_h_, acc_update_self_async, acc_update_self_async_32_h_, acc_update_self_async_64_h_, and acc_update_self_async_array_h_. Add GOMP_PLUGIN_1.2 version, a= nd add GOMP_PLUGIN_acc_thread_default_async. =20=20=20=20 git-svn-id: svn+ssh://gcc.gnu.org/svn/gcc/branches/gomp-4_0-branch@2454= 27 138bc75d-0d04-0410-961f-82ee72b054a4 --- libgomp/ChangeLog.gomp | 20 ++++++++++++++++++++ libgomp/libgomp.map | 39 +++++++++++++++++++++++++++++++++++++++ 2 files changed, 59 insertions(+) diff --git libgomp/ChangeLog.gomp libgomp/ChangeLog.gomp index 0a5f601..b811c28 100644 --- libgomp/ChangeLog.gomp +++ libgomp/ChangeLog.gomp @@ -1,3 +1,23 @@ +2017-02-14 Thomas Schwinge + + * libgomp.map: Add OACC_2.5 version, and add acc_copyin_async, + acc_copyin_async_32_h_, acc_copyin_async_64_h_, + acc_copyin_async_array_h_, acc_copyout_async, + acc_copyout_async_32_h_, acc_copyout_async_64_h_, + acc_copyout_async_array_h_, acc_create_async, + acc_create_async_32_h_, acc_create_async_64_h_, + acc_create_async_array_h_, acc_delete_async, + acc_delete_async_32_h_, acc_delete_async_64_h_, + acc_delete_async_array_h_, acc_get_default_async, + acc_get_default_async_h_, acc_memcpy_from_device_async, + acc_memcpy_to_device_async, acc_set_default_async, + acc_set_default_async_h_, acc_update_device_async, + acc_update_device_async_32_h_, acc_update_device_async_64_h_, + acc_update_device_async_array_h_, acc_update_self_async, + acc_update_self_async_32_h_, acc_update_self_async_64_h_, and + acc_update_self_async_array_h_. Add GOMP_PLUGIN_1.2 version, and + add GOMP_PLUGIN_acc_thread_default_async. + 2017-02-13 Cesar Philippidis =20 * plugin/plugin-nvptx.c (nvptx_exec): Adjust the default num_gangs. diff --git libgomp/libgomp.map libgomp/libgomp.map index b047ad9..2c9a13d 100644 --- libgomp/libgomp.map +++ libgomp/libgomp.map @@ -378,6 +378,40 @@ OACC_2.0 { acc_set_cuda_stream; }; =20 +OACC_2.5 { + global: + acc_copyin_async; + acc_copyin_async_32_h_; + acc_copyin_async_64_h_; + acc_copyin_async_array_h_; + acc_copyout_async; + acc_copyout_async_32_h_; + acc_copyout_async_64_h_; + acc_copyout_async_array_h_; + acc_create_async; + acc_create_async_32_h_; + acc_create_async_64_h_; + acc_create_async_array_h_; + acc_delete_async; + acc_delete_async_32_h_; + acc_delete_async_64_h_; + acc_delete_async_array_h_; + acc_get_default_async; + acc_get_default_async_h_; + acc_memcpy_from_device_async; + acc_memcpy_to_device_async; + acc_set_default_async; + acc_set_default_async_h_; + acc_update_device_async; + acc_update_device_async_32_h_; + acc_update_device_async_64_h_; + acc_update_device_async_array_h_; + acc_update_self_async; + acc_update_self_async_32_h_; + acc_update_self_async_64_h_; + acc_update_self_async_array_h_; +} OACC_2.0; + GOACC_2.0 { global: GOACC_data_end; @@ -417,3 +451,8 @@ GOMP_PLUGIN_1.1 { global: GOMP_PLUGIN_target_task_completion; } GOMP_PLUGIN_1.0; + +GOMP_PLUGIN_1.2 { + global: + GOMP_PLUGIN_acc_thread_default_async; +} GOMP_PLUGIN_1.1; Gr=C3=BC=C3=9Fe Thomas