From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from EUR05-VI1-obe.outbound.protection.outlook.com (mail-vi1eur05on2055.outbound.protection.outlook.com [40.107.21.55]) by sourceware.org (Postfix) with ESMTPS id 687843AA8CB9 for ; Thu, 6 May 2021 14:27:13 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 687843AA8CB9 Received: from AM5PR0701CA0065.eurprd07.prod.outlook.com (2603:10a6:203:2::27) by AM0PR08MB5442.eurprd08.prod.outlook.com (2603:10a6:208:182::7) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4108.25; Thu, 6 May 2021 14:27:11 +0000 Received: from VE1EUR03FT043.eop-EUR03.prod.protection.outlook.com (2603:10a6:203:2:cafe::aa) by AM5PR0701CA0065.outlook.office365.com (2603:10a6:203:2::27) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4129.13 via Frontend Transport; Thu, 6 May 2021 14:27:11 +0000 X-MS-Exchange-Authentication-Results: spf=pass (sender IP is 63.35.35.123) smtp.mailfrom=arm.com; sourceware.org; dkim=pass (signature was verified) header.d=armh.onmicrosoft.com;sourceware.org; dmarc=pass action=none header.from=arm.com; Received-SPF: Pass (protection.outlook.com: domain of arm.com designates 63.35.35.123 as permitted sender) receiver=protection.outlook.com; client-ip=63.35.35.123; helo=64aa7808-outbound-1.mta.getcheckrecipient.com; Received: from 64aa7808-outbound-1.mta.getcheckrecipient.com (63.35.35.123) by VE1EUR03FT043.mail.protection.outlook.com (10.152.19.122) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4108.25 via Frontend Transport; Thu, 6 May 2021 14:27:11 +0000 Received: ("Tessian outbound 13cdc29c30b8:v91"); Thu, 06 May 2021 14:27:10 +0000 X-CheckRecipientChecked: true X-CR-MTA-CID: 21db2ca2f28f7bdf X-CR-MTA-TID: 64aa7808 Received: from b18c1ec0e1e9.2 by 64aa7808-outbound-1.mta.getcheckrecipient.com id DF74BFB8-1299-4B4E-AF00-034EA93A46EF.1; Thu, 06 May 2021 14:27:04 +0000 Received: from FRA01-MR2-obe.outbound.protection.outlook.com by 64aa7808-outbound-1.mta.getcheckrecipient.com with ESMTPS id b18c1ec0e1e9.2 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384); Thu, 06 May 2021 14:27:04 +0000 ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Qb+x5pNfCO5234fmjID9egt/WZG+UQnnbIlgX+JPbjcPiXUt92KA9e99uPlDYr/+/hYL+LpEr3vJu87QYhb5/BpQwItJyp4HP8dZz3E7Ih/ouJvC98NFAQbtby+QnbYH6iQvshWinBgoVwhvoS4ms/n8NEThqDo/DgYNQEM8f/IQfuYlAF/DhStMBScNEWzSLRc0ih/WvHQWlY79xlGRmY3C4vPz+06rnYeb8fzqyTpzizlV6HUD6cSQxd+QONBw0GdBwBfvqKHWFRAGe4mWx3A/w2qXm8z/m3KRoIPKeNEWwui6s4WSsEdmN+mTnajo9gI8z4qscKVLtDc+H9t/Ig== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=VKEEUeH0DNqKlTimI8gmOaxO6WT2Z9q4MP8ASutml20=; b=NJKrE3hUZrwMaVmdUNR8sN2UvrfHv76WBnwoVHm7nYTOt7nObScwXFUGnnSJyk4AgauH7Mlg2Y6rnmnWtgJtlaRgBLPGvyfyikfAFcjFgVOPwk4wiAsU4iG/3g5tJK4epdMwzIGh1Pe0xusTWSNSRr/rUIxiRi/Wf91iOeB3MgH1TquceytNRbOiDAa4OIVwJI0jLoFsPmOJBrAWyH9gYetNcPy83XVWAt0Ja9ABxq2Irvb6uKEdRbJ3HOpQKAEElAh7Bjm8z5ce3/Q+cYVLrc3r8GdJTBi8fgtD1KT4H6QEW7HF6dOOujAAbSx11EYaQ/dTr4vP7vOR8m2wWwuRDg== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=arm.com; dmarc=pass action=none header.from=arm.com; dkim=pass header.d=arm.com; arc=none Authentication-Results-Original: fujitsu.com; dkim=none (message not signed) header.d=none;fujitsu.com; dmarc=none action=none header.from=arm.com; Received: from PA4PR08MB6320.eurprd08.prod.outlook.com (2603:10a6:102:e5::9) by PR2PR08MB4763.eurprd08.prod.outlook.com (2603:10a6:101:20::18) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4108.24; Thu, 6 May 2021 14:27:02 +0000 Received: from PA4PR08MB6320.eurprd08.prod.outlook.com ([fe80::c99f:671d:bb2c:f20b]) by PA4PR08MB6320.eurprd08.prod.outlook.com ([fe80::c99f:671d:bb2c:f20b%7]) with mapi id 15.20.4087.044; Thu, 6 May 2021 14:27:02 +0000 Date: Thu, 6 May 2021 15:26:59 +0100 From: Szabolcs Nagy To: "naohirot@fujitsu.com" Cc: 'Wilco Dijkstra' , 'GNU C Library' Subject: Re: [PATCH 0/5] Added optimized memcpy/memmove/memset for A64FX Message-ID: <20210506142658.GA9028@arm.com> References: Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.9.4 (2018-02-28) X-Originating-IP: [217.140.106.55] X-ClientProxiedBy: LO2P123CA0082.GBRP123.PROD.OUTLOOK.COM (2603:10a6:600:138::15) To PA4PR08MB6320.eurprd08.prod.outlook.com (2603:10a6:102:e5::9) MIME-Version: 1.0 X-MS-Exchange-MessageSentRepresentingType: 1 Received: from arm.com (217.140.106.55) by LO2P123CA0082.GBRP123.PROD.OUTLOOK.COM (2603:10a6:600:138::15) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.4108.25 via Frontend Transport; Thu, 6 May 2021 14:27:01 +0000 X-MS-PublicTrafficType: Email X-MS-Office365-Filtering-Correlation-Id: 39c27b54-2019-4ba4-2ed9-08d9109b094e X-MS-TrafficTypeDiagnostic: PR2PR08MB4763:|AM0PR08MB5442: X-MS-Exchange-Transport-Forked: True X-Microsoft-Antispam-PRVS: x-checkrecipientrouted: true NoDisclaimer: true X-MS-Oob-TLC-OOBClassifiers: OLM:2887;OLM:2887; X-MS-Exchange-SenderADCheck: 1 X-Microsoft-Antispam-Untrusted: BCL:0; X-Microsoft-Antispam-Message-Info-Original: CyKNCcLr/4dZHvjSLy63YNLdNQxjR57bt3OyrTDnCd/v5FTnq+wbTfYA1dSGKkYQHIeMYqQcJjbBhzVhNDSA02f3LZrYpGyjLdVk7OouJC1p11qYetFpnNJ5C4ximf7EA1Hnlm7I6y219JF3TTKpGhrDmLCiqR/qdCTp0UmAGhSnee/0NqBIBVLv8P4ZPs7424hufT67PoNt3V3ibCz9832cHaAJZbHz5lTcIt7eeKBUgccyvJEiYbmGWErmrLKCZLYuiPGOqlfrCCcV+i6KfY1fOj+7Dg4PsezGtx7h5efE1PV2K+dtTHvO7ffB4FnJxn4FGEFD8152vMDGaMl2pqvPkCFFV8OEEGyLpkmnj0dWrUFU6QWS9SZLS2zkInbDeIDHJdfYjxSSjopUxtFI7IJfY7oaVFfvr26s1t+qWqVQUZNqZ0zpzbSuZk9z/d7WwYlIGRkDVJiuKxLq1LyghlkaXyt3KP0OAZCB4sXQCGmgGw0qQDu79MPAICeCNTJ4Oie8ekBLt8QU9ZQgDC+boWNDr7lvT1pSKtq+YDGvb9JhldAyzHPBQKrLEw+mpWhzvtGIJuyouOzX6fHo81dSsfPV1Qk1xn02s8ZuzrTcmKkr/GBU1Kv5XqUkH8mNdQH5KSjBOPZ924dvkmf7sgg5E+KoRw2Cf+2WsEFTTTy37WQ0ro9C9QIEGq7aIrquSDdYDKodkcTaj0bbM7RyqhtQWmWz5JrM83Q9rIk1c6w1atQ= X-Forefront-Antispam-Report-Untrusted: CIP:255.255.255.255; CTRY:; LANG:en; SCL:1; SRV:; IPV:NLI; SFV:NSPM; H:PA4PR08MB6320.eurprd08.prod.outlook.com; PTR:; CAT:NONE; SFS:(4636009)(346002)(366004)(136003)(39850400004)(396003)(376002)(6916009)(86362001)(4326008)(26005)(2616005)(316002)(8886007)(478600001)(36756003)(1076003)(956004)(5660300002)(186003)(966005)(16526019)(54906003)(66946007)(8936002)(7696005)(52116002)(66556008)(8676002)(2906002)(66476007)(55016002)(38350700002)(38100700002)(33656002); DIR:OUT; SFP:1101; X-MS-Exchange-AntiSpam-MessageData: =?utf-8?B?NTJuVU1oWk9MS2FQN2FPNXVCcndJWWpNbFFwTUxxTjk0MXhwYWhqUjJyZndH?= =?utf-8?B?RiswL2hCbEwxNHZvUG5nWlA0Mm9LT253TllYamlWdVlwOW9hVnBFeFVlNUlx?= =?utf-8?B?TlRJRkd1dWFQSGZOcXYzdzZrS0p6Mk9ncUp3SFNNWmtTdVNOT0xQN3N4SHlM?= =?utf-8?B?NDUyd3cycXA0RlNyL25kWllhSTNleEp5U3dPdzJjVnM5RVVaSHI3RXY2UURJ?= =?utf-8?B?VFJRWkI5YTBzL2VVVHNsYU42U1BQT0J6bFl1bXBXbXZLa3VKV0RRNXh0S1VM?= =?utf-8?B?dGdFWTBJR0RoMHpuY0hGbmxSMXB1VVN2akdNQkNWZzlIWStzUjZqbmNDdVRx?= =?utf-8?B?Qjk4TFhUeXdkWVNCdG5EbVFzUjQ1Y0IrelUrK1RrM2U5YWdlMFM4MGZ3VGdr?= =?utf-8?B?RjJOMVJWQmpYMTVyM1R3NWEyUUU0OGdYenhyQjNMTXYwV3FBMFhtV3dPY1oy?= =?utf-8?B?NnBwQlBCRFM1Q0twSy9LRHRxNkFKWHpvdlQ5QkFtOWRqUEk0WUdTcVhieU9H?= =?utf-8?B?NENhb3dqNXBPWFNHWm81dGZvZWZFOHRJc0p4YU94Zi8vQldKa2Fqa0tmd2xQ?= =?utf-8?B?N3B0eW5mLzJHT2x6enhBMC8zYW82cEN0TEFjcDhjRlpsbTZwWUt1ZDcycHYy?= =?utf-8?B?NFVXSW5iM2dzTzM3bG9Pem13QXZ0T0pSMFZSUWlkMFFaMDV2ejdXaStsT2tG?= =?utf-8?B?MDNXUWFlZEJYQ3pFSCtHV1NPZlFma2J6bE40RE96OWZnZnE4NXQ3R3RveWd4?= =?utf-8?B?Z2dDVS94aERkSjZoK0d2ZkRhWFdYODI5djEvdnNqTjJBS0xDemFXUHNvbStk?= =?utf-8?B?UElFcVFRT0lrTld6MXFkWlRsdEo1bkcvYisvSC90VlVxV2MzRmdvcW94OUQw?= =?utf-8?B?MkRRVlVXdjdzWkI5S1dEWVVySU02TXIxR2NvSjVsVU44SkdDM0hKMkVTd1M3?= =?utf-8?B?ZndkZXdWRTNOd0pnaVlZUmR3QWZxc0lPU0wwQlhJbUxESG9rSmVyRnROWmJW?= =?utf-8?B?cW1pSUxWRjJjZ25XdlR1N3BqdzhOazQwdWpadTdEZ3Vyd2dFbU0vQjk2eUNL?= =?utf-8?B?TTlLV0MrQWRWckpqTGkvTVRBMmJSSUtJckVWdGVLekluSmk5L0E2d0VYTFlN?= =?utf-8?B?WmZvd1ZCcjB3VTd6a0RXSEtPWlBYQ2FXS3d3RFRJQXc1ZVk2a2g2WDkrVldh?= =?utf-8?B?QkJkTWtDaThON08xMkt4Q1FOd0ZBWUl2OTRoNGNWWWt1UE9lT1lyNkx4SHRN?= =?utf-8?B?eUtRNUpmVUxPTm1XZ2xlMjY3VGdzVUJwUFhBMFBzQ2RPQkgxcE9xWTBsem9o?= =?utf-8?B?MDlQbm1oL1RGWm9qV0NlWm4vZEdoamFZQkVSeERHS29SUTZyOFA4aExsVmFX?= =?utf-8?B?QndFenExeWN1L0lHSzA0VUZZZWUwR0VQNWgxa3NDamRXTWJLVmVlVXNaeFZx?= =?utf-8?B?Y2dsUktwREErbGJsYnRXNzA2dTZ4bzJiVWxxZ1NKSVFCeWVWcFE2RHBJaVpj?= =?utf-8?B?bU1ScERFZzNmdlpLNkNBRUVTMklvSEZlTW9naVZsSW5rSFYzYmVyZnptQnpG?= =?utf-8?B?bnBrK3FjUDIvR0QrY2J6M2tWdlBUT2d2OTZFZXZJZG5YdkxBODZPekJNSnRK?= =?utf-8?B?eXRRSXRwdEpkQTBoUVlPRTB3RXJXSXQ4MkJMUDhmMGY5WnBGUUFPVitoMGVa?= =?utf-8?B?a2FWdEFCV2lvb2NwMXVCYXVZeVF3enJ2QXFndW1jc05WLzI1Wlp4YkM2MXFK?= =?utf-8?Q?AUPAfIMDHGadhAS5tvGFUiSuxN2mJqJbpmem+ia?= X-MS-Exchange-Transport-CrossTenantHeadersStamped: PR2PR08MB4763 Original-Authentication-Results: fujitsu.com; dkim=none (message not signed) header.d=none;fujitsu.com; dmarc=none action=none header.from=arm.com; X-EOPAttributedMessage: 0 X-MS-Exchange-Transport-CrossTenantHeadersStripped: VE1EUR03FT043.eop-EUR03.prod.protection.outlook.com X-MS-Office365-Filtering-Correlation-Id-Prvs: 5dac3565-ba2c-47d6-8ae0-08d9109b03cc X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: cz1SaAEuIRJ2+jw7DrCF1fKvlo8i2j0qJJ099Oed4xD5zaEzFrVIeo2Q0WVTtRarFYo1/mpvBTYnKxF1PgZuMpiA6q8Tb9z5e0F+X0OkXRpJPYdAUz0aKYgBbWJPr7eYMhXP1UBfd/oS70Aa+ejSbaJv349Ervyt7zzWREW/bboydWaXrLdNxNjVJo3Ned4p7ODLpi0u8nVCIMEX7JNNkw7xTzHTIBIDwkPgH9qykAi326Iho11gBId4EtLwWmLpGMpu3qslP8Zwqee7NTgyOTmWT77P0Mvb/fN2riGFUyvWxd8X9Vb+qS63KBbSaPF6AwrOtETVNXEQLLpb1RMQqP/j1UZqZBaMNWJDSOgcyAn0CkwDQ0dH/+jnhwz6uLe9rnWKzBZ3aqSffEmPjeBe4GgjKLjOHa1yV20iDnOG33+pXOaXkRjYac4rNYhgsZeJQdkMNeTMRh3kvG5NWwjgneMM0WsyOmlgCDM75P1dg7wB/IowLzE+E8udN1ylGKg5PSFQPVOCYNZToRSZhIfvj/OsHSG8Q3weWyTm2uHvabYfy6FEgehNKX43j9oniGeNyd6qRIzT8YpjuW95J4B/hI1GpkWhu6jFJIRSrx4nwwKBlyYntrX+ajQmHe3gG8MXqcFhOQrYzDK0fHkFJ9H8HlCRr1/N5/OYdkcuzcGpWBrKnSxsu2tl/Fb4UQHivkZ41m1stlr8gumokMcyS5uRa4RzdvI1HE0YKLJpD4vAK7E= X-Forefront-Antispam-Report: CIP:63.35.35.123; CTRY:IE; LANG:en; SCL:1; SRV:; IPV:CAL; SFV:NSPM; H:64aa7808-outbound-1.mta.getcheckrecipient.com; PTR:ec2-63-35-35-123.eu-west-1.compute.amazonaws.com; CAT:NONE; SFS:(4636009)(396003)(39850400004)(346002)(376002)(136003)(36840700001)(46966006)(356005)(956004)(82740400003)(5660300002)(16526019)(186003)(2906002)(70206006)(70586007)(47076005)(26005)(2616005)(86362001)(1076003)(478600001)(81166007)(336012)(8936002)(8676002)(8886007)(316002)(966005)(55016002)(33656002)(4326008)(36860700001)(6862004)(36756003)(82310400003)(7696005)(54906003); DIR:OUT; SFP:1101; X-OriginatorOrg: arm.com X-MS-Exchange-CrossTenant-OriginalArrivalTime: 06 May 2021 14:27:11.0440 (UTC) X-MS-Exchange-CrossTenant-Network-Message-Id: 39c27b54-2019-4ba4-2ed9-08d9109b094e X-MS-Exchange-CrossTenant-Id: f34e5979-57d9-4aaa-ad4d-b122a662184d X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=f34e5979-57d9-4aaa-ad4d-b122a662184d; Ip=[63.35.35.123]; Helo=[64aa7808-outbound-1.mta.getcheckrecipient.com] X-MS-Exchange-CrossTenant-AuthSource: VE1EUR03FT043.eop-EUR03.prod.protection.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Anonymous X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR08MB5442 X-Spam-Status: No, score=-8.1 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, MSGID_FROM_MTA_HEADER, RCVD_IN_DNSWL_LOW, RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS, TXREP, UNPARSEABLE_RELAY autolearn=ham autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 06 May 2021 14:27:15 -0000 The 05/06/2021 10:01, naohirot@fujitsu.com wrote: > > From: Wilco Dijkstra > > > [1] > > > https://github.com/NaohiroTamura/glibc/commit/c582917071e76cfed84fafb0 > > > c82cb70339294386 > > > > I meant using an actual NOP in the #else case so that existing string functions > > won't change. Also note the #defines in the #if and #else need to be indented. > > > > I've read the mail thread regarding BTI, but I think I couldn't fully understand the > problem. BTI seems available from ARMv8.5, and A64FX is ARMv8.2. > Even though distro distributed BTI enabled binary, BTI doesn't work on A64FX. > So BTI_J macro can be removed from A64FX IFUNC code at least, because A64FX > IFUNC code is executed only on A64FX. > Are we discussing the BTI_C code which is not in IFUNC code? BTI_C at function entry. the slowdown you showed with bti c at function entry should not be present with a plain nop. this means a64fx implemented hint space instructions (such as bti c) slower than plain nops, which is not expected and will cause slowdowns with distros that try to distribute binaries with bti c, this problem goes beyond string functions.