Subject: Re: rust frontend and
 UTF-8/unicode processing/properties
From: Manuel López-Ibáñez
To: mark@klomp.org, "gcc@gnu.org"
Date: Thu, 29 Jul 2021 21:46:14 +0200
List-Id: Gcc mailing list

> For the gcc rust frontend I was thinking of importing a couple of
> gnulib modules to help with UTF-8 processing, conversion to/from
> unicode codepoints and determining various properties of those
> codepoints. But it seems gcc doesn't yet have any gnulib modules
> imported, and maybe other frontends already have helpers to this that
> the gcc rust frontend could reuse.

Although I agree that factoring out the code in libcpp so that it can be
used by other FEs would be great and in line with the goals of
https://gcc.gnu.org/wiki/ModularGCC
that is a significant amount of work.

Importing gnulib has its own advantages, and it would allow GCC to
finally deprecate libiberty:
https://gcc.gnu.org/wiki/replacelibibertywithgnulib

There is a preliminary patch here:
https://gcc.gnu.org/legacy-ml/gcc-patches/2016-08/msg01554.html

Cheers,

Manuel.