From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-ed1-x52b.google.com (mail-ed1-x52b.google.com [IPv6:2a00:1450:4864:20::52b]) by sourceware.org (Postfix) with ESMTPS id 5AE5A3857C55; Thu, 16 Mar 2023 08:57:17 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 5AE5A3857C55 Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=gmail.com Received: by mail-ed1-x52b.google.com with SMTP id x3so4550056edb.10; Thu, 16 Mar 2023 01:57:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; t=1678957036; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=wjs9plfxx4bmTKbgZ88msQPCed8WyYGTUKHZZtt2mZQ=; b=g/zxav7hva6fbzEqPINVsuHSjj+oniHpUA3sgB83hp13DNb9f2r5EwGO0biUmtHGeg MEfq3gVQPWfHl18V+DRWR0InPafEvDPAklW4O2Q9uRMl5m/NwQ9s5sFjAsUpn1yvXsO4 mtXF7zGgJTGA8+ADxEybpFESNXISrcD5dYZGOfXjTc8lAjP9Vi5yWUJmCNcuKvHN6uo9 lCkms2Ryl0DTbl9PvKIrqG3T5f0D2knB7iaUhZ+Pau2JI+NrmP03w6AoMNLa4+UCIsAy BFxSaH8JaWUl4yQWyviPu9QdWTDHM7zALNL4ODYdNr/qxgdNiVlA9euGPPTuff0uEx4Q Pseg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; t=1678957036; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=wjs9plfxx4bmTKbgZ88msQPCed8WyYGTUKHZZtt2mZQ=; b=bJunQdM98RmbIMgXel2kmiChIIp2s/M2Hy6yDu3KIit+O79Mn1fS85RJKFReX8qeMF x6avNZXEKnWfr0yOT2YnUeBR6/UgaBMkej7i0aq2dw+Xo3FHyDDm/C504giImOSdhlKh JkFua6xeVO9h+tIZaR5Vtx90g5pbBEerGMSlPOnV3yhH2IoJ8Q/NbZu1qpAqIk+l5F58 1D6EsVlPs+7YUrAnlsPyD8K2Hu/QIjdPJHZ1v6qSvnGYtztPOkhhC48CX3iRi+f5kkl5 +9PzVDjp58dUYoNrdYgNL7M6NUb+OpwS98hizzY6d2XL951cqThkwFiQvOKbqcTz+AfC nt8A== X-Gm-Message-State: AO0yUKWFhLOF/BI4JY0TIKCzVjWGEtg5EJ/sS4ZrHRxkRSIx2JA2uzZ2 Oml4+mmN7UUOXyNfvhsjkl77YgfDh5PkSh9jxZg= X-Google-Smtp-Source: AK7set8pL0NMC+n0w4XtSm/GRAmMfP2Nta5gkBiMuvQquYUzfXq+yl4g+BH7jQDi4kQPZNYeAmdds8qKUtocfCY19YQ= X-Received: by 2002:a17:906:4c86:b0:930:af80:5ba6 with SMTP id q6-20020a1709064c8600b00930af805ba6mr390212eju.1.1678957035804; Thu, 16 Mar 2023 01:57:15 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: Raiki Tamura Date: Thu, 16 Mar 2023 17:57:04 +0900 Message-ID: Subject: Re: [GSoC] gccrs Unicode support To: Jakub Jelinek Cc: Arthur Cohen , David Edelsohn , Philip Herron , gcc@gcc.gnu.org, gcc-rust@gcc.gnu.org Content-Type: multipart/alternative; boundary="000000000000b65a0705f700a5f9" X-Spam-Status: No, score=-0.4 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_ENVFROM_END_DIGIT,FREEMAIL_FROM,HTML_MESSAGE,KAM_SHORT,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS,TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: --000000000000b65a0705f700a5f9 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sorry for resending this email. I forgot using =E2=80=9CReply All=E2=80=9D. Thank you for your response, Arsen and Jakub. I did not know C++ also supports Unicode identifiers. I looked a little into C++ and found C++ accepts the same form of identifiers as Rust. So I will do further investigation of libcpp with the hope that it can also be used in the Rust frontend. Raiki Tamura On Thu, Mar 16, 2023 at 0:18 Jakub Jelinek wrote: > On Wed, Mar 15, 2023 at 11:00:19AM +0000, Philip Herron via Gcc wrote: > > Excellent work on getting up to speed on the rust front-end. From my > > perspective I am interested to see what the wider GCC community thinks > > about using https://www.gnu.org/software/libunistring/ library within > GCC > > instead of rolling our own, this means it will be another dependency on > GCC. > > > > The other option is there is already code in the other front-ends to do > > this so in the worst case it should be possible to extract something out > of > > them and possibly make this a shared piece of functionality which we can > > mentor you through. > > I don't know what exactly Rust FE needs in this area, but e.g. libcpp > already handles whatever C/C++ need from Unicode support POV and can hand= le > it without any extra libraries. > So, if we could avoid the extra dependency, it would be certainly better, > unless you really need massive amounts of code from those libraries. > libcpp already e.g. provides mapping of unicode character names to code > points, determining which unicode characters can appear at the start or > in the middle of identifiers, etc. > > Jakub > > --000000000000b65a0705f700a5f9--