From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mout-p-202.mailbox.org (mout-p-202.mailbox.org [80.241.56.172]) by sourceware.org (Postfix) with ESMTPS id CD0333858D39; Wed, 15 Mar 2023 15:05:05 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org CD0333858D39 Authentication-Results: sourceware.org; dmarc=pass (p=reject dis=none) header.from=aarsen.me Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=aarsen.me Received: from smtp102.mailbox.org (smtp102.mailbox.org [10.196.197.102]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-384) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by mout-p-202.mailbox.org (Postfix) with ESMTPS id 4PcDG64cLRz9syr; Wed, 15 Mar 2023 16:05:02 +0100 (CET) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=aarsen.me; s=MBO0001; t=1678892702; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=RTVnFi7UYOGL6MsVD/8yXIJReq4hcOj/a7hQFVItijo=; b=BGsKNLSipBYlQKiIdImLne8vJm4rgxoFbdfmQPvWJ4xlA4X2S2qv1yEW/JS7U0bnp41BO/ aWkyGdO/tSPsBLS0BW/U5trA9WxvSq0W+SLbpA/SW+HCBkV8c1uedBUOCAfsN/hIgdpvUV d6m425946dhWiHzDvOPM1AflcfgGvDkHsVdhqAQlDBeOq2rqh8p2ux4HBjx378VVelfyB6 WQYR6QXDdvx2Yx+SQjC4pKmfCV5VrUi5zHm9et0Gh/DFHV+TNrYhn5ahP2R1lbfE/PFEMy 4HHHQP0EdYUQBR4Ovg8VWzkNO9CS2sNn+PThEv4lJqD0QtBiCgC0WBDRLN6nYw== References: From: Arsen =?utf-8?Q?Arsenovi=C4=87?= To: Philip Herron Cc: Raiki Tamura , gcc-rust@gcc.gnu.org, David Edelsohn , Arthur Cohen , gcc@gcc.gnu.org Subject: Re: [GSoC] gccrs Unicode support Date: Wed, 15 Mar 2023 15:53:38 +0100 In-reply-to: Message-ID: <87lejykqcn.fsf@aarsen.me> MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" X-Spam-Status: No, score=-4.6 required=5.0 tests=BAYES_00,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,KAM_INFOUSMEBIZ,KAM_SHORT,RCVD_IN_DNSWL_LOW,SPF_HELO_NONE,SPF_PASS,TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org List-Id: --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Philip Herron via Gcc writes: > Hi Raiki Welcome, Raiki! > Excellent work on getting up to speed on the rust front-end. From my > perspective I am interested to see what the wider GCC community thinks > about using https://www.gnu.org/software/libunistring/ library within GCC > instead of rolling our own, this means it will be another dependency on G= CC. As my $0.02, it is likely best not to create yet another re-implementation. There's already precedent for including dependencies that can do a very complex job well, like GMP and MPFR. Text handling is deceivingly simple, and in practice, nobody seems to get it fully right. The effort is minimized, and yet most effectively shared, if done in a library. (note: I don't have a horse in the race wrt which specific library to use, as I'm no expert, but I suspect libunistring could work well) Have a wonderful day! > The other option is there is already code in the other front-ends to do > this so in the worst case it should be possible to extract something out = of > them and possibly make this a shared piece of functionality which we can > mentor you through. > > Thanks > > --Phil > > On Mon, 13 Mar 2023 at 16:19, Raiki Tamura via Gcc wrot= e: > >> Hello, >> >> My name is Raiki Tamura, an undergraduate student at Kyoto University in >> Japan and I want to work on Unicode support in gccrs this year. >> I have already written my proposal (linked below) and shared it with the >> gccrs team in Zulip. >> In the project, I am planning to use the GNU unistring library to handle >> Unicode characters and the GNU IDN library to normalize identifiers. >> According to my potential mentor, it would provide Unicode libraries for >> all frontends in GCC. If there are concerns or feedback about this, plea= se >> tell me about it. >> Thank you. >> >> Link to my proposal: >> >> https://docs.google.com/document/d/1MgsbJMF-p-ndgrX2iKeWDR5KPSWw9Z7onsHI= iZ2pPKs/edit?usp=3Dsharing >> >> Raiki Tamura >> =2D-=20 Arsen Arsenovi=C4=87 --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iIYEARYKAC4WIQT+4rPRE/wAoxYtYGFSwpQwHqLEkwUCZBHemBAcYXJzZW5AYWFy c2VuLm1lAAoJEFLClDAeosST72sA/RFk2eYdStlNxDoQnbmOx5GCh8rjgzEkNF32 TfP3FSIKAQDVkjw9bLXG14R5azHh5uZkZ3gO8IyZCHEtoFT5nQR3AA== =9sop -----END PGP SIGNATURE----- --=-=-=--