From: Tom Tromey
To: Patrick Monnerat
Cc: Tom Tromey, Patrick Monnerat via Gdb-patches
Subject: Re: [PATCH] gdb: add UTF16/UTF32 target charsets in phony_iconv
Date: Mon, 10 Oct 2022 10:11:38 -0600
Message-ID: <87zge3irph.fsf@tromey.com>
In-Reply-To: <2f10efe4-1095-b620-ea1c-08cc047c45c4@monnerat.net>
	(Patrick Monnerat's message of "Sun, 9 Oct 2022 02:47:18 +0200")
References: <20221002140010.106238-1-patrick@monnerat.net>
	<87k05bs8c5.fsf@tromey.com>
	<0a978271-3085-8bf3-f5fd-6a0b3f9f3ea2@monnerat.net>
	<874jwejgbb.fsf@tromey.com>
	<2f10efe4-1095-b620-ea1c-08cc047c45c4@monnerat.net>

Patrick> This describes the particular case of Solaris. Are there other OSes
Patrick> with similar implementations?

I don't know.
It's not impossible, since the encoding of wchar_t isn't specified.

Patrick> Totally agreed: we need to have something more "predictable". UTF-32
Patrick> seems a good choice, but the endian problem should still be resolved.
Patrick> Should it be fixed (UTF-32[BL]E) or machine dependent? Both have pros
Patrick> and cons. We could have a class implementing those chars + their
Patrick> ctype-like methods and even a basic_string instance subclass
Patrick> supporting conversions.

It's simplest if the characters are just scalars -- that is, in the host
ordering.  I don't think there's any need for a string type yet.

Patrick> I nevertheless don't have any idea what is the amount of work required
Patrick> to change this.

I did most of it already.

Patrick> For the particular case of Solaris, did things changed nowadays and
Patrick> how old versions should be supported?

I don't know, but it wouldn't be important any more, because we'd no
longer require any way to convert to wchar_t -- gdb simply wouldn't use
wchar_t any more.

Patrick> In the short and middle terms, I think the current patch is still
Patrick> useful: it immediately (and dirtily!) solves the problem introduced by
Patrick> Ada support and will allow a smooth and gentle UTF-32 transition until
Patrick> reaching a situation where phony_iconv can be dropped.

I suspect we should probably move forward with your patch for GDB 13,
and then switch to my patch for GDB 14.  My reasoning here is just that
requiring iconv is a change that we may not want to spring on people so
late in the process.

Tom