From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-il1-x129.google.com (mail-il1-x129.google.com [IPv6:2607:f8b0:4864:20::129]) by sourceware.org (Postfix) with ESMTPS id 9FEEB3858C83 for ; Mon, 28 Feb 2022 20:59:39 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 9FEEB3858C83 Received: by mail-il1-x129.google.com with SMTP id e11so11035232ils.3 for ; Mon, 28 Feb 2022 12:59:39 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version; bh=RSvcMC6Ibbyr/8zBvuNtNF83uHNVFHRku1fR5EP8EAE=; b=ywmL5WdW1yTZADhotvDQCOVlLRN5p7JyGpVDzK+onkMp/4/+POwyGvMGPbsnK0QEBf f3UJqxmO3G/WBB52Wpay4a4JEDXe9uzXVp9vdm6dQAx63I0BNb9OO7JTvnRCL/UGUKIt ul2IIzVgGZVCmpgpxPhlbsUqDB9nPSnt7o8gmqqlAWzTf3Q0ayQ5zpKdcOQG5aqvfBMY wWdFm5Mm80BQHOoBBvyG8QQbWn5Pl1wyVn5mb+1VDTJsMJOEQ1ZT8JRu5o/+WhLhO01K gdItLZCT84Fudy33KW/ngsOYz0OCll1NIW+vo9RRLzuNEsMNRQ0FMrla2IJRIWagHpWf 5tWw== X-Gm-Message-State: AOAM532HNfsvyxzAx36kxUCNHcPFfwusyu5UU3s/XbzTEH3H3o/2CN4G lA3/eunCFsdQ2VX+DzbRYDmEgOS4xEUrJA== X-Google-Smtp-Source: ABdhPJwq8wnAUWSLlkm6St9q8oCmbL5rAsjiiPCdGVtsenNZ3pW+1ik6fZVjqVlgsHpDO6NMQw1XoQ== X-Received: by 2002:a05:6e02:1ca8:b0:2c3:6342:681e with SMTP id x8-20020a056e021ca800b002c36342681emr1819106ill.59.1646081979059; Mon, 28 Feb 2022 12:59:39 -0800 (PST) Received: from murgatroyd (75-166-141-253.hlrn.qwest.net. [75.166.141.253]) by smtp.gmail.com with ESMTPSA id x14-20020a927c0e000000b002c244d8dcc8sm6618621ilc.42.2022.02.28.12.59.38 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 28 Feb 2022 12:59:38 -0800 (PST) From: Tom Tromey To: Eli Zaretskii Cc: Tom Tromey , gdb-patches@sourceware.org Subject: Re: [PATCH 5/5] Handle non-ASCII identifiers in Ada References: <20220228183304.1162089-1-tromey@adacore.com> <20220228183304.1162089-6-tromey@adacore.com> <83y21ulsuo.fsf@gnu.org> X-Attribution: Tom Date: Mon, 28 Feb 2022 13:59:37 -0700 In-Reply-To: <83y21ulsuo.fsf@gnu.org> (Eli Zaretskii's message of "Mon, 28 Feb 2022 20:59:59 +0200") Message-ID: <87lexulnba.fsf@tromey.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Status: No, score=-5.6 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.4 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on server2.sourceware.org X-BeenThere: gdb-patches@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Gdb-patches mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 28 Feb 2022 20:59:40 -0000 >>>>> "Eli" == Eli Zaretskii writes: >> Date: Mon, 28 Feb 2022 11:33:04 -0700 >> From: Tom Tromey via Gdb-patches >> Cc: Tom Tromey >> >> +for c in range(0, 0x10FFFF): >> + process_codepoint(c) Eli> This script assumes that the version of Python which will run it is Eli> up-to-date with the latest Unicode Character Database (UCD), right? Eli> Is that a good assumption? Wouldn't it be better to process the UCD Eli> from the latest Unicode Standard directly? Ordinarily, yes, but in practice the Ada compiler uses quite old data, and so whatever is provided by a recent-ish Python is more than good enough. If the Ada compiler is changed, I'll update the script. I suspect this won't happen, though. Tom