From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <aldyh@redhat.com>
Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124])
	by sourceware.org (Postfix) with ESMTPS id 9ED1A385695F
	for <gcc-patches@gcc.gnu.org>; Mon, 31 Oct 2022 08:42:53 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 9ED1A385695F
Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=redhat.com
Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=redhat.com
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com;
	s=mimecast20190719; t=1667205773;
	h=from:from:reply-to:subject:subject:date:date:message-id:message-id:
	 to:to:cc:cc:mime-version:mime-version:content-type:content-type:
	 in-reply-to:in-reply-to:references:references;
	bh=4XWMxwsmDMp7saG+BL+gGbKGQUg6hT5/E4lYJYDGuH8=;
	b=Bwf8bbACKaiVzqRUuMFJajKrikz9NVY23A56RZAqw64nZejaORCNIGa0tka/lRi4bPIFmg
	TLCQkCYK/0laLVHhVISOX57iUGqU5T3fvzfhihPbVNeHqANlotXdDvpmX4YYeEQToLi4WC
	PbM7G+arUvcy8nX0qwCVHbrG5Z5/+Ao=
Received: from mail-yb1-f199.google.com (mail-yb1-f199.google.com
 [209.85.219.199]) by relay.mimecast.com with ESMTP with STARTTLS
 (version=TLSv1.3, cipher=TLS_AES_128_GCM_SHA256) id
 us-mta-156-DmsKXzTgPMa0Mx2_1tbPZA-1; Mon, 31 Oct 2022 04:42:49 -0400
X-MC-Unique: DmsKXzTgPMa0Mx2_1tbPZA-1
Received: by mail-yb1-f199.google.com with SMTP id v81-20020a252f54000000b006cb87d7042cso9930072ybv.20
        for <gcc-patches@gcc.gnu.org>; Mon, 31 Oct 2022 01:42:49 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20210112;
        h=cc:to:subject:message-id:date:from:in-reply-to:references
         :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id
         :reply-to;
        bh=4XWMxwsmDMp7saG+BL+gGbKGQUg6hT5/E4lYJYDGuH8=;
        b=dWeHrGqhcWDXGYcYUtAvwtDZNGNcWDJckCnoG5OaoNlMt2ttnhm++x+yyVTJTYFrth
         u+pWjn/8pLniCRYPGmlHN33jc94OEhrwH+Pqo9lWcrgwYp8QUMNQ2MrKAv62jN00ZKLS
         MXyufUrqr6O+hOCIvPhBQcxNEuKvzkwUrOujXQrqkjYbRxs9VQoQnYJWpskFXcL/4utm
         VvcbT3i44lcaqMMt1az94YaFiwhsE6pBpgv11fFkEj59J5YEuBObmW8dqsjiqoEvA5xy
         0Rg1mk9NI9xpR0Nzbfxc1wpj/jlcdnHnHJvP4WEqZn16axTU1Xh7WxiU8dQJqVfTpeqS
         dqSA==
X-Gm-Message-State: ACrzQf1oSh7Bq5MnM/gGXUfNo1Og6QFKdYxwD0QbY38GHVm2L5Kh78DR
	fiHVYwwhi5nkqersmMXH83lTXv27xa7nYWUhZUO7t2EkprQgW3fT5ATzFrBE/9olryaC3vzscS4
	OVZ+bjKadTbEa+QcwWcj4ouyjf84zbxbkXQ==
X-Received: by 2002:a0d:cdc2:0:b0:34d:101b:53c with SMTP id p185-20020a0dcdc2000000b0034d101b053cmr12371721ywd.444.1667205769437;
        Mon, 31 Oct 2022 01:42:49 -0700 (PDT)
X-Google-Smtp-Source: AMsMyM4426u56KQFjhKNTZ9y14ayqL59BnGEhJnc4ZSRm6N1RAGsORyQyO/vuOq1gEPKGNiWEcql/4LscfoYQ4a0WPY=
X-Received: by 2002:a0d:cdc2:0:b0:34d:101b:53c with SMTP id
 p185-20020a0dcdc2000000b0034d101b053cmr12371708ywd.444.1667205769178; Mon, 31
 Oct 2022 01:42:49 -0700 (PDT)
MIME-Version: 1.0
References: <20221013123649.474497-1-aldyh@redhat.com> <Y0hRjKccB1XjeTC0@tucnak>
 <CAGm3qMXCqT2r97o-tD8LfNa==z0zT-NSM1539rqS48ShknmhzQ@mail.gmail.com> <CAGm3qMXK5CiauPUhOKo0yW8CCUXnq=4xhb15LNMxvyzzSEwZYg@mail.gmail.com>
In-Reply-To: <CAGm3qMXK5CiauPUhOKo0yW8CCUXnq=4xhb15LNMxvyzzSEwZYg@mail.gmail.com>
From: Aldy Hernandez <aldyh@redhat.com>
Date: Mon, 31 Oct 2022 09:42:38 +0100
Message-ID: <CAGm3qMWJvn5zU0hvS5+XZNR3C39NnevktHsG4a38LBey7s-7XQ@mail.gmail.com>
Subject: Re: [PATCH] [PR24021] Implement PLUS_EXPR range-op entry for floats.
To: Jakub Jelinek <jakub@redhat.com>
Cc: GCC patches <gcc-patches@gcc.gnu.org>, Andrew MacLeod <amacleod@redhat.com>
X-Mimecast-Spam-Score: 0
X-Mimecast-Originator: redhat.com
Content-Type: multipart/alternative; boundary="000000000000a3c57905ec5097cc"
X-Spam-Status: No, score=-5.9 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH,DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,HTML_MESSAGE,RCVD_IN_DNSWL_NONE,RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_NONE,TXREP autolearn=ham autolearn_force=no version=3.4.6
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org
List-Id: <gcc-patches.gcc.gnu.org>

--000000000000a3c57905ec5097cc
Content-Type: text/plain; charset="UTF-8"

Ping ping

On Mon, Oct 24, 2022, 08:04 Aldy Hernandez <aldyh@redhat.com> wrote:

> PING
>
> On Mon, Oct 17, 2022 at 8:21 AM Aldy Hernandez <aldyh@redhat.com> wrote:
> >
> > On Thu, Oct 13, 2022 at 7:57 PM Jakub Jelinek <jakub@redhat.com> wrote:
> > >
> > > On Thu, Oct 13, 2022 at 02:36:49PM +0200, Aldy Hernandez wrote:
> > > > +// Like real_arithmetic, but round the result to INF if the
> operation
> > > > +// produced inexact results.
> > > > +//
> > > > +// ?? There is still one problematic case, i387.  With
> > > > +// -fexcess-precision=standard we perform most SF/DFmode arithmetic
> in
> > > > +// XFmode (long_double_type_node), so that case is OK.  But without
> > > > +// -mfpmath=sse, all the SF/DFmode computations are in XFmode
> > > > +// precision (64-bit mantissa) and only occassionally rounded to
> > > > +// SF/DFmode (when storing into memory from the 387 stack).  Maybe
> > > > +// this is ok as well though it is just occassionally more precise.
> ??
> > > > +
> > > > +static void
> > > > +frange_arithmetic (enum tree_code code, tree type,
> > > > +                REAL_VALUE_TYPE &result,
> > > > +                const REAL_VALUE_TYPE &op1,
> > > > +                const REAL_VALUE_TYPE &op2,
> > > > +                const REAL_VALUE_TYPE &inf)
> > > > +{
> > > > +  REAL_VALUE_TYPE value;
> > > > +  enum machine_mode mode = TYPE_MODE (type);
> > > > +  bool mode_composite = MODE_COMPOSITE_P (mode);
> > > > +
> > > > +  bool inexact = real_arithmetic (&value, code, &op1, &op2);
> > > > +  real_convert (&result, mode, &value);
> > > > +
> > > > +  // If real_convert above has rounded an inexact value to towards
> > > > +  // inf, we can keep the result as is, otherwise we'll adjust by 1
> ulp
> > > > +  // later (real_nextafter).
> > > > +  bool rounding = (flag_rounding_math
> > > > +                && (real_isneg (&inf)
> > > > +                    ? real_less (&result, &value)
> > > > +                    : !real_less (&value, &result)));
> > >
> > > I thought the agreement during Cauldron was that we'd do this always,
> > > regardless of flag_rounding_math.
> > > Because excess precision (the fast one like on ia32 or -mfpmath=387 on
> > > x86_64), or -frounding-math, or FMA contraction can all increase
> precision
> > > and worst case it all behaves like -frounding-math for the ranges.
> > >
> > > So, perhaps use:
> > >   if ((mode_composite || (real_isneg (&inf) ? real_less (&result,
> &value)
> > >                                             : !real_less (&value,
> &result))
> > >       && (inexact || !real_identical (&result, &value))))
> >
> > Done.
> >
> > > ?
> > > No need to do the real_isneg/real_less stuff for mode_composite, then
> > > we do it always for inexacts, but otherwise we check if the rounding
> > > performed by real.cc has been in the conservative direction (for upper
> > > bound to +inf, for lower bound to -inf), if yes, we don't need to do
> > > anything, if yes, we frange_nextafter.
> > >
> > > As discussed, for mode_composite, I think we want to do the extra
> > > stuff for inexact denormals and otherwise do the nextafter
> unconditionally,
> > > because our internal mode_composite representation isn't precise
> enough.
> > >
> > > > +  // Be extra careful if there may be discrepancies between the
> > > > +  // compile and runtime results.
> > > > +  if ((rounding || mode_composite)
> > > > +      && (inexact || !real_identical (&result, &value)))
> > > > +    {
> > > > +      if (mode_composite)
> > > > +     {
> > > > +       bool denormal = (result.sig[SIGSZ-1] & SIG_MSB) == 0;
> > >
> > > Use real_isdenormal here?
> >
> > Done.
> >
> > > Though, real_iszero needs the same thing.
> >
> > So... real_isdenormal() || real_iszero() as in the attached patch?
> >
> > >
> > > > +       if (denormal)
> > > > +         {
> > > > +           REAL_VALUE_TYPE tmp;
> > >
> > > And explain here why is this, that IBM extended denormals have just
> > > DFmode precision.
> >
> > Done.
> >
> > > Though, now that I think about it, while this is correct for denormals,
> > >
> > > > +           real_convert (&tmp, DFmode, &value);
> > > > +           frange_nextafter (DFmode, tmp, inf);
> > > > +           real_convert (&result, mode, &tmp);
> > > > +         }
> > >
> > > there are also the cases where the higher double exponent is in the
> > > [__DBL_MIN_EXP__, __LDBL_MIN_EXP__] aka [-1021, -968] or so.
> > > https://en.wikipedia.org/wiki/Double-precision_floating-point_format
> > > If the upper double is denormal in the DFmode sense, so smaller
> absolute
> > > value than __DBL_MIN__, then doing nextafter in DFmode is the right
> thing to
> > > do, the lower double must be always +/- zero.
> > > Now, if the result is __DBL_MIN__, the upper double is already
> normalized
> > > but we can add __DBL_DENORM_MIN__ to it, which will make the number
> have
> > > 54-bit precision.
> > > If the result is __DBL_MIN__ * 2, we can again add __DBL_DENORM_MIN__
> > > and make it 55-bit precision.  Etc. until we reach __DBL_MIN__ * 2e53
> > > where it acts like fully normalized 106-bit precision number.
> > > I must say I'm not really sure what real_nextafter is doing in those
> cases,
> > > I'm afraid it doesn't handle it correctly but the only other use
> > > of real_nextafter is guarded with:
> > >   /* Don't handle composite modes, nor decimal, nor modes without
> > >      inf or denorm at least for now.  */
> > >   if (format->pnan < format->p
> > >       || format->b == 10
> > >       || !format->has_inf
> > >       || !format->has_denorm)
> > >     return false;
> >
> > Dunno.  Is there a conservative thing we can do for mode_composites
> > that aren't denormal or zero?
> >
> > How does this look?
> > Aldy
>

--000000000000a3c57905ec5097cc--