From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <hi@jdoubleu.de>
Received: from deimos.uberspace.de (deimos.uberspace.de [185.26.156.152])
 by sourceware.org (Postfix) with ESMTPS id D277E3858C51
 for <newlib@sourceware.org>; Tue, 12 Apr 2022 11:19:13 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org D277E3858C51
Authentication-Results: sourceware.org;
 dmarc=none (p=none dis=none) header.from=jdoubleu.de
Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=jdoubleu.de
Received: (qmail 21784 invoked by uid 989); 12 Apr 2022 11:19:11 -0000
Authentication-Results: deimos.uberspace.de;
	auth=pass (plain)
Message-ID: <ba27e7c5-d12e-2d76-4d32-97c9a7bc5a91@jdoubleu.de>
Date: Tue, 12 Apr 2022 13:19:08 +0200
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101
 Thunderbird/91.8.0
Subject: Re: [PATCH] add tests for tzset(3)
Content-Language: en-US
To: Dimitar Dimitrov <dimitar@dinux.eu>
Cc: newlib@sourceware.org, Jeff Johnston <jjohnstn@redhat.com>,
 Brian Inglis <Brian.Inglis@SystematicSw.ab.ca>
References: <569f6ec3-6bc8-8170-c1fb-ed946020e258@jdoubleu.de>
 <YlKYmZ0oNLitMT2b@kendros> <c8ca181b-294b-2cc0-8b0f-5756ddd99e88@jdoubleu.de>
 <YlNFZ+JY0OxY+6d3@kendros> <836b0ac6-5e3a-03ca-d696-16adda6554a8@jdoubleu.de>
 <YlRk89SzhU71U+vs@kendros>
From: jdoubleu <hi@jdoubleu.de>
In-Reply-To: <YlRk89SzhU71U+vs@kendros>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
X-Rspamd-Bar: ---
X-Rspamd-Report: BAYES_HAM(-3) MIME_GOOD(-0.1)
X-Rspamd-Score: -3.1
Received: from unknown (HELO unkown) (::1)
 by deimos.uberspace.de (Haraka/2.8.28) with ESMTPSA;
 Tue, 12 Apr 2022 13:19:11 +0200
X-Spam-Status: No, score=-10.2 required=5.0 tests=BAYES_00, BODY_8BITS,
 GIT_PATCH_0, KAM_DMARC_STATUS, KAM_SHORT, MSGID_FROM_MTA_HEADER, NICE_REPLY_A,
 RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP,
 T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.4
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on
 server2.sourceware.org
X-BeenThere: newlib@sourceware.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Newlib mailing list <newlib.sourceware.org>
List-Unsubscribe: <https://sourceware.org/mailman/options/newlib>,
 <mailto:newlib-request@sourceware.org?subject=unsubscribe>
List-Archive: <https://sourceware.org/pipermail/newlib/>
List-Post: <mailto:newlib@sourceware.org>
List-Help: <mailto:newlib-request@sourceware.org?subject=help>
List-Subscribe: <https://sourceware.org/mailman/listinfo/newlib>,
 <mailto:newlib-request@sourceware.org?subject=subscribe>
X-List-Received-Date: Tue, 12 Apr 2022 11:19:17 -0000

Hi,

> I believe it is the ('>' != tzenv[n]) check that fails, because the
> maximum parsed string limit of 10 has been reached and n=10.

Okay, so the newlib implementation actually fails, when the name is too 
long.

The POSIX standard leaves it up to the implementation how to handle 
names[1]:
> The interpretation of these fields is unspecified if either field is less than three bytes (except for the case when dst is missing), more than {TZNAME_MAX} bytes, or if they contain characters other than those specified.

I'm not sure where to go with this. In both cases, the current 
implementation needs a fix. Looping in Brian Inglis, the writer of the 
current implementation and Jeff Johnston, a maintainer.

As I see it, there are two solutions:
1. Treat long names as an error case. The failed condition[2] needs to 
apply UTC as current time, like Dimitar Dimitrov previously suggested. 
The test needs to be changed to check that it actually fails.
2. Mimic glibc's behavior and ignore the remaining characters from the 
name. In that case, the condition[2] and following code needs to be 
updated as well.

I am happy to provide a patch, when we agreed on one solution.

> I assume you no longer need assembly output from compiler?

No, thanks. We already found the issue. It would only be helpful, if 
different configurations generate different code (through macros, 
optimization, etc.).

[1]: 
https://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap08.html#tag_08_03
[2]: 
https://sourceware.org/git/?p=newlib-cygwin.git;a=blob;f=newlib/libc/time/tzset_r.c;h=9cb30b188f989f65ec9eb6417f5d74020f8c72e9;hb=HEAD#l57

Cheers
---
🙎🏻‍♂️ jdoubleu
On 4/11/2022 7:27 PM, Dimitar Dimitrov wrote:
> On Mon, Apr 11, 2022 at 01:17:16PM +0200, jdoubleu wrote:
>> Hi,
>>
>> looks like I'm running the testsuite against glibc and not newlib (for
>> target x86_64-pc-linux-gnu). I'm not even sure whether there's a backend for
>> linux.
>>
>> I'm currently trying to run only the tzset code against the test vectors
>> (like Brian Inglis did[1]).
>>
>> At least it show, that the newlib implementation differs from glibc. Maybe
>> the test case is flawed and it should indeed fail.
>>
>>> 6:20:12 is the timezone string of the previous test case, whose tzset
>>> call was successful. Looking at the current code, this is expected
>>> behaviour.
>>
>> Okay. Looks like the condition[2] fails. The question is, which part of it
>> does?
> 
> I believe it is the ('>' != tzenv[n]) check that fails, because the
> maximum parsed string limit of 10 has been reached and n=10.
> 
>>
>> I've appended a patch, which prints all variables when the condition fails.
>> Could you please apply the patch and then recompile and re-run the tests
>> again?
> 
> Here is the result for arm-none-eabi:
> parsing name: tzenv="+0123456789ABCDEF>3:33:33", res=1, n=10, tzenv[n] = 9
> Assertion failed! Expected 1647906533 to equal 1647916532. winter time, timezone = "<+0123456789ABCDEF>3:33:33"
> 
> I assume you no longer need assembly output from compiler?
>>
>> I've previously noticed something with the sscanf format[3].
>>
>>> Perhaps TZ should be reset to UTC before the bail out?
>>
>> I don't think the implementation should fall back to UTC whenever parsing
>> failed. It apparently doesn't in glibc. I'm not sure if the behavior is
>> specified somewhere.
> 
> The tzset manual page says that UTC is used if TZ cannot be parsed:
> https://man7.org/linux/man-pages/man3/tzset.3.html
> 
>>
>> Maybe resetting it before each test case is a good idea, though. That makes
>> it clearer, why the test case failed.
>>
>>> With that chunk removed, as shown below:
>>>      {"3:33:33",               IN_SECONDS(3, 33, 33),   NO_TIME},                 // truncates the name (17 + 1)
>>> I still get:
>>>    Assertion failed! Expected 1647906533 to equal 1647916532. winter time, timezone = "3:33:33"
>>
>> My bad; "3:33:33" isn't a valid timezone string. It has to be prefixed by a
>> name e.g. "MESZ" or "<+00>", as you tried. That explains why it is also
>> failing.
>>
>>
>> Thanks for your effort so far!
>>
>>
>> [1]: https://sourceware.org/pipermail/newlib/2022/019529.html
>> [2]: https://sourceware.org/git/?p=newlib-cygwin.git;a=blob;f=newlib/libc/time/tzset_r.c;h=9cb30b188f989f65ec9eb6417f5d74020f8c72e9;hb=HEAD#l57
>> [3]: https://sourceware.org/pipermail/newlib/2022/019535.html
>>
>>
>> Cheers
>> ---
>> 🙎🏻‍♂️ jdoubleu
> 
>>  From 8c698dc63f765d4a5b3a49a25850c1738279d68d Mon Sep 17 00:00:00 2001
>> From: jdoubleu <hi@jdoubleu.de>
>> Date: Mon, 11 Apr 2022 13:10:38 +0200
>> Subject: [PATCH] debug print condition values of tz string name parsing
>>
>> ---
>>   newlib/libc/time/tzset_r.c | 6 ++++--
>>   1 file changed, 4 insertions(+), 2 deletions(-)
>>
>> diff --git a/newlib/libc/time/tzset_r.c b/newlib/libc/time/tzset_r.c
>> index 9cb30b188..e20a32a62 100644
>> --- a/newlib/libc/time/tzset_r.c
>> +++ b/newlib/libc/time/tzset_r.c
>> @@ -54,9 +54,11 @@ _tzset_unlocked_r (struct _reent *reent_ptr)
>>         ++tzenv;
>>   
>>         /* quit if no items, too few or too many chars, or no close quote '>' */
>> -      if (sscanf (tzenv, "%10[-+0-9A-Za-z]%n", __tzname_std, &n) <= 0
>> -		|| n < TZNAME_MIN || TZNAME_MAX < n || '>' != tzenv[n])
>> +      int res = sscanf (tzenv, "%10[-+0-9A-Za-z]%n", __tzname_std, &n);
>> +      if (res <= 0 || n < TZNAME_MIN || TZNAME_MAX < n || '>' != tzenv[n]) {
>> +        printf("parsing name: tzenv=\"%s\", res=%d, n=%d, tzenv[n] = %c\n", tzenv, res, n, tzenv[n]);
>>           return;
>> +      }
>>   
>>         ++tzenv;	/* bump for close quote '>' */
>>       }
>> -- 
>> 2.35.1
>>
>