From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from smtp.gentoo.org (smtp.gentoo.org [140.211.166.183]) by sourceware.org (Postfix) with ESMTP id 987F53857829 for ; Wed, 24 Mar 2021 01:13:40 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.3.2 sourceware.org 987F53857829 Received: from vapier (localhost [127.0.0.1]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.gentoo.org (Postfix) with ESMTPS id 7E474335D52; Wed, 24 Mar 2021 01:13:39 +0000 (UTC) Date: Tue, 23 Mar 2021 21:13:39 -0400 From: Mike Frysinger To: Jeffrey Walton Cc: Peng Yu , libc-help Subject: Re: Does glibc has complete test coverage? Message-ID: Mail-Followup-To: Jeffrey Walton , Peng Yu , libc-help References: MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-4.7 required=5.0 tests=BAYES_00, JMQ_SPF_NEUTRAL, KAM_DMARC_STATUS, RCVD_IN_MSPIKE_H3, RCVD_IN_MSPIKE_WL, SPF_HELO_PASS, SPF_PASS, TXREP autolearn=no autolearn_force=no version=3.4.2 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on server2.sourceware.org X-BeenThere: libc-help@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-help mailing list List-Unsubscribe: , List-Archive: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 24 Mar 2021 01:13:42 -0000 On 23 Mar 2021 17:02, Jeffrey Walton wrote: > On Tue, Mar 23, 2021 at 4:43 PM Mike Frysinger wrote: > > On 23 Mar 2021 11:39, Peng Yu via Libc-help wrote: > > > https://www.kernel.org/doc/man-pages/missing_pages.html > > > > > > "... quite a few kernel and glibc bugs have been uncovered while > > > writing test programs during the preparation of man pages. " > > > > > > I see the above text. It doesn't make too much sense, as it indicates > > > that glibc does not have complete test coverage. > > > > > > Why not taking an approach of always accompanying each line of source > > > code with appopriate test cases? If this approach is taken, then most > > > bugs should have been eliminated beforehand? > > > > ignoring the legacy aspect (code that's in the tree now but lacks tests), > > you have diminishing returns when it comes to writing unittests, and, as > > can be seen in a recent discussion, glibc is pretty tightly coupled to > > the runtime environment (i.e. the host kernel). so getting an env that > > matches all the different code paths is challenging. > > > > plus it comes down a bit to this being an open source project for many > > of us, not a job, and you have to be respectful of balancing quality > > and developer time with any requests you make on other volunteers. > > > > along those lines, this is an open source project where "patches are > > welcome", so if you wanted to spend your time improving the frameworks > > and coverage of our tests, we'd welcome you. > > Interns are usually a good choice for writing test cases. It gets them > familiar with the code, frees up a senior developer's time, and helps > avoid the developer's bias. > > Test cases are monkey work that should be delegated. When delegation > does not occur it usually points back to shortcomings in project > management. many are good for delegation, but that doesn't mean quantity is the same as quality. if we could get 100% coverage but it took weeks to run, but 90% coverage took <1 hour, is that 10% worth it ? this isn't exactly hyperbole when we have targets that run on simulators or FPGAs and have <<1GHz CPUs. for example, how much of LTP should be part of glibc ? they have over 1000 "syscall" tests which mostly go through the C library's APIs and can catch bugs, but they also take a long time to run. how much should glibc be exercising different kernel versions ? a lot of our work & APIs depend heavily on the kernel working correctly. should we be running against every Linux release since 3.2 ? do we test the many different ways kernels can be compiled ? do we workaround kernel bugs ? https://sourceware.org/pipermail/libc-alpha/2021-March/123486.html https://sourceware.org/pipermail/libc-alpha/2021-March/123582.html glibc has a matrix of build tools that it can utilize and significantly affects its behavior & output. do we try every combo of GCC & binutils that we support ? glibc runs on like 20 diff architectures, and many of those have ISA specific optimizations (like x86_64 SSE/AVX/etc...). that's another huge multiplier. my point is that "100% coverage" sounds fine until you dive down the rabbit hole and realize it goes forever. > > also try googling for "100% test coverage" and reading the variety of > > opinions the wider world has on the topic. > > Sorry, I could not resist.... But you know the funny thing is, when > you perform a post-mortem to determine why the bug made it into > production, it usually points to (1) a developer mistake and (2) lack > of test case. > > If you break (1) or (2) you break the chain for the bug. So you either > have to hire developers who don't make mistakes or provide complete > test cases. i don't think that view lines up perfectly with the real world as you might like it to. tests can have bugs too, and i've found plenty of those. as in, all the tests pass fine, but that's because the tests set up an invalid environment that doesn't match the real runtime. or the tests are there, and the runtime (e.g. kernel) changed. that doesn't stop the bug from being introduced because kernel developers aren't running all of the world's testsuites against their releases. i'm not saying tests don't add value and we shouldn't write them. i write tests constantly. but i am saying that they aren't the solution to all of our ails. -mike