This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [2.20] [6/6] Do not terminate default test runs on test failure
- From: Brooks Moses <bmoses at google dot com>
- To: "Joseph S. Myers" <joseph at codesourcery dot com>, libc-alpha at sourceware dot org
- Date: Mon, 10 Feb 2014 16:10:40 -0800
- Subject: Re: [2.20] [6/6] Do not terminate default test runs on test failure
- Authentication-results: sourceware.org; auth=none
- References: <Pine dot LNX dot 4 dot 64 dot 1401100208000 dot 9412 at digraph dot polyomino dot org dot uk> <Pine dot LNX dot 4 dot 64 dot 1401100214150 dot 9412 at digraph dot polyomino dot org dot uk>
On 01/09/2014 06:14 PM, Joseph S. Myers wrote:
> Normal practice for software testsuites is that rather than
> terminating immediately when a test fails, they continue running and
> report at the end on how many tests passed or failed.
> [...]
> Questions:
> * Should "make check" also exit with error status if there were any
>   FAILs, not just if there were ERRORs? I didn't do this because it
>   seems useful to make the exit status distinguish the case where the
>   build is broken (early termination or an ERROR) from the case where
>   there are simply some test failures.
The current case seems reasonable to me.
For our internal GCC builds, we have XFAIL manifests separate from the ones
maintained in the testsuite, and compare the .sum files against these
manifests to determine whether we consider the build successful. This is
because our local patches sometimes cause regressions in tests we don't care
about, and sometimes our test environment itself leads to failures.
Thus, we'd like to be able to have the make status be an indication of
whether or not things got to the point of being able to create a test
summary file, and then examine the summary file ourselves in a separate step
to determine whether the result is overall PASS or FAIL.
(For reference: The comparison script is validate_failures.py in GCC's
contrib/testsuite-management directory. I expect it will work with the
summary files this patch chain creates, without needing modifications.)
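As a minimal sketch of the comparison step described above (hypothetical code, not GCC's actual validate_failures.py): collect the FAIL lines from a .sum-style summary and treat the run as an overall PASS only if every failure appears in the locally maintained XFAIL manifest. Test and manifest names here are made up for illustration.

```python
# Hypothetical sketch: compare FAIL lines in a .sum-style summary
# against a locally maintained XFAIL manifest.  This mirrors the idea
# behind GCC's contrib/testsuite-management/validate_failures.py, but
# is not that script.

def parse_failures(summary_lines):
    """Return the set of test names reported as FAIL in a summary."""
    fails = set()
    for line in summary_lines:
        if line.startswith("FAIL: "):
            fails.add(line[len("FAIL: "):].strip())
    return fails

def validate(summary_lines, manifest):
    """Overall PASS iff every FAIL is listed in the XFAIL manifest."""
    unexpected = parse_failures(summary_lines) - set(manifest)
    return ("PASS", set()) if not unexpected else ("FAIL", unexpected)

summary = [
    "PASS: math/test-float",
    "FAIL: posix/tst-getaddrinfo",  # known-bad in this environment
    "FAIL: elf/tst-unique1",        # unexpected regression
]
manifest = ["posix/tst-getaddrinfo"]

status, unexpected = validate(summary, manifest)
print(status, sorted(unexpected))   # -> FAIL ['elf/tst-unique1']
```

This keeps the two concerns separate, as described above: make's exit status says whether a summary could be produced at all, and this step decides whether the results are acceptable.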
> * There are various cases of miscellaneous files that are dependencies
>   of tests, the build of which involves running some code from the
>   newly built libc and associated programs, but which aren't counted
>   as tests themselves (aren't in tests-special, don't use
>   evaluate-test) - for example, timezone files generated with zic.
>   Should these more systematically change to be counted as tests?
Yes, please. See my mantra about "more consistency means we are closer to a
future in which I can have a system that runs tests remotely without a
shared filesystem". :)
> They may not be that interesting as tests, but doing this would
> increase the chances of getting a complete log of test results when
> cross-testing if the host system is flaky - ensuring that only
> problems on the build system will terminate a test run early, not
> problems on the host used for running the newly built libc.
Yup.
> (Ideally, in all cases of a test depending on another test - and
> there are plenty already of one test using the .out file from
> another test, whether or not we make a rule that running code on the
> host means it's a test - failure in the dependency would result in
> UNRESOLVED not FAIL from the test depending on the failed test. But
> that may be hard to implement, and I'm not particularly concerned
> about one test failure causing others as fallout.)
Agreed; this seems fairly low-priority.
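For illustration only (hypothetical naming, not glibc's actual evaluate-test machinery), the dependency idea quoted above amounts to something like this: when a prerequisite test failed, the dependent test's outcome is reported as UNRESOLVED rather than FAIL.

```python
# Hypothetical sketch of dependency-aware result reporting: if a test's
# prerequisite (e.g. the test producing the .out file it consumes)
# failed, report the dependent test as UNRESOLVED instead of FAIL.

def evaluate(ran_ok, dep_results):
    """dep_results maps dependency names to True (passed) / False (failed)."""
    if any(not ok for ok in dep_results.values()):
        return "UNRESOLVED"   # dependency failed: don't blame this test
    return "PASS" if ran_ok else "FAIL"

print(evaluate(False, {"tst-writer": False}))  # -> UNRESOLVED, not FAIL
print(evaluate(True, {"tst-writer": True}))    # -> PASS
```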
A couple of minor comments:
> To build and run test programs which exercise some of the library
> facilities, type `make check'. If it does not complete successfully,
> [...]
> +without reporting any unexpected failures or errors in its final
> +summary of results, do not use the built library, and report a bug
> +after verifying that the problem is not already known. (You can
I would write this as "If it does not complete successfully, or if it
reports any unexpected failures or errors...," to avoid the nested negation
and thereby make it clearer.
> +specify `stop-on-test-failure=y' when running `make check' to make it
Expand "it" to "the test run".
> +stop immediately when a failure occurs rather than finishing running
> +the tests then reporting all problems found.) *Note Reporting Bugs::,
I think "rather than completing the test run and reporting..." would be a
little cleaner.
Possibly expand "stop" to "stop and exit with an error status" or something
like that, too.
The same comments apply to manual/install.texi, of course.
Other than the documentation comments, this seems good to me -- and the
overall patch sequence seems a marked improvement over the status quo. Even
on x86_64, we have a handful of local patches that amount to "delete this
test because it is broken", and it will be nice to change those to an XFAIL
validation step instead.
Thanks,
- Brooks