This is the mail archive of the gdb-patches@sourceware.org mailing list for the GDB project.


[RFC] fix gdb.threads/non-stop-fair-events.exp timeouts

From: Sandra Loosemore <sandra at codesourcery dot com>
To: gdb-patches <gdb-patches at sourceware dot org>
Cc: Pedro Alves <palves at redhat dot com>, Yao Qi <yao dot qi at linaro dot org>
Date: Fri, 4 Sep 2015 10:54:37 -0600
Subject: [RFC] fix gdb.threads/non-stop-fair-events.exp timeouts
Authentication-results: sourceware.org; auth=none

While running GDB tests on nios2-linux-gnu with gdbserver and "target remote", I've been seeing random failures in gdb.threads/non-stop-fair-events.exp. E.g. in one test run I got

FAIL: gdb.threads/non-stop-fair-events.exp: signal_thread=6: thread 1 broke out of loop (timeout)
FAIL: gdb.threads/non-stop-fair-events.exp: signal_thread=6: thread 2 broke out of loop (timeout)
FAIL: gdb.threads/non-stop-fair-events.exp: signal_thread=6: thread 3 broke out of loop (timeout)
FAIL: gdb.threads/non-stop-fair-events.exp: signal_thread=7: thread 1 broke out of loop (timeout)
FAIL: gdb.threads/non-stop-fair-events.exp: signal_thread=10: thread 1 broke out of loop (timeout)
FAIL: gdb.threads/non-stop-fair-events.exp: signal_thread=10: thread 2 broke out of loop (timeout)

and in other test runs I got different ones. The pattern seemed to be that sometimes it took an extra long time for the first thread to break out of the loop, but once that happened they would all stop correctly and send the expected replies, even though GDB had already given up on waiting for the first few.

I've come up with the attached patch to factor the timeout for the failing tests by the number of threads still running, which seems to take care of the problem. Does this seem reasonable?
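
To make the arithmetic concrete, here is a rough Python model of the timeout schedule the patch produces. It is only a sketch, not the testsuite code: it assumes with_timeout_factor multiplies the base timeout by the given factor, and both the thread count of 10 and the base timeout of 10 seconds are illustrative values, not taken from the test or the board file.

```python
def factored_timeouts(num_threads, base_timeout):
    """Model the per-iteration timeouts used while waiting for each thread.

    Assumption: the timeout in effect is factor * base_timeout, where the
    factor starts at num_threads and drops by one after each thread reports.
    """
    timeouts = []
    factor = num_threads              # mirrors: set max_timeout $NUM_THREADS
    for _ in range(num_threads):
        timeouts.append(factor * base_timeout)
        factor -= 1                   # mirrors: set max_timeout [expr $max_timeout - 1]
    return timeouts


# Hypothetical values: 10 threads, 10-second base timeout.
schedule = factored_timeouts(num_threads=10, base_timeout=10)
print(schedule)
```

So the first wait gets the largest allowance and the last wait falls back to the plain timeout, which matches the observation that only the first stop message is slow to arrive.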

I'm somewhat confused because, in spite of it sometimes taking at least 3 times the normal timeout for the first stop message to appear, the alarm in the test case (which is tied to the normal timeout) was never triggering. My best theory on that is that the slowness is not in the test case, but rather in gdbserver. IOW, all the threads are already stopped by the time the alarm would expire, but gdb and gdbserver haven't finished all the notifications and requests to print a stop message for any of the threads yet. Is that plausible? Should the timeout for the alarm be factored by the number of threads, too, just to be safe?

I'm also not entirely sure what this test case is supposed to test. From the original commit message and comments in the .exp file it seems like timeouts were supposed to be a sign of a broken kernel with thread starvation problems, not bugs in gdb or gdbserver. But, don't we normally just skip tests that the target doesn't support or can't run properly, rather than report them as FAILs? And, I don't know how to distinguish timeouts that mean the kernel is broken from timeouts that mean the target is just slow and you need to set a bigger value in the test harness.


-Sandra the confused

2015-09-04  Sandra Loosemore  <sandra@codesourcery.com>

	gdb/testsuite/
	* gdb.threads/non-stop-fair-events.exp (test): Use factored
	timeout when waiting for threads to break out of loop.

diff --git a/gdb/testsuite/gdb.threads/non-stop-fair-events.exp b/gdb/testsuite/gdb.threads/non-stop-fair-events.exp
index e2d3f7d..1570d3f 100644
--- a/gdb/testsuite/gdb.threads/non-stop-fair-events.exp
+++ b/gdb/testsuite/gdb.threads/non-stop-fair-events.exp
@@ -135,16 +135,22 @@ proc test {signal_thread} {
 
 	# Wait for all threads to finish their steps, and for the main
 	# thread to hit the breakpoint.
+	# Running this many threads may be quite slow on remote targets,
+	# so factor the timeout according to how many threads are running.
+	set max_timeout $NUM_THREADS
 	for {set i 1} { $i <= $NUM_THREADS } { incr i } {
 	    set test "thread $i broke out of loop"
-	    gdb_test_multiple "" $test {
-		-re "loop_broke" {
-		    # The prompt was already matched in the "continue
-		    # &" test above.  We're now consuming asynchronous
-		    # output that comes after the prompt.
-		    pass $test
+	    with_timeout_factor $max_timeout {
+	        gdb_test_multiple "" $test {
+		    -re "loop_broke" {
+			# The prompt was already matched in the "continue
+			# &" test above.  We're now consuming asynchronous
+			# output that comes after the prompt.
+			pass $test
+		    }
 		}
 	    }
+	    set max_timeout [expr $max_timeout - 1]
 	}
 
 	# It's helpful to have this in the log if the test ever

Follow-Ups:
- Re: [RFC] fix gdb.threads/non-stop-fair-events.exp timeouts
  - From: Pedro Alves
