riscv-gnu-toolchain/dejagnu.git - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author	Files	Lines
2020-06-22	remote: Fix a stuck remote call pipeline causing testing to hang	Maciej W. Rozycki	1	-1/+17
	Fix a stuck remote call pipeline comprised of multiple processes causing testing to hang and requiring a manual intervention to either terminate or proceed, like below (here with the GCC `c' testsuite invoked with `execute.exp=postmod-1.c' for 8 compilation and 8 execution tests on a remote QEMU target run in the system emulation mode): PASS: gcc.c-torture/execute/postmod-1.c -O0 (test for excess errors) Executing on remote-localhost: .../gcc/testsuite/gcc/postmod-1.exe (timeout = 15) spawn [open ...] WARNING: program timed out got a INT signal, interrupted by user === gcc Summary === # of expected passes 1 by not killing the pending force-kills in `close_wait_program' and also by setting the channel associated with the pipeline to the nonblocking mode when it is about to be closed afterwards. The situation here is as follows. A connection to the remote target board is requested by `rsh_exec' with input redirection requested from `/dev/null'. The request is handled by `local_exec' and the redirection causes a Tcl command pipeline channel to be opened. The list of PIDs of the processes comprising the pipeline is determined and then the channel is assigned an Expect spawn ID. The spawn ID is then waited for output produced by the remote target (here accessed with SSH) and, ultimately, completion marked by the end-of-file condition. As SSH gets stuck and does not complete the timeout eventually fires and a kill sequence is initiated, by calling `close_wait_program' with the list of PIDs previously obtained to kill given as one of the procedure's arguments. Seeing the list of PIDs rather than -1 `close_wait_program' issues SIGINT to all the requested processes right away and schedules a delayed sequence called "force-kills" to them, which sends SIGTERM and then, after a further delay, SIGKILL. Now `close_wait_program' calls `close' on the spawn ID associated with the pipeline, but this call doesn't affect the pipeline as its input has been redirected from `/dev/null'. As the next step `wait' is called on the same spawn ID and returns successfully right away with a result like {0 exp8 0 0} in `wres', where no PID is indicated, consistently with the null PID result of the original `spawn' command that assigned the spawn ID (`exp8' here) to the pipeline. The return from the `wait' command causes code to be executed for the pending force-kills to be killed. At this point the process situation is like below: PID TTY STAT TIME COMMAND 6908 pts/3 Sl 0:00 expect -- .../share/dejagnu/runtest.exp --tool gcc --target_board remote-localhost execute.exp=postmod-1.c 6976 pts/3 S 0:00 \_ ssh -p 2222 -l macro localhost sh -c '.../gcc/testsuite/gcc/postmod-1.exe ; echo XYZ${?}ZYX' 6977 pts/3 Z 0:00 \_ [cat] <defunct> 6991 pts/3 Z 0:00 \_ [sh] <defunct> so `cat' and `sh' have already terminated, the former presumably due to SIGINT sent previously and the latter having been the force-kills just killed, and only await being wait(2)ed for, however `ssh' is still live and in the interruptible sleep, presumably awaiting communication with the remote end. Since there is nothing else to do for `close_wait_program' it returns success to `local_exec', which then calls `close' on the pipeline to clean up after it. But that in turn causes wait(2) to be called on the individual PIDs comprising the pipeline and when the PID associated with `ssh' the call hangs indefinitely preventing the whole testsuite from proceeding. A similar situation triggers with GDB testing where a Tcl command pipeline channel is opened in `remote_spawn' instead, and then closed, after `close_wait_program' has been called, in `standard_close'. So the solution to the problem is twofold. First pending force-kills are not killed after `wait' if there are more than one PID in the list passed to `close_wait_program'. This follows the observation that if there was only one PID on the list, then the process must have been created directly by `spawn' rather than by assigning a spawn ID to a pipeline and the return from `wait' would mean the process associated with the PID must have already been cleaned up after, so it is only when there are more there is a possibility any may have been left behind live. Second if a pipeline has been used, then the channel associated with the pipeline is set to the nonblocking mode in case any of the processes that may have left live is stuck in the noninterruptible sleep (aka D) state. Such a process would necessarily ignore even SIGKILL so long as it remains in that state and would cause wait(2) called by `close' to hang possibly indefinitely, and we want the testsuite to proceed rather than hang even in bad circumstances. Finally it appears to be safe to leave pending force-kills to complete their job after `wait' has been called in `close_wait_program', because based on the observation made here the command does not actually call wait(2) if issued on a spawn ID associated with a pipeline created by `open' rather than a process created by `spawn'. Instead the PIDs from a pipeline are supposed to be cleaned up after by calling wait(2) from the `close' command call made on the pipeline channel. If on the other hand the channel is set to the nonblocking mode before `close', then even that command does not call wait(2) on the associated PIDs. Therefore the PIDs on the list passed are not subject to PID reuse and the force-kills won't accidentally kill an unrelated process, as a PID cannot be allocated by the kernel for a new process until any previous process's status has been consumed from its PID by wait(2). And then PIDs of any children that have actually terminated one way or another are wait(2)ed for by Tcl automatically in the event loop, so no mess is left behind. * lib/remote.exp (close_wait_program): Only kill the pending force-kills if the PID list has a single entry. (local_exec): Set the channel about to be closed to the nonblocking mode if we didn't see an EOF. (standard_close): Likewise, unconditionally. Signed-off-by: Maciej W. Rozycki <macro@wdc.com>
2020-06-22	remote: Use `catch' in killing pending force-kills	Maciej W. Rozycki	1	-1/+4
	Address an execution race in `close_wait_program' and use `catch' in killing pending force-kills issued there in the recovery of a stuck test case, in case the force-kill sequence has completed before the command to kill the sequence had a chance to run, so that no error is thrown and a testsuite run does not get interrupted early like: PASS: gcc.c-torture/execute/postmod-1.c -O0 (test for excess errors) Executing on remote-localhost: .../gcc/testsuite/gcc/postmod-1.exe (timeout = 15) spawn [open ...] WARNING: program timed out ERROR: tcl error sourcing .../gcc/testsuite/gcc.c-torture/execute/execute.exp. ERROR: child process exited abnormally while executing "exec sh -c "exec > /dev/null 2>&1 && kill -9 $exec_pid"" (procedure "close_wait_program" line 57) invoked from within "close_wait_program $spawn_id $pid wres" (procedure "local_exec" line 104) [...] "uplevel #0 source .../gcc/testsuite/gcc.c-torture/execute/execute.exp" invoked from within "catch "uplevel #0 source $test_file_name"" testcase .../gcc/testsuite/gcc.c-torture/execute/execute.exp completed in 196 seconds === gcc Summary === # of expected passes 1 -- therefore not letting `execute.exp' continue (here with the GCC `c' testsuite invoked with `execute.exp=postmod-1.c' for 8 compilation and 8 execution tests). The completion of the force-kill sequence would have to happen in the window between the `wait' command has returned, which would at worst happen as a result of the final `kill -9' command in the sequence, and the `kill -9 $exec_pid' command issued here, and the `sleep 5' command issued at the end of the force-kill sequence makes the likelihood of such a scenario low, but this might still happen with a loaded host system and there is no drawback from using `catch' here, so let's do it. * lib/remote.exp (close_wait_program): Use `catch' in killing pending force-kills. Signed-off-by: Maciej W. Rozycki <macro@wdc.com>
2020-06-22	Update NEWS	Jacob Bachmeyer	2	-0/+6

2020-06-22	Add tests for new features in default_target_compile	Jacob Bachmeyer	2	-3/+319

2020-06-22	Add "linker=" option to target_compile to support testing	Jacob Bachmeyer	3	-5/+18

2020-06-22	Fix up upstreamed GDB testsuite patches	Jacob Bachmeyer	2	-2/+20

2020-06-20	Document find_* procedures imported from GDB testsuite	Jacob Bachmeyer	2	-2/+51

2020-06-20	Add Go support to default_target_compile	Tom Tromey	4	-0/+70
	This adds Go support to default_target_compile. This comes from this gdb patch: commit a766d390bb857383a5f9ae80a102e1f8705f4c2e Author: Doug Evans <dje@google.com> Date: Wed Apr 25 14:07:23 2012 +0000 Initial pass at Go language support.
2020-06-20	Add Rust support to default_target_compile	Tom Tromey	4	-1/+52
	This adds support for the Rust language to default_target_compile. This comes from a gdb patch: commit 67218854b1987d89593ccaf5feaf5b29b1b976f2 Author: Tom Tromey <tom@tromey.com> Date: Tue Apr 26 19:38:43 2016 -0600 Update gdb test suite for Rust [...] 2016-05-17 Tom Tromey <tom@tromey.com> Manish Goregaokar <manishsmail@gmail.com>
2020-06-20	Add early_flags to default_target_compile	Tom Tromey	3	-5/+39
	This adds early_flags support to default_target_compile. This originated in this gdb patch: commit 6ebea266fd0a7a56c90db3ab6237ff9f6c919747 Author: Doug Evans <dje@google.com> Date: Fri Jul 24 15:24:37 2015 -0700 Workaround debian change to default value of --as-needed. gdb/testsuite/ChangeLog: * lib/future.exp (gdb_default_target_compile): New option "early_flags". * lib/gdb.exp (gdb_compile): Undo debian's change in default of --as-needed. This patch also pulls in the "linker_opts_order" code, though nothing uses it yet. A use will come in a subsequent patch.
2020-06-19	Update ChangeLog	Jacob Bachmeyer	1	-0/+11

2020-06-19	Add tests for handling arithmetic errors in non-auto-loaded procedures	Tom de Vries	2	-0/+40

2020-06-19	Reword NEWS entries	Tom de Vries	1	-4/+3
	[Following comments from Jacob Bachmeyer] The NEWS entries changed in this patch were originally written separately and had not been rethought when the general handling of Tcl errors was tightened. Another thanks to Tom de Vries for catching this.
2020-06-18	Use consistent behavior for Tcl errors in test scripts	Jacob Bachmeyer	5	-26/+53

2020-06-18	Add tests for handling arithmetic errors in auto-loaded procedures	Jacob Bachmeyer	3	-0/+63

2020-06-17	Fix typos in ChangeLog	Jacob Bachmeyer	1	-1/+2

2020-06-17	Merge branch 'pr41914' into 'PR41824'	Jacob Bachmeyer	2	-0/+6

2020-06-17	Add new testsuite files to TESTSUITE_FILES in Makefile.am	Jacob Bachmeyer	2	-0/+4

2020-06-17	Allow testing to continue after an undefined command is called	Jacob Bachmeyer	11	-5/+195

2020-06-17	Propagate return value of auto-loaded command	Rob Savoye	2	-0/+8

2020-06-15	Update ChangeLog	Jacob Bachmeyer	1	-0/+15

2020-06-15	Combine remote stderr into remote stdout	Christophe Lyon	1	-1/+1
	* lib/ssh.exp (ssh_exec): Redirect stderr to stdout on the remote machine, to avoid race conditions.
2020-06-15	Keep trailing newline in remote execution output	Yvan Roux	2	-6/+0
	* lib/rsh.exp (rsh_exec): Don't remove trailing newline. * lib/ssh.exp (rsh_exec): Likewise.
2020-06-15	Support using QEMU in local/remote testing using default "unix" board	Maxim Kuvyrkov	1	-0/+13
	If the board file defines "exec_shell", prepend it before the local or remote command.
2020-06-10	Fix Info node connectivity and formatting mistake	Jacob Bachmeyer	1	-1/+2

2020-06-06	Add "testcase group" API	Jacob Bachmeyer	5	-11/+249

2020-06-05	Add "testsuite can call api" feature test API	Jacob Bachmeyer	5	-7/+92

2020-06-02	Save and restore variables set by command arguments	Jacob Bachmeyer	3	-6/+70
	This commit closes the window during which srcdir was controlled by the testsuite local init file even if specified on the command line. Allowing this override causes problems with Automake when the source directory was set using a relative file name.
2020-06-01	Print initial working directory if verbose output is selected	Jacob Bachmeyer	2	-0/+3

2020-06-01	Add verbose output for testsuite search	Jacob Bachmeyer	2	-0/+17

2020-05-28	Fix problems with relative srcdir in launcher and report-card testsuites	Jacob Bachmeyer	3	-2/+7

2020-05-28	Fix problems with relative srcdir in testsuite/runtest.main/options.exp	Jacob Bachmeyer	2	-1/+5

2020-05-28	Show the user's login name shortly after finding it	Jacob Bachmeyer	2	-2/+5
	Previously, two procedures were defined between setting logname and reporting its value if verbosity is selected. This does not change program flow, but will make the code easier to examine in the future.
2020-05-28	Tidy irregular indentation in ChangeLog	Jacob Bachmeyer	1	-32/+34

2020-05-28	Fix default value of libdir global	Jacob Bachmeyer	2	-1/+9
	This was causing testsuite/runtest.main/stats.exp to fail unless the DejaGnu sources were in a directory named dejagnu.
2020-05-27	Fix node duplicated by merge error in dejagnu.texi	Jacob Bachmeyer	1	-14/+0
	The previous merge duplicated the "target_link procedure" node.
2020-05-26	Merge branch 'patch-0002'	Rob Savoye	2	-4/+179

2020-05-26	Add documentation for target_compile procedure.	Jacob Bachmeyer	2	-22/+197

2020-05-26	Document internal procedure, default_link	Jacob Bachmeyer	2	-18/+32

2020-05-25	Use board_info correctly.	Jacob Bachmeyer	1	-10/+10

2020-05-25	Cleanup whitespace	Jacob Bachmeyer	8	-15/+18

2020-05-25	Use host_info procedure to probe for a host configuration, instead of ↵	Jacob Bachmeyer	2	-1/+7
	checking a local empty target_info array due to lacking global target_info.
2020-05-25	Fix Copyright date	Rob Savoye	30	-31/+66

2020-05-25	Establish a default C compiler by evaluating [find_gcc] if no other ↵	Jacob Bachmeyer	44	-125/+102
	compiler is given.
2020-05-24	add entry	Rob Savoye	1	-0/+5

2020-05-24	Add baseboard to cross test to a Raspberry PI	Rob Savoye	1	-0/+24

2020-05-24	This adds a .dir-locals.el for emacs to the repository	Tom Tromey	1	-0/+22

2020-05-24	Ignore generated files.	Tom Tromey	2	-0/+8

2020-05-21	Collect loading DejaGnu libraries into a single loop, Revise the mock ↵	Jacob Bachmeyer	2	-33/+1288
	board_info array.
2020-05-21	New file	Jacob Bachmeyer	1	-0/+246