[PATCH] gdbserver: convert program_args to a single string

Simon Marchi simark@simark.ca
Tue Jan 14 15:32:58 GMT 2025



On 2025-01-14 08:39, Andrew Burgess wrote:
> This commit changes how gdbserver stores the inferior arguments from
> being a vector of separate arguments into a single string with all of
> the arguments combined together.
> 
> Making this change might feel a little strange; intuitively it feels
> like we would be better off storing the arguments as a vector, but
> this change is part of a larger series of work that aims to improve
> GDB's inferior argument handling.  The full series was posted here:
> 
>   https://inbox.sourceware.org/gdb-patches/cover.1730731085.git.aburgess@redhat.com
> 
> But asking people to review a 14 patch series in unreasonable, so I'm
> instead posting the patches in smaller batches.  This patch can stand
> alone, and I do think this change makes sense on its own:
> 
> First, GDB already stores the inferior arguments as a single string,
> so doing this moves gdbserver into line with GDB.  The common code
> into which gdbserver calls requires the arguments to be a single
> string, so currently each target's create_inferior implementation
> merged the arguments anyway, so all this commit really does is move
> the merging up the call stack, and store the merged result rather than
> storing the separate parts.
> 
> However, the biggest reason for why this commit is needed, is an issue
> with passing arguments from GDB to gdbserver when starting a new
> inferior.
> 
> Consider:
> 
>   (gdb) set args $VAR
>   (gdb) run
>   ...
> 
> When using a native target the inferior will see the value of $VAR
> expanded by the shell GDB uses to start the inferior.  However, if
> using an extended-remote target the inferior will see literally $VAR,
> the unexpanded name of the variable, the reason for this is that,
> although GDB sends '$VAR' to gdbserver, when gdbserver receives this,
> it converts this to '\$VAR', which prevents the variable from being
> expanded by the shell.
> 
> The reason for this is that construct_inferior_arguments escapes all
> special shell characters within its arguments, and it is
> construct_inferior_arguments that is used to combine the separate
> arguments into a single string.
> 
> In the future I will change construct_inferior_arguments so that
> it can apply different escaping strategies.  When this happens we will
> want to escape arguments coming from the gdbserver command line
> differently than arguments coming from GDB (via a vRun packet), which
> means we need to call construct_inferior_arguments earlier, at the
> point where we know if the arguments came from the gdbserver command
> line, or from the vRun packet.
> 
> This argument escaping issue is discussed in PR gdb/28392.
> 
> This commit doesn't fix any issues, nor does it change
> construct_inferior_arguments to actually do different escaping, that
> will all come later.  This is purely a restructuring.
> 
> There should be no user visible changes after this commit.
> 
> Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=28392
> 
> Tested-By: Guinevere Larsen <guinevere@redhat.com>

Some suggestions below, but otherwise:

Approved-By: Simon Marchi <simon.marchi@efficios.com>

> diff --git a/gdbserver/server.cc b/gdbserver/server.cc
> index 55898f59556..efe63ae7515 100644
> --- a/gdbserver/server.cc
> +++ b/gdbserver/server.cc
> @@ -121,7 +121,20 @@ private:
>    /* The program name, adjusted if needed.  */
>    std::string m_path;
>  } program_path;
> -static std::vector<char *> program_args;
> +
> +/* All program arguments are merged into a single string.  This is similar
> +   to how GDB manages the inferior arguments, and actually makes our lives
> +   easier; the rules for how arguments are merged into a single string
> +   differ depending on where the arguments come from.  Arguments arriving
> +   form the gdbserver command line are quoted, while arguments arriving
> +   from GDB (via a vRun packet) are already quoted.
> +
> +   NOTE: The comment above is ahead of its time.  The differences between
> +   how the PROGRAM_ARGS string is built up have not yet been implemented.
> +   A later patch in this series will make this change, and remove this
> +   note.  */

I think this is a bit too much for a code comment, it belongs to the
commit message (where it is already well explained).  It would be enough
to state what it is at the current time:

/* All program arguments are merged into a single string.  */

> @@ -4376,8 +4388,10 @@ captured_main (int argc, char *argv[])
>  
>        n = argc - (next_arg - argv);
>        program_path.set (next_arg[0]);
> +      std::vector<char *> temp_arg_vector;
>        for (i = 1; i < n; i++)
> -	program_args.push_back (xstrdup (next_arg[i]));
> +	temp_arg_vector.push_back (next_arg[i]);
> +      program_args = construct_inferior_arguments (temp_arg_vector);

Would that work, using std::vector's constructor that takes two
iterators?

    std::vector<char *> temp_arg_vector (&next_arg[1], &next_arg[argc]);
    program_args = construct_inferior_arguments (temp_arg_vector);

(not sure if the end iterator needs `argc` or `argc - 1`)

or directly:

    program_args = construct_inferior_arguments ({&next_arg[1], &next_arg[argc]});

> diff --git a/gdbserver/win32-low.cc b/gdbserver/win32-low.cc
> index da858b65e6f..139c945a2ba 100644
> --- a/gdbserver/win32-low.cc
> +++ b/gdbserver/win32-low.cc
> @@ -492,12 +492,12 @@ create_process (const char *program, char *args,
>  
>  /* Start a new process.
>     PROGRAM is the program name.
> -   PROGRAM_ARGS is the vector containing the inferior's args.
> +   PROGRAM_ARGS is a string containing all the inferior's arguments.
>     Returns the new PID on success, -1 on failure.  Registers the new
>     process with the process list.  */

I think this comment should just be removed, there's no point in
repeating the documentation from the base class.

Simon


More information about the Gdb-patches mailing list