[PATCH 5/5] gdb: native target invalid architecture detection
John Baldwin
jhb@FreeBSD.org
Tue May 31 16:08:17 GMT 2022
On 5/31/22 7:30 AM, Andrew Burgess via Gdb-patches wrote:
> If GDB is asked to start a new inferior, or attach to an existing
> process, using a binary file for an architecture that does not match
> the current native target, then, currently, GDB will assert. Here's
> an example session using current HEAD of master with GDB built for an
> x86-64 GNU/Linux native target, the binary being used is a RISC-V ELF:
>
> $ ./gdb/gdb -q --data-directory ./gdb/data-directory/
> (gdb) file /tmp/hello.rv32imc.x
> Reading symbols from /tmp/hello.rv32imc.x...
> (gdb) start
> Temporary breakpoint 1 at 0x101b2: file hello.rv32.c, line 23.
> Starting program: /tmp/hello.rv32imc.x
> ../../src/gdb/gdbarch.h:166: internal-error: gdbarch_tdep: Assertion `dynamic_cast<TDepType *> (tdep) != nullptr' failed.
> A problem internal to GDB has been detected,
> further debugging may prove unreliable.
>
> The same error is encountered if, instead of starting a new inferior,
> the user tries to attach to an x86-64 process with a RISC-V binary set
> as the current executable.
>
> These errors are not specific to the x86-64/RISC-V pairing I'm using
> here, any attempt to use a binary for one architecture with a native
> target of a different architecture will result in a similar error.
>
> Clearly, attempting to use this cross-architecture combination is a
> user error, but I think GDB should do better than an assert; ideally a
> nice error should be printed.
>
> The problem we run into is that, when the user starts a new inferior,
> or attaches to an inferior, the inferior stops. At this point GDB
> attempts to handle the stop, and this involves reading registers from
> the inferior.
>
> These register reads end up being done through the native target, so
> in the example above, we end up in the amd64_supply_fxsave function.
> However, these functions need a gdbarch. The gdbarch is fetched from
> the register set, which was constructed using the gdbarch from the
> binary currently in use. And so we end up in amd64_supply_fxsave
> using a RISC-V gdbarch.
>
> When we call:
>
> i386_gdbarch_tdep *tdep = gdbarch_tdep<i386_gdbarch_tdep> (gdbarch);
>
> this will assert as the gdbarch_tdep data within the RISC-V gdbarch is
> of the type riscv_gdbarch_tdep not i386_gdbarch_tdep.
>
> The solution I propose in this commit is to add a new target_ops
> method supports_architecture_p. This method will return true if a
> target can safely be used with a specific architecture, otherwise, the
> method returns false.
>
> I imagine that a result of true from this method doesn't guarantee
> that GDB can start an inferior of a given architecture, it just means
> that GDB will not crash if such an attempt is made. A result of false
> is a hard stop; attempting to use this target with this architecture
> is not supported, and may cause GDB to crash.
>
> This distinction is important I think for things like remote targets,
> and possibly simulator targets. We might imagine that GDB can ask a
> remote (or simulator) to start with a particular executable, and the
> target might still refuse for some reason. But my thinking is that
> these refusals should be well handled (i.e. GDB should give a user
> friendly error), rather than crashing, as is the case with the native
> targets.
>
> For example, if I start gdbserver on an x86-64 machine like this:
>
> gdbserver --multi :54321
>
> Then make use of this from a GDB session like this:
>
> $ ./gdb/gdb -q --data-directory ./gdb/data-directory/
> (gdb) file /tmp/hello.rv32imc.x
> Reading symbols from /tmp/hello.rv32imc.x...
> (gdb) target extended-remote :54321
> Remote debugging using :54321
> (gdb) run
> Starting program: /tmp/hello.rv32imc.x
> Running the default executable on the remote target failed; try "set remote exec-file"?
> (gdb)
>
> Though the error is not very helpful in diagnosing the problem, we can
> see that GDB has not crashed, but has given the user an error.
>
> And so, the supports_architecture_p method is created to return true
> by default, then I override this in inf_child_target, where I compare
> the architecture in question with the default_bfd_arch.
>
> Finally, I've added calls to supports_architecture_p for the
> run (which covers run, start, starti) and attach commands.
>
> You will notice a lack of tests for this change. I'm not sure of a
> good way that I can build a binary for a different architecture as
> part of a test, but if anyone has any ideas then I'll be happy to add
> a test here.
Have you considered multi-arch cases such as running i386 binaries on an x86-64
host or 32-bit arm binaries on an AArch64 host? Will we need to override this
method in certain targets (e.g. x86-linux-nat.c or x86-fbsd-nat.c) to support
such cases?
--
John Baldwin
More information about the Gdb-patches
mailing list