This is the mail archive of the gdb-patches@sourceware.org mailing list for the GDB project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] Expect SI_KERNEL si_code for a MIPS software breakpoint trap

From: Luis Machado <lgustavo at codesourcery dot com>
To: "Maciej W. Rozycki" <macro at imgtec dot com>
Cc: "Maciej W. Rozycki" <macro at linux-mips dot org>, <gdb-patches at sourceware dot org>, <linux-mips at linux-mips dot org>
Date: Wed, 10 Feb 2016 10:52:47 -0200
Subject: Re: [PATCH] Expect SI_KERNEL si_code for a MIPS software breakpoint trap
Authentication-results: sourceware.org; auth=none
References: <1442592647-3051-1-git-send-email-lgustavo at codesourcery dot com> <alpine dot LFD dot 2 dot 20 dot 1509181729100 dot 10647 at eddie dot linux-mips dot org> <56B9F7E6 dot 5010006 at codesourcery dot com> <alpine dot DEB dot 2 dot 00 dot 1602092020150 dot 15885 at tp dot orcam dot me dot uk>
Reply-to: Luis Machado <lgustavo at codesourcery dot com>

On 02/09/2016 06:55 PM, Maciej W. Rozycki wrote:

On Tue, 9 Feb 2016, Luis Machado wrote:

I'm finally getting back to this. Sorry for the delay.


  No problem, thanks for keeping a track of it.

   I'm not convinced however that it is safe to assume all SIGTRAPs come
from breakpoints -- this signal is sent by the kernel for both BREAK and
trap (multiple mnemonics, e.g. TEQ, TGEI, etc.) instructions which may
have been placed throughout code for some reason, for example to serve as
cheap assertion checks.

   Is there a separate check made afterwards like `bpstat_explains_signal'
to validate the source of the signal here?


In my specific example we are dealing with two breakpoint hits by different
threads. The first one is reported just fine and the one we have problems with
is the queued one that is reported afterwards when we attempt to move the
other thread.

We do rely on bpstat_explains_signal when gdbserver/the remote target notifies
gdb about a stop. In the case of a queued breakpoint hit, it doesn't even get
reported back to gdb and is just ignored by gdbserver if it is recognized as a
breakpoint hit.


  I'm not sure I understand what's going on here, why is a breakpoint hit
required to be ignored by gdbserver?

  From what you say I infer GDB controls a multi-threaded program and there
it sets a software breakpoint which, by its nature, is global to all the
threads.  Then multiple threads hit the breakpoint simultaneously (or
nearly simultaneously) and the hits are delivered to GDB one by one, via
gdbserver.  So why is GDB not prepared for that?

  It set the breakpoint itself so it should expect it to hit sometime, and
if there are multiple threads, then there can be multiple hits at once or
almost at once because, even in the all-stop mode, there is no guaranteed
way to stop the threads all at once.  The threads may be spread across
different processors in an SMP system for example, all trapping literally
at once -- and then the kernel queueing the signals according to its own
internal schedule before delivering them to the debugger (that, from the
kernel's point of view, being gdbserver in this case).

I went through the data again and i was partially mistaken about theabove. Here's a detailed description of the events that take place inthe reported situation.

1 - A breakpoint is inserted by GDB at some code that is executed bymultiple threads.2 - Two threads, let us call them 1 and 2, happen to hit the samesoftware breakpoint, so both SIGTRAP's get sent by the kernel andgdbserver picks one of them to process.3 - gdbserver figures out this is from a breakpoint hit, since it knowsthere is a breakpoint inserted at that PC, and sends a swbreak stopreply back to gdb.4 - GDB gets the swbreak stop reply and notifies the user about thebreakpoint hit for thread 1, displays the frame information etc.

Now, the user goes and deletes that specific breakpoint that triggeredthe previous event and switches the context to thread 2 and thenattempts to continue execution.

5 - In gdbserver, thread 2 still has a pending SIGTRAP that was not yethandled.6 - gdbserver proceeds to handle it, sees it is a SIGTRAP but cannot mapthat event back to a software breakpoint hit due to the removal of suchbreakpoint and because gdbserver doesn't expect SI_KERNEL to mean"software breakpoint hit".7 - gdbserver then assumes this is a generic trap and reports it as suchto GDB, in a new stop reply.8 - GDB receives the stop reply and displays a generic trace/breakpointSIGTRAP since it also cannot map the trap back to a software breakpoint,i.e. bpstat_explains_signal returns 0 and random_signal is non-zero.

Patching gdbserver to recognize a si_code of SI_KERNEL as a softwarebreakpoint trap causes changes starting from 6.

6 - gdbserver proceeds to handle it, sees it is a SIGTRAP and that itssi_code is SI_KERNEL, meaning a software breakpoint hit now, even thoughwe can't recognize a breakpoint hit by checking for an underlyingbreakpoint instruction.

7 - gdbserver sends a swbreak stop reply back to GDB.

8 - GDB receives the stop reply and notices it is a delayed breakpointhit. According to its logic, GDB discards this as useless and theprogram continues its execution properly. Here random_signal is non-zeroand target_stopped_by_sw_breakpoint () returns 1 (because gdbserver toldGDB so).

The problem of forcing gdbserver to recognize all traps withsi_code==SI_KERNEL is that even hardcoded traps will be reported back toGDB as a swbreak event, which is not ideal.

But currently there is no easy way to tell a software breakpoint hit anda hardcoded trap (and maybe even a hardware breakpoint hit?) apart.

With the proposed change, even the hardcoded traps will initially be
recognized as breakpoints, albeit maybe recognized as permanent breakpoints
for some opcodes. It may cause gdbserver to ignore a second hardcoded trap
hit, which is not desirable.


  So why does gdbserver have to be taught which breakpoints may have
potentially been set by GDB and which may have not?  Why not to deliver
them all and leave it up to GDB to decide?  I believe it will be the right
thing to let GDB know that more than one thread has hit the same
breakpoint.

  Did I miss anything?  How is this situation handled in a native debug
scenario?

I take it native debugging will display the same sympthoms because thedefinitions of si_code are shared between GDB and gdbserver, fromnat/linux-ptrace.h. Native also uses a similar function to check forbreakpoint hits, namely gdb/linux-nat.c:check_stopped_by_breakpoint. Idid not test this with a native debugger though.

   Perhaps we should make it a part of the ABI and teach MIPS/Linux about
the breakpoint encoding used by GDB, which is `BREAK 5' (aka BRK_SSTEPBP
in kernel-speak, a misnomer I'm afraid), and make it set `si_code' to
TRAP_BRKPT, as expected.  This won't fix history of course, but at least
it will make debugging a little bit easier to handle in the future.
Cc-ing `linux-mips' for further input.


This is the best solution in my opinion and will definitely make the
debugger's life easier if it can tell the difference between multiple
seemingly equivalent SIGTRAP's.

Does this involve handling BRK_SSTEPBP inside
arch/mips/kernel/traps.c:do_trap_or_bp?


  No, as I noted in my reply to David elsewhere in this thread, this would
have to be in `do_bp' instead, to exclude trap instructions (e.g. TEQ,
etc.) from being treated as breakpoints.  I can implement this change
myself for you, but we need to agree first what the right solution for GDB
is.  So far it looks to me we'd be only papering over a problem elsewhere.


Hopefully the above makes what we're facing more clear.

Follow-Ups:
- Re: [PATCH] Expect SI_KERNEL si_code for a MIPS software breakpoint trap
  - From: Maciej W. Rozycki
- Re: [PATCH] Expect SI_KERNEL si_code for a MIPS software breakpoint trap
  - From: Pedro Alves

References:
- Re: [PATCH] Expect SI_KERNEL si_code for a MIPS software breakpoint trap
  - From: Luis Machado
- Re: [PATCH] Expect SI_KERNEL si_code for a MIPS software breakpoint trap
  - From: Maciej W. Rozycki

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]