This is the mail archive of the
mailing list for the GDB project.
Re: performance talk [Re: RFC: implement DW_OP_bit_piece]
>>>>> "Jan" == Jan Kratochvil <email@example.com> writes:
Tom> If you know of specific bad cases, I'm very interested in that.
Tom> I think we understand the symbol reading problem pretty well now, but
Tom> other stuff could at least use bug reports.
Jan> Checked now we were talking about `attach' being fast enough (~2s
Jan> on OOo.org) but I was testing it with
Jan> `archer-tromey-delayed-symfile2'. Then `bt full' is still slow -
Jan> tens of secs.
Yeah, this is known. See below.
At this point I'm much more interested in performance problems elsewhere
in gdb. This particular one is, while not "solved", at least very well
explored. A solution is on the horizon...
Here's a dump of the status:
This particular branch uses the aranges section to lazily read DWARF.
This makes "attach" very fast, because we don't hit the disk very much.
However, "bt" and "bt full" are still quite slow. If you want the bt
output for OO.o you can actually see it pause for a noticeable amount of
time at one particular point.
What is happening here is that at this point we need some particular
piece of debuginfo (I think a type, but I am not sure I looked into the
details). Since we don't have any sort of by-name index, we have to
read psymtabs and then search.
The fix for this is to add a by-name map. You might think we could use
debug_pubnames and debug_pubtypes, but those are broken beyond repair
for various reasons: historical gcc bugs make them unreliable, they
don't mention everything gdb needs, they aren't canonicalized, and
what's worse, it turns out that actually reading them in is too slow.
This is where the mmap()able debug index comes in. This approach yields
good attach times *and* good backtrace times, at the cost of having to
build a separate index per objfile. If you want to try this, it is all
on archer-tromey-optional-psymtab. I can tell you how to make the index
files if you do this.
What makes this fast is that the by-name index means we can just read
the CUs we care about. Reading any particular CU is fast enough that we
don't need anything smarter.
I am planning to propose this as an F14 feature: make index files for
all the debuginfo, and at the same time drop .debug_pub* (to save some
space). I figured I would delay submission to FSF gdb until we get this
underway; my view is that this is more of a "distro" feature (creating
the index files is just as slow as simply starting gdb) and there is no
point in pushing it if the distro isn't interested.
Another idea I implemented is to read psymtabs in the background. This
works pretty well: gdb starts some worker threads and reads in psymtabs
starting with the largest one (the smaller the debuginfo, the less
likely you are to notice the pause).
I haven't submitted this for two reasons.
First, it is not quite complete. It has a questionable BFD hack, and I
haven't yet made the complaint() system threadsafe.
Second, I assume it will be contentious and/or too ugly. It uses
__thread in various spots and it changes cp-name-parser.y to depend on
bison (by making it a pure parser), not to mention that threads are
I'd be happy to revisit this if people are interested. Note that it is
not as good as the index approach.
These two approaches are actually complementary. We could put them both
Finally, one or both of these branches has a little optimization so that
if you "gdb -readnow", we don't create psymtabs at all. This could go
in now; I just haven't bothered since I think -readnow is only ever
actually used to locate psymtab expansion bugs.