This is the mail archive of the gdb-patches@sourceware.org mailing list for the GDB project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] DWARF 5 support: Handle line table and file indexes

From: Simon Marchi <simon dot marchi at polymtl dot ca>
To: Ali Tamur <tamur at google dot com>
Cc: gdb-patches at sourceware dot org
Date: Wed, 09 Oct 2019 23:14:17 -0400
Subject: Re: [PATCH] DWARF 5 support: Handle line table and file indexes
Dkim-filter: OpenDKIM Filter v2.11.0 smtp.polymtl.ca x9A3EKQ0007821
References: <20191002015214.219674-1-tamur@google.com>

Hi Ali,

Thanks for the patch.  I noted a few minor comments below.

On 2019-10-01 21:52, Ali Tamur via gdb-patches wrote:

[Sorry, I forgot to add gdb/ChangeLog, resending]
* Fix handling of file and directory indexes in line tables; in DWARF5 theindexes are zero-based. Make file_names field private to abstract thisdetailfrom the clients. Introduce a new file_names_size method. Reflect thesechanges
in clients.

I think it's a good idea to encapsulate this. Should include_dirs beprivate as well, since it is in the same situation?

*  Pass whether the file index is zero or one based to a few methods.

Hmm, I am not a big fan of that, I had a hard time understanding whythat was necessary. From what I understand now, it's only necessary sothat some code that is agnostic to DWARF 4 vs 5 can iterate on all fileentries.

Instead, maybe we could have the line_header object provide a way forclients to iterate on file entries, without exposing the vector. Maybea range adapter that would let clients do:


for (const file_entry &fe : lh->file_names ())
  {
    ...
  }

Hopefully, that is enough for all cases that pass is_zero_based = truein your patch.

* Handle DW_FORM_data16 in read_formatted_entries; it is used torecord MD5

of the file entries in DWARF 5.

* Fix a bug in line header parsing that calculates the length of theheader

incorrectly. (Seemingly this manifests itself only in DWARF 5).

* Introduce a new method, is_valid_file_index that takes into accountwhether

it is DWARF 5. Use it in related clients.

Tested with CC=/usr/bin/gcc (version 8.3.0) against master branch (alsowith-gsplit-dwarf and -gdwarf-4 flags) and there was no increase in the setoftests that fails. (gdb still cannot debug a 'hello world' program withDWARF 5,

so for the time being, this is all we care about).

This is part of an effort to support DWARF 5 in gdb.

gdb/ChangeLog:

	* dwarf2read.c (dir_index): Change type.
	(file_name_index): Likewise.
	(line_header::include_dir_at): Change comment and implementation on
	whether it is DWARF 5.
	(line_header::file_name_at): Change comment, API and implementation.
	(line_header::file_names_size): New method.
	(line_header::file_names): Change to private field.
	(file_full_name): Change API.

(dw2_get_file_names_reader): Initialize loval variable. Use newmethods.

	(dwarf2_cu::setup_type_unit_groups): Change implementation and use new
	methods.
	(process_structure_scope): Reflect API change.
	(line_header::add_file_name): Likewise.
	(read_formatted_entries): Handle DW_FORM_data16.
	(dwarf_decode_line_header): Fix line header length calculation.
	(psymtab_include_file_name): Reflect API change.
	(lnp_state_machine::current_file): Likewise.
	(lnp_state_machine::m_file): Update comment.
	(lnp_state_machine::record_line): Reflect type change.
	(dwarf_decode_lines): Reflect API change.
	(new_symbol): Likewise.
	(is_valid_file_index): New function.

(file_file_name): Change API and implementation to take care of DWARF5.

	(file_full_name): Likewise.
	(macro_start_file): Reflect API change.
---
 gdb/dwarf2read.c | 161 +++++++++++++++++++++++++++--------------------
 1 file changed, 94 insertions(+), 67 deletions(-)

diff --git a/gdb/dwarf2read.c b/gdb/dwarf2read.c
index 53e7393a7c..c474bdcd8b 100644
--- a/gdb/dwarf2read.c
+++ b/gdb/dwarf2read.c
@@ -922,13 +922,13 @@ typedef void (die_reader_func_ftype) (const
struct die_reader_specs *reader,
 				      int has_children,
 				      void *data);

-/* A 1-based directory index.  This is a strong typedef to prevent
-   accidentally using a directory index as a 0-based index into an
-   array/vector.  */
-enum class dir_index : unsigned int {};

+/* dir_index is 1-based in DWARF 4 and before, and is 0-based in DWARF5 and

+   later.  */
+typedef int dir_index;

-/* Likewise, a 1-based file name index.  */
-enum class file_name_index : unsigned int {};

+/* file_name_index is 1-based in DWARF 4 and before, and is 0-based inDWARF 5

+   and later.  */
+typedef int file_name_index;

Is there a reason you changed these? The reason why these were of enumclass type (or "strong typedef") still seems valid to me. We don't wantsome code accessing the vectors blindy using these indices, so itencourages people to use the right accessor method instead.


 struct file_entry
 {
@@ -980,26 +980,30 @@ struct line_header
   void add_file_name (const char *name, dir_index d_index,
 		      unsigned int mod_time, unsigned int length);

-  /* Return the include dir at INDEX (1-based).  Returns NULL if INDEX
-     is out of bounds.  */

+ /* Return the include dir at INDEX (0-based in DWARF 5 and 1-basedbefore).

+     Returns NULL if INDEX is out of bounds.  */
   const char *include_dir_at (dir_index index) const
   {
-    /* Convert directory index number (1-based) to vector index
-       (0-based).  */
-    size_t vec_index = to_underlying (index) - 1;
+    size_t vec_index;
+    if (version <= 4)
+      vec_index = index - 1;
+    else
+      vec_index = index;

     if (vec_index >= include_dirs.size ())
       return NULL;
     return include_dirs[vec_index];
   }

-  /* Return the file name at INDEX (1-based).  Returns NULL if INDEX
-     is out of bounds.  */
-  file_entry *file_name_at (file_name_index index)

+ /* Return the file name at INDEX (0-based in DWARF 5 and 1-basedbefore).

+     Returns NULL if INDEX is out of bounds.  */

+ file_entry *file_name_at (file_name_index index, boolis_zero_indexed)

   {
-    /* Convert file name index number (1-based) to vector index
-       (0-based).  */
-    size_t vec_index = to_underlying (index) - 1;
+    size_t vec_index;
+    if (is_zero_indexed || version >= 5)
+        vec_index = index;


Remove one indentation level.

+    else
+      vec_index = index - 1;

     if (vec_index >= file_names.size ())
       return NULL;
@@ -1032,12 +1036,20 @@ struct line_header
      pointers.  The memory is owned by debug_line_buffer.  */
   std::vector<const char *> include_dirs;

-  /* The file_names table.  */
-  std::vector<file_entry> file_names;
+  int file_names_size () {
+    return file_names.size();
+  }


Format it like this:

  int file_names_size ()
  {
    return file_names.size();
  }

or we also accept this for short/trivial methods:

  int file_names_size ()
  { return file_names.size(); }


   /* The start and end of the statement program following this
      header.  These point into dwarf2_per_objfile->line_buffer.  */

const gdb_byte *statement_program_start {}, *statement_program_end{};

+
+ private:
+  /* The file_names table. This is private because the meaning of
indexes differ


differ -> differs

+ among DWARF versions (The first valid index is 1 in DWARF 4 andbefore,+ and is 0 in DWARF 5 and later). So the client should usefile_name_at


Two spaces after period.

+     method for access.  */
+  std::vector<file_entry> file_names;

You can rename this to m_file_names (that's the style we use for privatemembers).

@@ -3638,7 +3650,7 @@ dw2_get_file_names_reader (const struct
die_reader_specs *reader,
   struct objfile *objfile = dwarf2_per_objfile->objfile;
   struct dwarf2_per_cu_data *lh_cu;
   struct attribute *attr;
-  int i;
+  int i = 0;

Unnecessary change? In fact, I'd take the opportunity to move thatdeclaration in the for loop that uses it.

@@ -20629,18 +20648,18 @@ dwarf_decode_line_header (sect_offset
sect_off, struct dwarf2_cu *cu)

Returns NULL if FILE_INDEX should be ignored, i.e., it ispst->filename. */


 static const char *

-psymtab_include_file_name (const struct line_header *lh, intfile_index,

+psymtab_include_file_name (struct line_header *lh, int file_index,
 			   const struct partial_symtab *pst,
 			   const char *comp_dir,
 			   gdb::unique_xmalloc_ptr<char> *name_holder)
 {
-  const file_entry &fe = lh->file_names[file_index];
-  const char *include_name = fe.name;
+  file_entry *fe = lh->file_name_at (file_index, true);

Hmm it's not great to have to lose the const here. Can you try adding aconst version of file_name_at, that returns a const file_entry *?

@@ -24154,22 +24173,29 @@ dwarf_alloc_die (struct dwarf2_cu *cu, intnum_attrs)*LH's file name table. The result is allocated using xmalloc; thecaller is
    responsible for freeing it.  */

The comment above applies to the `file_file_name` function, so it shouldstay right on top of it. Also, can you please document the newIS_ZERO_INDEXED parameter?


+bool is_valid_file_index (int file_index, struct line_header *lh,

This function should be static. Also, place the return type on its ownline:


static bool
is_valid_file_index (...

Simon

Follow-Ups:
- [PATCH] DWARF 5 support: Handle line table and file indexes
  - From: Ali Tamur via gdb-patches

References:
- [PATCH] DWARF 5 support: Handle line table and file indexes
  - From: Ali Tamur via gdb-patches

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]