This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH v4] Use a proper C tokenizer to implement the obsolete typedefs test.
On 3/13/19 6:16 PM, Joseph Myers wrote:
> I'm seeing failures from build-many-glibcs.py for
> resource/check-obsolete-constructs:
>
> UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 3198: ordinal not in range(128)
>
> This is with LC_ALL=C (and bits/resource.h headers containing UTF-8 µ in a
> comment). Code opening text files that might not be pure ASCII needs to
> specify an encoding explicitly to avoid depending on the locale tests are
> run with. There is also a case that the encoding specified should be
> ASCII - that installed headers should be required to be pure ASCII so they
> can be included in source files with any ASCII-compatible character set if
> compiling with -finput-charset= (which affects included headers as well as
> the main source file, so compiling "#include <sys/resource.h>" with
> -finput-charset=ascii currently fails).
Do we have a requirement that #include <sys/resource.h> be compilable with
-finput-charset=ascii?
Or to put it another way, who decides which source files have to be ASCII
compatible?
Is the fix to fix bits/resource.h, or to make the Python code open the file
with UTF-8 support?
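For context, a minimal sketch of the encoding behavior Joseph describes (the
file path and comment text here are illustrative, not the actual
bits/resource.h): opening a text file in Python without an explicit encoding
uses the locale's preferred encoding, so under LC_ALL=C a UTF-8 byte such as
0xc2 raises UnicodeDecodeError, while passing encoding= makes the result
locale-independent:

```python
import os
import tempfile

# Write a header-like file containing a UTF-8 micro sign in a comment.
with tempfile.NamedTemporaryFile(mode="wb", suffix=".h", delete=False) as f:
    f.write("/* granularity is 1 \u00b5s */\n".encode("utf-8"))
    path = f.name

# Explicit encoding: the read no longer depends on the locale the
# tests happen to run with (the LC_ALL=C failure mode goes away).
with open(path, "r", encoding="utf-8") as f:
    text = f.read()

# Alternatively, requiring pure ASCII (the policy that would make
# headers safe for -finput-charset=ascii) makes non-ASCII fail loudly:
try:
    with open(path, "r", encoding="ascii") as f:
        f.read()
    ascii_clean = True
except UnicodeDecodeError:
    ascii_clean = False

os.remove(path)
```

Which open() call the test should use is exactly the policy question above:
encoding="utf-8" tolerates the µ, encoding="ascii" would flag it.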
--
Cheers,
Carlos.