This is the mail archive of the
mailing list for the Cygwin project.
Re: [1.7] UTF-8, find vs. tar
- From: Corinna Vinschen <corinna-cygwin at cygwin dot com>
- To: cygwin at cygwin dot com
- Date: Fri, 25 Sep 2009 11:15:25 +0200
- Subject: Re: [1.7] UTF-8, find vs. tar
- References: <4ABC3BB3.firstname.lastname@example.org>
- Reply-to: cygwin at cygwin dot com
On Sep 24 22:40, Yaakov S wrote:
> I'm having some difficulty with a package containing a file with a UTF-8
> character in its name:
> wget http://downloads.sourceforge.net/klavaro/klavaro-1.3.1.tar.bz2
> tar jxf klavaro-1.3.1.tar.bz2
> cd klavaro-1.3.1
> tar jcf TEST.tar.bz2 data/dvorak_fr*
> tar jtf TEST.tar.bz2 > tmptar.out
> find data/ -name 'dvorak_fr*' > tmpfind.out
> diff -u tmpfind.out tmptar.out
> The character in question is 'é' (aka U+00E9, small e with acute).
> The difference in rendering is throwing cygport off at the "checking
> packages for missing/duplicate files" stage.
> What do I need to do to get these to match?
Nothing but wait. The reason that tar doesn't print the characters
while find does is probably that find calls setlocale and tar
doesn't. I hope to get this fixed in the next couple of days.
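
To illustrate the suspected mechanism, here is a minimal Python sketch. It is
not the actual code of tar or find. It assumes that a program which never
calls setlocale stays in the "C" locale and so treats every byte outside
printable ASCII as unprintable, emitting it as an octal escape. The function
name render_name and the file name used below are hypothetical.

```python
def render_name(raw: bytes, locale_aware: bool) -> str:
    """Render a file name the way a listing tool might print it.

    locale_aware=True models a tool that called setlocale() in a UTF-8
    locale: the bytes are decoded and printed as characters.
    locale_aware=False models a tool left in the "C" locale: any byte
    outside printable ASCII is written as a backslash octal escape.
    This is a simplification, not the real quoting logic of any tool.
    """
    if locale_aware:
        return raw.decode("utf-8", errors="replace")
    out = []
    for b in raw:
        if 0x20 <= b < 0x7F:
            out.append(chr(b))        # printable ASCII passes through
        else:
            out.append("\\%03o" % b)  # e.g. 0xC3 becomes \303
    return "".join(out)


# Hypothetical file name containing U+00E9, encoded as UTF-8.
name = "data/dvorak_fr_\u00e9.kbd".encode("utf-8")

print(render_name(name, locale_aware=False))
# The two renderings differ, which is what a diff of the listings would show.
print(render_name(name, locale_aware=True) != render_name(name, locale_aware=False))
```

Under these assumptions the two listings differ only in how the two UTF-8
bytes of U+00E9 are rendered, which would be enough to make a line-by-line
comparison fail.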
We're discussing the entire locale stuff on the cygwin-developers
list right now, see the threads starting at
My current locally patched DLL doesn't have that problem anymore,
so we're hopefully on the right track.
Corinna Vinschen                  Please, send mails regarding Cygwin to
Cygwin Project Co-Leader          cygwin AT cygwin DOT com