This is the mail archive of the cygwin mailing list for the Cygwin project.
On Thursday, May 20, 2010, Jurriaan wrote:
> A very long sed script that's been working for ages (back from the 1.5 age) here has stopped working.
> It turned out sed doesn't like some strings anymore when environment variable LANG is empty. With LANG=ASCII, there are no problems.
> The actual text in the SED command is shown below as spaces, but it's a Swedish a with a small o on top of it, like this:
> sed -e"s/@a/ a/g;"
> where a is character 0xe5.
> Running with LANG=ASCII works, with LANG empty I get 'unterminated `s' command' from sed (which confused me for a while).

With empty LANG you're using the default UTF-8 encoding, where that 0xe5 byte constitutes an incomplete character. You need to either run with a LANG setting that fits your script, e.g. C.ISO-8859-1, or convert your script to UTF-8. I'm puzzled as to why LANG=ASCII would have worked, since that's not a valid setting.

With LANG=anything-unknown, the charmap is set to ASCII, so it works (as there is at least no multibyte character then).
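For anyone hitting the same error, here is a rough sketch of the two options mentioned above. It is not the original script (which isn't shown in the thread): the file names and the one-rule script are made up for illustration, and LC_ALL=C is used as a portable stand-in for a byte-oriented locale such as C.ISO-8859-1. It assumes iconv and od are installed.

```shell
#!/bin/sh
# Hypothetical scratch files; the real script from the thread is not available.
workdir=$(mktemp -d)
latin1_script="$workdir/latin1.sed"
utf8_script="$workdir/utf8.sed"

# Recreate a one-rule script holding the raw ISO-8859-1 byte 0xe5 (octal 345).
printf 's/@a/\345/g\n' > "$latin1_script"

# Option 1: convert the script to UTF-8 so it matches a UTF-8 locale.
iconv -f ISO-8859-1 -t UTF-8 "$latin1_script" > "$utf8_script"

# Dump both scripts as hex: the single byte e5 becomes the pair c3 a5.
latin1_hex=$(od -An -tx1 "$latin1_script" | tr -d ' \n')
utf8_hex=$(od -An -tx1 "$utf8_script" | tr -d ' \n')
echo "latin1 script: $latin1_hex"
echo "utf8 script:   $utf8_hex"

# Option 2: keep the script as it is and run sed in a single-byte locale.
# LC_ALL=C is an assumption here, not the setting suggested in the thread.
out_hex=$(printf '@a\n' | LC_ALL=C sed -f "$latin1_script" | od -An -tx1 | tr -d ' \n')
echo "output bytes:  $out_hex"

rm -rf "$workdir"
```

Note that after option 1 the output of the script is UTF-8 as well, so anything downstream that expects ISO-8859-1 would need the same treatment.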