UTF-related question

Eliot Moss moss@cs.umass.edu
Tue Dec 15 18:52:00 GMT 2009


Following the guidelines related to cygwin 1.7, I have
generally been using LANG=en_US.UTF-8. But I found that
if I do "man <whatever>" to get a man page, and then
search (I have man's "more" program set to "less") for
a string having a dash in it, say to search for -a in the
rsync man page to find the description of that flag, it
fails to match.

This is because with that LANG setting, for a hyphen/dash
man produces a three-byte UTF sequence. The same happens
for LANG=C.UTF-8. The way I have found around it is to
use LANG=C.

Thoughts about this as a "problem"? Is it a problem? Is
there a better way?

Thanks -- Eliot
==============================================================================
J. Eliot B. Moss, Professor               http://www.cs.umass.edu/~moss    www
Director, Arch. and Lang. Impl. Lab.      +1-413-545-4206                voice
Department of Computer Science            +1-413-695-4226                 cell
140 Governor's Drive, Room 372            +1-413-545-1249                  fax
University of Massachusetts at Amherst    moss@cs.umass.edu              email
Amherst, MA  01003-9264  USA              +1-413-545-2746 Laurie Downey  sec'y
==============================================================================

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple



More information about the Cygwin mailing list