This is the mail archive of the glibc-bugs@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

[Bug libc/21092] New: mbsnrtowcs: *src is not left pointing to the invalid multibyte sequence when dest is NULL


https://sourceware.org/bugzilla/show_bug.cgi?id=21092

            Bug ID: 21092
           Summary: mbsnrtowcs: *src is not left pointing to the invalid
                    multibyte sequence when dest is NULL
           Product: glibc
           Version: 2.24
            Status: UNCONFIRMED
          Severity: normal
          Priority: P2
         Component: libc
          Assignee: unassigned at sourceware dot org
          Reporter: igor.liferenko at gmail dot com
                CC: drepper.fsp at gmail dot com
  Target Milestone: ---

The following examples demonstrate the difference when dest is NULL and dest
is not NULL.

According to this bugreport[1], there are two possible cases:

1) incomplete multibyte character is at the end of input buffer *)
2) incomplete multibyte character is not at the end of input buffer *)

*) - end of buffer is determined by nms argument.

The examples 1.1 and 2.1 demonstrate that when dest is NULL, *src is
not changed.
The examples 1.2 and 2.2 demonstrate that when dest is not NULL, *src is
correctly changed.

In the examples the following UTF-8 sequences are used:

\320     = incomplete
\321\215 = U+044D (CYRILLIC SMALL LETTER E)
\321\216 = U+044E (CYRILLIC SMALL LETTER YU)

Example 1.1 (at the end of buffer, dest is NULL):

    #include <locale.h>
    #include <wchar.h>
    #include <stdio.h>
    int main(void)
    {
      setlocale(LC_CTYPE, "en_US.UTF-8");
      char *s = "\321\216\320";
      const char *x = s;
      printf("status: %d\n", mbsnrtowcs(NULL,&x,3,0,NULL));
      perror(NULL);
      printf("ori=%p\nnew=%p\n",(void *)s,(void *)x);
      return 0;
    }

Output 1.1:

    status: 1
    Success
    ori=0x56337c86d910
    new=0x56337c86d910


Example 2.1 (not at the end of buffer, dest is NULL):

    #include <locale.h>
    #include <wchar.h>
    #include <stdio.h>
    int main(void)
    {
      setlocale(LC_CTYPE, "en_US.UTF-8");
      char *s = "\321\216\320\321\215";
      const char *x = s;
      printf("status: %d\n", mbsnrtowcs(NULL,&x,5,0,NULL));
      perror(NULL);
      printf("ori=%p\nnew=%p\n",(void *)s,(void *)x);
      return 0;
    }

Output 2.1:

    status: -1
    Invalid or incomplete multibyte or wide character
    ori=0x55ad82792910
    new=0x55ad82792910

Example 1.2 (at the end of buffer, dest is not NULL):

    #include <locale.h>
    #include <wchar.h>
    #include <stdio.h>
    int main(void)
    {
      setlocale(LC_CTYPE, "en_US.UTF-8");
      char *s = "\321\216\320";
      const char *x = s;
      wchar_t wcs[3];
      printf("status: %d\n", mbsnrtowcs(wcs,&x,3,3,NULL));
      perror(NULL);
      printf("ori=%p\nnew=%p\n",(void *)s,(void *)x);
      return 0;
    }

Output 1.2:

    status: 1
    Success
    ori=0x556497c29980
    new=0x556497c29983


Example 2.2 (not at the end of buffer, dest is not NULL):

    #include <locale.h>
    #include <wchar.h>
    #include <stdio.h>
    int main(void)
    {
      setlocale(LC_CTYPE, "en_US.UTF-8");
      char *s = "\321\216\320\321\215";
      const char *x = s;
      wchar_t wcs[5];
      printf("status: %d\n", mbsnrtowcs(wcs,&x,5,5,NULL));
      perror(NULL);
      printf("ori=%p\nnew=%p\n",(void *)s,(void *)x);
      return 0;
    }

Output 2.2:

    status: -1
    Invalid or incomplete multibyte or wide character
    ori=0x55bb1aa98980
    new=0x55bb1aa98982

-- 
You are receiving this mail because:
You are on the CC list for the bug.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]