This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Re: [PATCH] copy_file_range: New function to copy file data

From: Adhemerval Zanella <adhemerval dot zanella at linaro dot org>
To: Florian Weimer <fweimer at redhat dot com>
Cc: libc-alpha at sourceware dot org, "Joseph S. Myers" <joseph at codesourcery dot com>
Date: Thu, 21 Dec 2017 17:04:34 -0200
Subject: Re: [PATCH] copy_file_range: New function to copy file data
Authentication-results: sourceware.org; auth=none
References: <20171117145604.655D24469B999@oldenburg.str.redhat.com> <16de12bf-eab1-b69c-cd58-a6dd24ad409f@redhat.com> <eeee1a5f-63c5-4379-1fd4-7f0319d28cd4@linaro.org> <82ae459f-90d5-8fc2-6b04-64a6af8dfa26@redhat.com> <c527b61c-3306-6770-e437-99fe0bf58b2d@linaro.org> <0e830a6c-d49f-08a0-42a3-a6dc4e07afd5@redhat.com> <2a35f2e6-9a5c-c477-cb5e-096d96fff635@linaro.org> <a19b1982-f153-adf9-de0d-dee1f2e7da95@redhat.com>


On 21/12/2017 16:07, Florian Weimer wrote:
> On 12/21/2017 06:04 PM, Adhemerval Zanella wrote:
> 
>>> +  struct stat64 instat;
>>> +  struct stat64 outstat;
>>> +  if (fstat64 (infd, &instat) != 0 || fstat64 (outfd, &outstat) != 0)
>>> +    return -1;
>>> +  if (S_ISDIR (instat.st_mode) || S_ISDIR (outstat.st_mode))
>>> +    {
>>> +      __set_errno (EISDIR);
>>> +      return -1;
>>> +    }
>>
>> To follow the pattern you can put 'instat' and 'outstat' in their own scope.
> 
> Agreed.
> 
>>> +      if (read_count < 0)
>>> +        {
>>> +          if (copied > 0)
>>> +            /* Report the number of bytes copied so far.  */
>>> +            return copied;
>>> +          return -1;
>>> +        }
>>> +      if (pinoff != 0)
>>> +        *pinoff += read_count;
>>
>> pinoff != NULL.
> 
> Oh, right.
> 
>>> +
>>> +      /* Write the buffer part which was read to the destination.  */
>>> +      char *end = buf + read_count;
>>> +      for (char *p = buf; p < end; )
>>> +        {
>>> +          ssize_t write_count;
>>> +          if (poutoff == NULL)
>>> +            write_count = write (outfd, p, end - p);
>>> +          else
>>> +            write_count = __libc_pwrite64 (outfd, p, end - p, *poutoff);
>>> +          if (write_count < 0)
>>> +            {
>>> +              /* Adjust the input read position to match what we have
>>> +                 written, so that the caller can pick up after the
>>> +                 error.  */
>>> +              size_t written = p - buf;
>>> +              /* NB: This needs to be signed so that we can form the
>>> +                 negative value below.  */
>>> +              ssize_t overread = read_count - written;
>>> +              if (pinoff == NULL)
>>> +                {
>>> +                  if (overread > 0)
>>> +                    {
>>> +                      /* We are on an error recovery path, so we
>>> +                         cannot deal with failure here.  */
>>> +                      int save_errno = errno;
>>> +                      (void) __libc_lseek64 (infd, -overread, SEEK_CUR);
>>> +                      __set_errno (save_errno);
>>
>> Should we really handle errors here? According to the current man pages,
>> EBADF, ENXIO, and ESPIPE can't really happen because of the previous checks.
>> EINVAL and EOVERFLOW, due to the resulting file offset being negative or
>> beyond the end of a seekable device, are also unlikely because we are using
>> the result of a previous partial write to calculate the required offset.
>> I am not sure it can really fail here.
> 
> Theoretically, I assume that with enough memory pressure, the seek might have to re-read on-disk data structures, and then anything can happen. This is why I don't want to assert on the error.  Perhaps more likely is a file descriptor race condition which closes the descriptor under us, but then the application is screwed anyway.
> 
> We cannot report the error in all cases because with a partial write, we need to report the number of written bytes (because that effect has already happened and is visible by other means).
> 
> So I think the code is okay as it is now, all things considered.

I think the former would hit the OOM scenario, where the kernel kills a
process more or less at random (assuming that is still what Linux does), so
the process either continues execution or is killed.  Anyway, I think I am
over-engineering things here, so your approach should be OK.
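
For illustration only, the recovery path discussed above can be sketched
as standalone helpers.  The names rewind_overread and result_after_error
are hypothetical and do not appear in the patch; plain lseek and errno
stand in for the glibc-internal __libc_lseek64 and __set_errno, so this
is a rough sketch of the idea rather than the code under review.

```c
#include <assert.h>
#include <errno.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper (not in the patch).  Call it after a failed
   write when READ_COUNT bytes were consumed from INFD with read, but
   only WRITTEN of them reached the output.  It moves the input file
   position back by the difference so the caller can resume there.  */
static void
rewind_overread (int infd, size_t read_count, size_t written)
{
  assert (written <= read_count);
  /* Signed, so that the negative seek offset can be formed.  */
  off_t overread = (off_t) (read_count - written);
  if (overread > 0)
    {
      /* Best effort only: a failing lseek must not replace the errno
         value left behind by the failed write.  */
      int save_errno = errno;
      (void) lseek (infd, -overread, SEEK_CUR);
      errno = save_errno;
    }
}

/* Hypothetical helper (not in the patch).  Bytes that were already
   copied take precedence over the error, because their effect is
   visible to other observers; -1 is returned only if nothing was
   copied.  */
static ssize_t
result_after_error (size_t copied)
{
  return copied > 0 ? (ssize_t) copied : -1;
}
```

With a pipe as the input descriptor the lseek call fails with ESPIPE,
and the errno value from the earlier write failure is still what the
caller sees afterwards.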

