1 @node File System Interface, Pipes and FIFOs, Low-Level I/O, Top
2 @c %MENU% Functions for manipulating files
3 @chapter File System Interface
5 This chapter describes @theglibc{}'s functions for manipulating
6 files. Unlike the input and output functions (@pxref{I/O on Streams};
7 @pxref{Low-Level I/O}), these functions are concerned with operating
8 on the files themselves rather than on their contents.
10 Among the facilities described in this chapter are functions for
11 examining or modifying directories, functions for renaming and deleting
12 files, and functions for examining and setting file attributes such as
13 access permissions and modification times.
16 * Working Directory:: This is used to resolve relative
18 * Descriptor-Relative Access:: Ways to control file name lookup.
19 * Accessing Directories:: Finding out what files a directory
21 * Working with Directory Trees:: Apply actions to all files or a selectable
22 subset of a directory hierarchy.
23 * Hard Links:: Adding alternate names to a file.
24 * Symbolic Links:: A file that ``points to'' a file name.
25 * Deleting Files:: How to delete a file, and what that means.
26 * Renaming Files:: Changing a file's name.
27 * Creating Directories:: A system call just for creating a directory.
28 * File Attributes:: Attributes of individual files.
29 * Making Special Files:: How to create special files.
30 * Temporary Files:: Naming and creating temporary files.
33 @node Working Directory
34 @section Working Directory
36 @cindex current working directory
37 @cindex working directory
38 @cindex change working directory
39 Each process has associated with it a directory, called its @dfn{current
40 working directory} or simply @dfn{working directory}, that is used in
41 the resolution of relative file names (@pxref{File Name Resolution}).
43 When you log in and begin a new session, your working directory is
44 initially set to the home directory associated with your login account
45 in the system user database. You can find any user's home directory
46 using the @code{getpwuid} or @code{getpwnam} functions; see @ref{User
49 Users can change the working directory using shell commands like
50 @code{cd}. The functions described in this section are the primitives
51 used by those commands and by other programs for examining and changing
52 the working directory.
55 Prototypes for these functions are declared in the header file
59 @deftypefun {char *} getcwd (char *@var{buffer}, size_t @var{size})
60 @standards{POSIX.1, unistd.h}
61 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
62 @c If buffer is NULL, this function calls malloc and realloc, and, in
63 @c case of error, free. Linux offers a getcwd syscall that we use on
64 @c GNU/Linux systems, but it may fail if the pathname is too long. As a
65 @c fallback, and on other systems, the generic implementation opens each
66 @c parent directory with opendir, which allocates memory for the
67 @c directory stream with malloc. If a fstatat64 syscall is not
68 @c available, very deep directory trees may also have to malloc to build
69 @c longer sequences of ../../../... than those supported by a global
70 @c const read-only string.
74 @c malloc/realloc/free if buffer is NULL, or if dir is too deep
75 @c lstat64 -> see its own entry
77 @c direct syscall if possible, alloca+snprintf+*stat64 otherwise
78 @c openat64_not_cancel_3, close_not_cancel_no_status
79 @c __fdopendir, __opendir, __readdir, rewinddir
80 The @code{getcwd} function returns an absolute file name representing
81 the current working directory, storing it in the character array
82 @var{buffer} that you provide. The @var{size} argument is how you tell
83 the system the allocation size of @var{buffer}.
85 The @glibcadj{} version of this function also permits you to specify a
86 null pointer for the @var{buffer} argument. Then @code{getcwd}
87 allocates a buffer automatically, as with @code{malloc}
88 (@pxref{Unconstrained Allocation}). If the @var{size} is greater than
89 zero, then the buffer is that large; otherwise, the buffer is as large
90 as necessary to hold the result.
92 The return value is @var{buffer} on success and a null pointer on failure.
93 The following @code{errno} error conditions are defined for this function:
97 The @var{size} argument is zero and @var{buffer} is not a null pointer.
100 The @var{size} argument is less than the length of the working directory
101 name. You need to allocate a bigger array and try again.
104 Permission to read or search a component of the file name was denied.
108 You could implement the behavior of GNU's @w{@code{getcwd (NULL, 0)}}
109 using only the standard behavior of @code{getcwd}:
119 char *buffer = (char *) xmalloc (size);
120 if (getcwd (buffer, size) == buffer)
131 @xref{Malloc Examples}, for information about @code{xmalloc}, which is
132 not a library function but is a customary name used in most GNU
135 @deftypefn {Deprecated Function} {char *} getwd (char *@var{buffer})
136 @standards{BSD, unistd.h}
137 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @ascuintl{}}@acunsafe{@acsmem{} @acsfd{}}}
138 @c Besides the getcwd safety issues, it calls strerror_r on error, which
139 @c brings in all of the i18n issues.
140 This is similar to @code{getcwd}, but has no way to specify the size of
141 the buffer. @Theglibc{} provides @code{getwd} only
142 for backwards compatibility with BSD.
144 The @var{buffer} argument should be a pointer to an array at least
145 @code{PATH_MAX} bytes long (@pxref{Limits for Files}). On @gnuhurdsystems{}
146 there is no limit to the size of a file name, so this is not
147 necessarily enough space to contain the directory name. That is why
148 this function is deprecated.
152 @deftypefun {char *} get_current_dir_name (void)
153 @standards{GNU, unistd.h}
154 @safety{@prelim{}@mtsafe{@mtsenv{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
155 @c Besides getcwd, which this function calls as a fallback, it calls
156 @c getenv, with the potential thread-safety issues that brings about.
157 The @code{get_current_dir_name} function is basically equivalent to
158 @w{@code{getcwd (NULL, 0)}}, except the value of the @env{PWD}
159 environment variable is first examined, and if it does in fact
160 correspond to the current directory, that value is returned. This is
161 a subtle difference which is visible if the path described by the
162 value in @env{PWD} is using one or more symbolic links, in which case
163 the value returned by @code{getcwd} would resolve the symbolic links
164 and therefore yield a different result.
166 This function is a GNU extension.
169 @deftypefun int chdir (const char *@var{filename})
170 @standards{POSIX.1, unistd.h}
171 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
172 This function is used to set the process's working directory to
175 The normal, successful return value from @code{chdir} is @code{0}. A
176 value of @code{-1} is returned to indicate an error. The @code{errno}
177 error conditions defined for this function are the usual file name
178 syntax errors (@pxref{File Name Errors}), plus @code{ENOTDIR} if the
179 file @var{filename} is not a directory.
182 @deftypefun int fchdir (int @var{filedes})
183 @standards{XPG, unistd.h}
184 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
185 This function is used to set the process's working directory to
186 directory associated with the file descriptor @var{filedes}.
188 The normal, successful return value from @code{fchdir} is @code{0}. A
189 value of @code{-1} is returned to indicate an error. The following
190 @code{errno} error conditions are defined for this function:
194 Read permission is denied for the directory named by @code{dirname}.
197 The @var{filedes} argument is not a valid file descriptor.
200 The file descriptor @var{filedes} is not associated with a directory.
203 The function call was interrupt by a signal.
206 An I/O error occurred.
210 @node Descriptor-Relative Access
211 @section Descriptor-Relative Access
212 @cindex file name resolution based on descriptors
213 @cindex descriptor-based file name resolution
214 @cindex @code{@dots{}at} functions
216 Many functions that accept file names have @code{@dots{}at} variants
217 which accept a file descriptor and a file name argument instead of just
218 a file name argument. For example, @code{fstatat} is the
219 descriptor-based variant of the @code{fstat} function. Most such
220 functions also accept an additional flags argument which changes the
221 behavior of the file name lookup based on the passed @code{AT_@dots{}}
224 There are several reasons to use descriptor-relative access:
228 The working directory is a process-wide resource, so individual threads
229 cannot change it without affecting other threads in the process.
230 Explicitly specifying the directory against which relative paths are
231 resolved can be a thread-safe alternative to changing the working
235 If a program wishes to access a directory tree which is being modified
236 concurrently, perhaps even by a different user on the system, the
237 program must avoid looking up file names with multiple components, in
238 order to detect symbolic links, using the @code{O_NOFOLLOW} flag
239 (@pxref{Open-time Flags}) or the @code{AT_SYMLINK_FOLLOW} flag
240 (described below). Without directory-relative access, it is necessary
241 to use the @code{fchdir} function to change the working directory
242 (@pxref{Working Directory}), which is not thread-safe.
245 Listing directory contents using the @code{readdir} or @code{readdir64}
246 functions (@pxref{Reading/Closing Directory}) does not provide full file
247 name paths. Using @code{@dots{}at} functions, it is possible to use the
248 file names directly, without having to construct such full paths.
251 Additional flags available with some of the @code{@dots{}at} functions
252 provide access to functionality which is not available otherwise.
255 The file descriptor used by these @code{@dots{}at} functions has the
260 It can be a file descriptor referring to a directory. Such a descriptor
261 can be created explicitly using the @code{open} function and the
262 @code{O_RDONLY} file access mode, with or without the @code{O_DIRECTORY}
263 flag. @xref{Opening and Closing Files}. Or it can be created
264 implicitly by @code{opendir} and retrieved using the @code{dirfd}
265 function. @xref{Opening a Directory}.
267 If a directory descriptor is used with one of the @code{@dots{}at}
268 functions, a relative file name argument is resolved relative to
269 directory referred to by the file descriptor, just as if that directory
270 were the current working directory. Absolute file name arguments
271 (starting with @samp{/}) are resolved against the file system root, and
272 the descriptor argument is effectively ignored.
274 This means that file name lookup is not constrained to the directory of
275 the descriptor. For example, it is possible to access a file
276 @file{example} in the descriptor's parent directory using a file name
277 argument @code{"../example"}, or in the root directory using
280 If the file descriptor refers to a directory, the empty string @code{""}
281 is not a valid file name argument. It is possible to use @code{"."} to
282 refer to the directory itself. Also see @code{AT_EMPTY_PATH} below.
285 @vindex @code{AT_FDCWD}
286 The special value @code{AT_FDCWD}. This means that the current working
287 directory is used for the lookup if the file name is a relative. For
288 @code{@dots{}at} functions with an @code{AT_@dots{}} flags argument,
289 this provides a shortcut to use those flags with regular (not
290 descriptor-based) file name lookups.
292 If @code{AT_FDCWD} is used, the empty string @code{""} is not a valid
296 An arbitrary file descriptor, along with an empty string @code{""} as
297 the file name argument, and the @code{AT_EMPTY_PATH} flag. In this
298 case, the operation uses the file descriptor directly, without further
299 file name resolution. On Linux, this allows operations on descriptors
300 opened with the @code{O_PATH} flag. For regular descriptors (opened
301 without @code{O_PATH}), the same functionality is also available through
302 the plain descriptor-based functions (for example, @code{fstat} instead
305 This is a GNU extension.
308 @cindex file name resolution flags
309 @cindex @code{AT_*} file name resolution flags
310 The flags argument in @code{@dots{}at} functions can be a combination of
311 the following flags, defined in @file{fcntl.h}. Not all such functions
312 support all flags, and some (such as @code{openat}) do not accept a
313 flags argument at all.
315 In the flag descriptions below, the @dfn{effective final path component}
316 refers to the final component (basename) of the full path constructed
317 from the descriptor and file name arguments, using file name lookup, as
322 This flag is used with an empty file name @code{""} and a descriptor
323 which does not necessarily refer to a directory. It is most useful with
324 @code{O_PATH} descriptors, as described above. This flag is a GNU
327 @item AT_NO_AUTOMOUNT
328 If the effective final path component refers to a potential file system
329 mount point controlled by an auto-mounting service, the operation does
330 not trigger auto-mounting and refers to the unmounted mount point
331 instead. @xref{Mount-Unmount-Remount}. If a file system has already
332 been mounted at the effective final path component, the operation
333 applies to the file or directory in the mounted file system, not the
334 underlying file system that was mounted over. This flag is a GNU
337 @item AT_SYMLINK_FOLLOW
338 If the effective final path component is a symbolic link, the
339 operation follows the symbolic link and operates on its target. (For
340 most functions, this is the default behavior.)
342 @item AT_SYMLINK_NOFOLLOW
343 If the effective final path component is a symbolic link, the
344 operation operates on the symbolic link, without following it. The
345 difference in behavior enabled by this flag is similar to the difference
346 between the @code{lstat} and @code{stat} functions, or the behavior
347 activated by the @code{O_NOFOLLOW} argument to the @code{open} function.
348 Even with the @code{AT_SYMLINK_NOFOLLOW} flag present, symbolic links in
349 a non-final component of the file name are still followed.
352 @strong{Note:} There is no relationship between these flags and the type
353 argument to the @code{getauxval} function (with @code{AT_@dots{}}
354 constants defined in @file{elf.h}). @xref{Auxiliary Vector}.
356 @node Accessing Directories
357 @section Accessing Directories
358 @cindex accessing directories
359 @cindex reading from a directory
360 @cindex directories, accessing
362 The facilities described in this section let you read the contents of a
363 directory file. This is useful if you want your program to list all the
364 files in a directory, perhaps as part of a menu.
366 @cindex directory stream
367 The @code{opendir} function opens a @dfn{directory stream} whose
368 elements are directory entries. Alternatively @code{fdopendir} can be
369 used which can have advantages if the program needs to have more
370 control over the way the directory is opened for reading. This
371 allows, for instance, to pass the @code{O_NOATIME} flag to
374 You use the @code{readdir} function on the directory stream to
375 retrieve these entries, represented as @w{@code{struct dirent}}
376 objects. The name of the file for each entry is stored in the
377 @code{d_name} member of this structure. There are obvious parallels
378 here to the stream facilities for ordinary files, described in
379 @ref{I/O on Streams}.
382 * Directory Entries:: Format of one directory entry.
383 * Opening a Directory:: How to open a directory stream.
384 * Reading/Closing Directory:: How to read directory entries from the stream.
385 * Simple Directory Lister:: A very simple directory listing program.
386 * Random Access Directory:: Rereading part of the directory
387 already read with the same stream.
388 * Scanning Directory Content:: Get entries for user selected subset of
389 contents in given directory.
390 * Simple Directory Lister Mark II:: Revised version of the program.
391 * Low-level Directory Access:: AS-Safe functions for directory access.
394 @node Directory Entries
395 @subsection Format of a Directory Entry
398 This section describes what you find in a single directory entry, as you
399 might obtain it from a directory stream. All the symbols are declared
400 in the header file @file{dirent.h}.
402 @deftp {Data Type} {struct dirent}
403 @standards{POSIX.1, dirent.h}
404 This is a structure type used to return information about directory
405 entries. It contains the following fields:
409 This is the null-terminated file name component. This is the only
410 field you can count on in all POSIX systems.
413 This is the file serial number. For BSD compatibility, you can also
414 refer to this member as @code{d_ino}. On @gnulinuxhurdsystems{} and most POSIX
415 systems, for most files this the same as the @code{st_ino} member that
416 @code{stat} will return for the file. @xref{File Attributes}.
418 @item unsigned char d_namlen
419 This is the length of the file name, not including the terminating
420 null character. Its type is @code{unsigned char} because that is the
421 integer type of the appropriate size. This member is a BSD extension.
422 The symbol @code{_DIRENT_HAVE_D_NAMLEN} is defined if this member is
425 @item unsigned char d_type
426 This is the type of the file, possibly unknown. The following constants
427 are defined for its value:
431 The type is unknown. Only some filesystems have full support to
432 return the type of the file, others might always return this value.
441 A named pipe, or FIFO. @xref{FIFO Special Files}.
444 A local-domain socket. @c !!! @xref{Local Domain}.
456 This member is a BSD extension. The symbol @code{_DIRENT_HAVE_D_TYPE}
457 is defined if this member is available. On systems where it is used, it
458 corresponds to the file type bits in the @code{st_mode} member of
459 @code{struct stat}. If the value cannot be determined the member
460 value is DT_UNKNOWN. These two macros convert between @code{d_type}
461 values and @code{st_mode} values:
463 @deftypefun int IFTODT (mode_t @var{mode})
464 @standards{BSD, dirent.h}
465 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
466 This returns the @code{d_type} value corresponding to @var{mode}.
469 @deftypefun mode_t DTTOIF (int @var{dtype})
470 @standards{BSD, dirent.h}
471 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
472 This returns the @code{st_mode} value corresponding to @var{dtype}.
476 This structure may contain additional members in the future. Their
477 availability is always announced in the compilation environment by a
478 macro named @code{_DIRENT_HAVE_D_@var{xxx}} where @var{xxx} is replaced
479 by the name of the new member. For instance, the member @code{d_reclen}
480 available on some systems is announced through the macro
481 @code{_DIRENT_HAVE_D_RECLEN}.
483 When a file has multiple names, each name has its own directory entry.
484 The only way you can tell that the directory entries belong to a
485 single file is that they have the same value for the @code{d_fileno}
488 File attributes such as size, modification times etc., are part of the
489 file itself, not of any particular directory entry. @xref{File
493 @node Opening a Directory
494 @subsection Opening a Directory Stream
497 This section describes how to open a directory stream. All the symbols
498 are declared in the header file @file{dirent.h}.
500 @deftp {Data Type} DIR
501 @standards{POSIX.1, dirent.h}
502 The @code{DIR} data type represents a directory stream.
505 You shouldn't ever allocate objects of the @code{struct dirent} or
506 @code{DIR} data types, since the directory access functions do that for
507 you. Instead, you refer to these objects using the pointers returned by
508 the following functions.
510 Directory streams are a high-level interface. On Linux, alternative
511 interfaces for accessing directories using file descriptors are
512 available. @xref{Low-level Directory Access}.
514 @deftypefun {DIR *} opendir (const char *@var{dirname})
515 @standards{POSIX.1, dirent.h}
516 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
517 @c Besides the safe syscall, we have to allocate the DIR object with
518 @c __alloc_dir, that calls malloc.
519 The @code{opendir} function opens and returns a directory stream for
520 reading the directory whose file name is @var{dirname}. The stream has
523 If unsuccessful, @code{opendir} returns a null pointer. In addition to
524 the usual file name errors (@pxref{File Name Errors}), the
525 following @code{errno} error conditions are defined for this function:
529 Read permission is denied for the directory named by @code{dirname}.
532 The process has too many files open.
535 The entire system, or perhaps the file system which contains the
536 directory, cannot support any additional open files at the moment.
537 (This problem cannot happen on @gnuhurdsystems{}.)
540 Not enough memory available.
543 The @code{DIR} type is typically implemented using a file descriptor,
544 and the @code{opendir} function in terms of the @code{open} function.
545 @xref{Low-Level I/O}. Directory streams and the underlying
546 file descriptors are closed on @code{exec} (@pxref{Executing a File}).
549 The directory which is opened for reading by @code{opendir} is
550 identified by the name. In some situations this is not sufficient.
551 Or the way @code{opendir} implicitly creates a file descriptor for the
552 directory is not the way a program might want it. In these cases an
553 alternative interface can be used.
555 @deftypefun {DIR *} fdopendir (int @var{fd})
556 @standards{GNU, dirent.h}
557 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
558 @c The DIR object is allocated with __alloc_dir, that calls malloc.
559 The @code{fdopendir} function works just like @code{opendir} but
560 instead of taking a file name and opening a file descriptor for the
561 directory the caller is required to provide a file descriptor. This
562 file descriptor is then used in subsequent uses of the returned
563 directory stream object.
565 The caller must make sure the file descriptor is associated with a
566 directory and it allows reading.
568 If the @code{fdopendir} call returns successfully the file descriptor
569 is now under the control of the system. It can be used in the same
570 way the descriptor implicitly created by @code{opendir} can be used
571 but the program must not close the descriptor.
573 In case the function is unsuccessful it returns a null pointer and the
574 file descriptor remains to be usable by the program. The following
575 @code{errno} error conditions are defined for this function:
579 The file descriptor is not valid.
582 The file descriptor is not associated with a directory.
585 The descriptor does not allow reading the directory content.
588 Not enough memory available.
592 In some situations it can be desirable to get hold of the file
593 descriptor which is created by the @code{opendir} call. For instance,
594 to switch the current working directory to the directory just read the
595 @code{fchdir} function could be used. Historically the @code{DIR} type
596 was exposed and programs could access the fields. This does not happen
597 in @theglibc{}. Instead a separate function is provided to allow
600 @deftypefun int dirfd (DIR *@var{dirstream})
601 @standards{GNU, dirent.h}
602 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
603 The function @code{dirfd} returns the file descriptor associated with
604 the directory stream @var{dirstream}. This descriptor can be used until
605 the directory is closed with @code{closedir}. If the directory stream
606 implementation is not using file descriptors the return value is
610 @node Reading/Closing Directory
611 @subsection Reading and Closing a Directory Stream
614 This section describes how to read directory entries from a directory
615 stream, and how to close the stream when you are done with it. All the
616 symbols are declared in the header file @file{dirent.h}.
618 @deftypefun {struct dirent *} readdir (DIR *@var{dirstream})
619 @standards{POSIX.1, dirent.h}
620 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
621 @c This function holds dirstream's non-recursive lock, which brings
622 @c about the usual issues with locks and async signals and cancellation,
623 @c but the lock taking is not enough to make the returned value safe to
624 @c use, since it points to a stream's internal buffer that can be
625 @c overwritten by subsequent calls or even released by closedir.
626 This function reads the next entry from the directory. It normally
627 returns a pointer to a structure containing information about the
628 file. This structure is associated with the @var{dirstream} handle
629 and can be rewritten by a subsequent call.
631 @strong{Portability Note:} On some systems @code{readdir} may not
632 return entries for @file{.} and @file{..}, even though these are always
633 valid file names in any directory. @xref{File Name Resolution}.
635 If there are no more entries in the directory or an error is detected,
636 @code{readdir} returns a null pointer. The following @code{errno} error
637 conditions are defined for this function:
641 The @var{dirstream} argument is not valid.
644 To distinguish between an end-of-directory condition or an error, you
645 must set @code{errno} to zero before calling @code{readdir}. To avoid
646 entering an infinite loop, you should stop reading from the directory
647 after the first error.
649 @strong{Caution:} The pointer returned by @code{readdir} points to
650 a buffer within the @code{DIR} object. The data in that buffer will
651 be overwritten by the next call to @code{readdir}. You must take care,
652 for instance, to copy the @code{d_name} string if you need it later.
654 Because of this, it is not safe to share a @code{DIR} object among
655 multiple threads, unless you use your own locking to ensure that
656 no thread calls @code{readdir} while another thread is still using the
657 data from the previous call. In @theglibc{}, it is safe to call
658 @code{readdir} from multiple threads as long as each thread uses
659 its own @code{DIR} object. POSIX.1-2008 does not require this to
660 be safe, but we are not aware of any operating systems where it
663 @code{readdir_r} allows you to provide your own buffer for the
664 @code{struct dirent}, but it is less portable than @code{readdir}, and
665 has problems with very long filenames (see below). We recommend
666 you use @code{readdir}, but do not share @code{DIR} objects.
669 @deftypefun int readdir_r (DIR *@var{dirstream}, struct dirent *@var{entry}, struct dirent **@var{result})
670 @standards{GNU, dirent.h}
671 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
672 This function is a version of @code{readdir} which performs internal
673 locking. Like @code{readdir} it returns the next entry from the
674 directory. To prevent conflicts between simultaneously running
675 threads the result is stored inside the @var{entry} object.
677 @strong{Portability Note:} @code{readdir_r} is deprecated. It is
678 recommended to use @code{readdir} instead of @code{readdir_r} for the
683 On systems which do not define @code{NAME_MAX}, it may not be possible
684 to use @code{readdir_r} safely because the caller does not specify the
685 length of the buffer for the directory entry.
688 On some systems, @code{readdir_r} cannot read directory entries with
689 very long names. If such a name is encountered, @theglibc{}
690 implementation of @code{readdir_r} returns with an error code of
691 @code{ENAMETOOLONG} after the final directory entry has been read. On
692 other systems, @code{readdir_r} may return successfully, but the
693 @code{d_name} member may not be NUL-terminated or may be truncated.
696 POSIX-1.2008 does not guarantee that @code{readdir} is thread-safe,
697 even when access to the same @var{dirstream} is serialized. But in
698 current implementations (including @theglibc{}), it is safe to call
699 @code{readdir} concurrently on different @var{dirstream}s, so there is
700 no need to use @code{readdir_r} in most multi-threaded programs. In
701 the rare case that multiple threads need to read from the same
702 @var{dirstream}, it is still better to use @code{readdir} and external
706 It is expected that future versions of POSIX will obsolete
707 @code{readdir_r} and mandate the level of thread safety for
708 @code{readdir} which is provided by @theglibc{} and other
709 implementations today.
712 Normally @code{readdir_r} returns zero and sets @code{*@var{result}}
713 to @var{entry}. If there are no more entries in the directory or an
714 error is detected, @code{readdir_r} sets @code{*@var{result}} to a
715 null pointer and returns a nonzero error code, also stored in
716 @code{errno}, as described for @code{readdir}.
718 It is also important to look at the definition of the @code{struct
719 dirent} type. Simply passing a pointer to an object of this type for
720 the second parameter of @code{readdir_r} might not be enough. Some
721 systems don't define the @code{d_name} element sufficiently long. In
722 this case the user has to provide additional space. There must be room
723 for at least @code{NAME_MAX + 1} characters in the @code{d_name} array.
724 Code to call @code{readdir_r} could look like this:
730 char b[offsetof (struct dirent, d_name) + NAME_MAX + 1];
733 if (readdir_r (dir, &u.d, &res) == 0)
738 To support large filesystems on 32-bit machines there are LFS variants
739 of the last two functions.
741 @deftypefun {struct dirent64 *} readdir64 (DIR *@var{dirstream})
742 @standards{LFS, dirent.h}
743 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
744 The @code{readdir64} function is just like the @code{readdir} function
745 except that it returns a pointer to a record of type @code{struct
746 dirent64}. Some of the members of this data type (notably @code{d_ino})
747 might have a different size to allow large filesystems.
749 In all other aspects this function is equivalent to @code{readdir}.
752 @deftypefun int readdir64_r (DIR *@var{dirstream}, struct dirent64 *@var{entry}, struct dirent64 **@var{result})
753 @standards{LFS, dirent.h}
754 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
755 The deprecated @code{readdir64_r} function is equivalent to the
756 @code{readdir_r} function except that it takes parameters of base type
757 @code{struct dirent64} instead of @code{struct dirent} in the second and
758 third position. The same precautions mentioned in the documentation of
759 @code{readdir_r} also apply here.
762 @deftypefun int closedir (DIR *@var{dirstream})
763 @standards{POSIX.1, dirent.h}
764 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{/hurd}}@acunsafe{@acsmem{} @acsfd{} @aculock{/hurd}}}
765 @c No synchronization in the posix implementation, only in the hurd
766 @c one. This is regarded as safe because it is undefined behavior if
767 @c other threads could still be using the dir stream while it's closed.
768 This function closes the directory stream @var{dirstream}. It returns
769 @code{0} on success and @code{-1} on failure.
771 The following @code{errno} error conditions are defined for this
776 The @var{dirstream} argument is not valid.
780 @node Simple Directory Lister
781 @subsection Simple Program to List a Directory
783 Here's a simple program that prints the names of the files in
784 the current working directory:
790 The order in which files appear in a directory tends to be fairly
791 random. A more useful program would sort the entries (perhaps by
792 alphabetizing them) before printing them; see
793 @ref{Scanning Directory Content}, and @ref{Array Sort Function}.
796 @node Random Access Directory
797 @subsection Random Access in a Directory Stream
800 This section describes how to reread parts of a directory that you have
801 already read from an open directory stream. All the symbols are
802 declared in the header file @file{dirent.h}.
804 @deftypefun void rewinddir (DIR *@var{dirstream})
805 @standards{POSIX.1, dirent.h}
806 @safety{@prelim{}@mtsafe{}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
807 The @code{rewinddir} function is used to reinitialize the directory
808 stream @var{dirstream}, so that if you call @code{readdir} it
809 returns information about the first entry in the directory again. This
810 function also notices if files have been added or removed to the
811 directory since it was opened with @code{opendir}. (Entries for these
812 files might or might not be returned by @code{readdir} if they were
813 added or removed since you last called @code{opendir} or
817 @deftypefun {long int} telldir (DIR *@var{dirstream})
818 @standards{BSD, dirent.h}
819 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{/bsd} @asulock{/bsd}}@acunsafe{@acsmem{/bsd} @aculock{/bsd}}}
820 @c The implementation is safe on most platforms, but on BSD it uses
821 @c cookies, buckets and records, and the global array of pointers to
822 @c dynamically allocated records is guarded by a non-recursive lock.
823 The @code{telldir} function returns the file position of the directory
824 stream @var{dirstream}. You can use this value with @code{seekdir} to
825 restore the directory stream to that position.
828 @deftypefun void seekdir (DIR *@var{dirstream}, long int @var{pos})
829 @standards{BSD, dirent.h}
830 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{/bsd} @asulock{/bsd}}@acunsafe{@acsmem{/bsd} @aculock{/bsd}}}
831 @c The implementation is safe on most platforms, but on BSD it uses
832 @c cookies, buckets and records, and the global array of pointers to
833 @c dynamically allocated records is guarded by a non-recursive lock.
834 The @code{seekdir} function sets the file position of the directory
835 stream @var{dirstream} to @var{pos}. The value @var{pos} must be the
836 result of a previous call to @code{telldir} on this particular stream;
837 closing and reopening the directory can invalidate values returned by
842 @node Scanning Directory Content
843 @subsection Scanning the Content of a Directory
845 A higher-level interface to the directory handling functions is the
846 @code{scandir} function. With its help one can select a subset of the
847 entries in a directory, possibly sort them and get a list of names as
850 @deftypefun int scandir (const char *@var{dir}, struct dirent ***@var{namelist}, int (*@var{selector}) (const struct dirent *), int (*@var{cmp}) (const struct dirent **, const struct dirent **))
851 @standards{BSD, dirent.h}
852 @standards{SVID, dirent.h}
853 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
854 @c The scandir function calls __opendirat, __readdir, and __closedir to
855 @c go over the named dir; malloc and realloc to allocate the namelist
856 @c and copies of each selected dirent, besides the selector, if given,
857 @c and qsort and the cmp functions if the latter is given. In spite of
858 @c the cleanup handler that releases memory and the file descriptor in
859 @c case of synchronous cancellation, an asynchronous cancellation may
860 @c still leak memory and a file descriptor. Although readdir is unsafe
861 @c in general, the use of an internal dir stream for sequential scanning
862 @c of the directory with copying of dirents before subsequent calls
863 @c makes the use safe, and the fact that the dir stream is private to
864 @c each scandir call does away with the lock issues in readdir and
867 The @code{scandir} function scans the contents of the directory selected
868 by @var{dir}. The result in *@var{namelist} is an array of pointers to
869 structures of type @code{struct dirent} which describe all selected
870 directory entries and which is allocated using @code{malloc}. Instead
871 of always getting all directory entries returned, the user supplied
872 function @var{selector} can be used to decide which entries are in the
873 result. Only the entries for which @var{selector} returns a non-zero
876 Finally the entries in *@var{namelist} are sorted using the
877 user-supplied function @var{cmp}. The arguments passed to the @var{cmp}
878 function are of type @code{struct dirent **}, therefore one cannot
879 directly use the @code{strcmp} or @code{strcoll} functions; instead see
880 the functions @code{alphasort} and @code{versionsort} below.
882 The return value of the function is the number of entries placed in
883 *@var{namelist}. If it is @code{-1} an error occurred (either the
884 directory could not be opened for reading or memory allocation failed) and
885 the global variable @code{errno} contains more information on the error.
888 As described above, the fourth argument to the @code{scandir} function
889 must be a pointer to a sorting function. For the convenience of the
890 programmer @theglibc{} contains implementations of functions which
891 are very helpful for this purpose.
893 @deftypefun int alphasort (const struct dirent **@var{a}, const struct dirent **@var{b})
894 @standards{BSD, dirent.h}
895 @standards{SVID, dirent.h}
896 @safety{@prelim{}@mtsafe{@mtslocale{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{}}}
898 The @code{alphasort} function behaves like the @code{strcoll} function
899 (@pxref{String/Array Comparison}). The difference is that the arguments
900 are not string pointers but instead they are of type
901 @code{struct dirent **}.
903 The return value of @code{alphasort} is less than, equal to, or greater
904 than zero depending on the order of the two entries @var{a} and @var{b}.
907 @deftypefun int versionsort (const struct dirent **@var{a}, const struct dirent **@var{b})
908 @standards{GNU, dirent.h}
909 @safety{@prelim{}@mtsafe{@mtslocale{}}@assafe{}@acsafe{}}
910 @c Calls strverscmp, which will accesses the locale object multiple
912 The @code{versionsort} function is like @code{alphasort} except that it
913 uses the @code{strverscmp} function internally.
916 If the filesystem supports large files we cannot use the @code{scandir}
917 anymore since the @code{dirent} structure might not able to contain all
918 the information. The LFS provides the new type @w{@code{struct
919 dirent64}}. To use this we need a new function.
921 @deftypefun int scandir64 (const char *@var{dir}, struct dirent64 ***@var{namelist}, int (*@var{selector}) (const struct dirent64 *), int (*@var{cmp}) (const struct dirent64 **, const struct dirent64 **))
922 @standards{GNU, dirent.h}
923 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
925 The @code{scandir64} function works like the @code{scandir} function
926 except that the directory entries it returns are described by elements
927 of type @w{@code{struct dirent64}}. The function pointed to by
928 @var{selector} is again used to select the desired entries, except that
929 @var{selector} now must point to a function which takes a
930 @w{@code{struct dirent64 *}} parameter.
932 Similarly the @var{cmp} function should expect its two arguments to be
933 of type @code{struct dirent64 **}.
936 As @var{cmp} is now a function of a different type, the functions
937 @code{alphasort} and @code{versionsort} cannot be supplied for that
938 argument. Instead we provide the two replacement functions below.
940 @deftypefun int alphasort64 (const struct dirent64 **@var{a}, const struct dirent **@var{b})
941 @standards{GNU, dirent.h}
942 @safety{@prelim{}@mtsafe{@mtslocale{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{}}}
944 The @code{alphasort64} function behaves like the @code{strcoll} function
945 (@pxref{String/Array Comparison}). The difference is that the arguments
946 are not string pointers but instead they are of type
947 @code{struct dirent64 **}.
949 Return value of @code{alphasort64} is less than, equal to, or greater
950 than zero depending on the order of the two entries @var{a} and @var{b}.
953 @deftypefun int versionsort64 (const struct dirent64 **@var{a}, const struct dirent64 **@var{b})
954 @standards{GNU, dirent.h}
955 @safety{@prelim{}@mtsafe{@mtslocale{}}@assafe{}@acsafe{}}
957 The @code{versionsort64} function is like @code{alphasort64}, excepted that it
958 uses the @code{strverscmp} function internally.
961 It is important not to mix the use of @code{scandir} and the 64-bit
962 comparison functions or vice versa. There are systems on which this
963 works but on others it will fail miserably.
965 @node Simple Directory Lister Mark II
966 @subsection Simple Program to List a Directory, Mark II
968 Here is a revised version of the directory lister found above
969 (@pxref{Simple Directory Lister}). Using the @code{scandir} function we
970 can avoid the functions which work directly with the directory contents.
971 After the call the returned entries are available for direct use.
977 Note the simple selector function in this example. Since we want to see
978 all directory entries we always return @code{1}.
980 @node Low-level Directory Access
981 @subsection Low-level Directory Access
983 The stream-based directory functions are not AS-Safe and cannot be
984 used after @code{vfork}. @xref{POSIX Safety Concepts}. The functions
985 below provide an alternative that can be used in these contexts.
987 Directory data is obtained from a file descriptor, as created by the
988 @code{open} function, with or without the @code{O_DIRECTORY} flag.
989 @xref{Opening and Closing Files}.
991 @deftypefun ssize_t getdents64 (int @var{fd}, void *@var{buffer}, size_t @var{length})
992 @standards{Linux, dirent.h}
993 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
994 The @code{getdents64} function reads at most @var{length} bytes of
995 directory entry data from the file descriptor @var{fd} and stores it
996 into the byte array starting at @var{buffer}.
998 On success, the function returns the number of bytes written to the
999 buffer. This number is zero if @var{fd} is already at the end of the
1000 directory stream. On error, the function returns @code{-1} and sets
1001 @code{errno} to the appropriate error code.
1003 The data is stored as a sequence of @code{struct dirent64} records,
1004 which can be traversed using the @code{d_reclen} member. The buffer
1005 should be large enough to hold the largest possible directory entry.
1006 Note that some file systems support file names longer than
1007 @code{NAME_MAX} bytes (e.g., because they support up to 255 Unicode
1008 characters), so a buffer size of at least 1024 is recommended.
1010 This function is specific to Linux.
1014 @node Working with Directory Trees
1015 @section Working with Directory Trees
1016 @cindex directory hierarchy
1017 @cindex hierarchy, directory
1018 @cindex tree, directory
1020 The functions described so far for handling the files in a directory
1021 have allowed you to either retrieve the information bit by bit, or to
1022 process all the files as a group (see @code{scandir}). Sometimes it is
1023 useful to process whole hierarchies of directories and their contained
1024 files. The X/Open specification defines two functions to do this. The
1025 simpler form is derived from an early definition in @w{System V} systems
1026 and therefore this function is available on SVID-derived systems. The
1027 prototypes and required definitions can be found in the @file{ftw.h}
1030 There are four functions in this family: @code{ftw}, @code{nftw} and
1031 their 64-bit counterparts @code{ftw64} and @code{nftw64}. These
1032 functions take as one of their arguments a pointer to a callback
1033 function of the appropriate type.
1035 @deftp {Data Type} __ftw_func_t
1036 @standards{GNU, ftw.h}
1039 int (*) (const char *, const struct stat *, int)
1042 The type of callback functions given to the @code{ftw} function. The
1043 first parameter points to the file name, the second parameter to an
1044 object of type @code{struct stat} which is filled in for the file named
1045 in the first parameter.
1048 The last parameter is a flag giving more information about the current
1049 file. It can have the following values:
1053 The item is either a normal file or a file which does not fit into one
1054 of the following categories. This could be special files, sockets etc.
1056 The item is a directory.
1058 The @code{stat} call failed and so the information pointed to by the
1059 second parameter is invalid.
1061 The item is a directory which cannot be read.
1063 The item is a symbolic link. Since symbolic links are normally followed
1064 seeing this value in a @code{ftw} callback function means the referenced
1065 file does not exist. The situation for @code{nftw} is different.
1067 This value is only available if the program is compiled with
1068 @code{_XOPEN_EXTENDED} defined before including
1069 the first header. The original SVID systems do not have symbolic links.
1072 If the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
1073 type is in fact @code{__ftw64_func_t} since this mode changes
1074 @code{struct stat} to be @code{struct stat64}.
1077 For the LFS interface and for use in the function @code{ftw64}, the
1078 header @file{ftw.h} defines another function type.
1080 @deftp {Data Type} __ftw64_func_t
1081 @standards{GNU, ftw.h}
1084 int (*) (const char *, const struct stat64 *, int)
1087 This type is used just like @code{__ftw_func_t} for the callback
1088 function, but this time is called from @code{ftw64}. The second
1089 parameter to the function is a pointer to a variable of type
1090 @code{struct stat64} which is able to represent the larger values.
1093 @deftp {Data Type} __nftw_func_t
1094 @standards{GNU, ftw.h}
1097 int (*) (const char *, const struct stat *, int, struct FTW *)
1100 The first three arguments are the same as for the @code{__ftw_func_t}
1101 type. However for the third argument some additional values are defined
1102 to allow finer differentiation:
1105 The current item is a directory and all subdirectories have already been
1106 visited and reported. This flag is returned instead of @code{FTW_D} if
1107 the @code{FTW_DEPTH} flag is passed to @code{nftw} (see below).
1109 The current item is a stale symbolic link. The file it points to does
1113 The last parameter of the callback function is a pointer to a structure
1114 with some extra information as described below.
1116 If the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
1117 type is in fact @code{__nftw64_func_t} since this mode changes
1118 @code{struct stat} to be @code{struct stat64}.
1121 For the LFS interface there is also a variant of this data type
1122 available which has to be used with the @code{nftw64} function.
1124 @deftp {Data Type} __nftw64_func_t
1125 @standards{GNU, ftw.h}
1128 int (*) (const char *, const struct stat64 *, int, struct FTW *)
1131 This type is used just like @code{__nftw_func_t} for the callback
1132 function, but this time is called from @code{nftw64}. The second
1133 parameter to the function is this time a pointer to a variable of type
1134 @code{struct stat64} which is able to represent the larger values.
1137 @deftp {Data Type} {struct FTW}
1138 @standards{XPG4.2, ftw.h}
1139 The information contained in this structure helps in interpreting the
1140 name parameter and gives some information about the current state of the
1141 traversal of the directory hierarchy.
1145 The value is the offset into the string passed in the first parameter to
1146 the callback function of the beginning of the file name. The rest of
1147 the string is the path of the file. This information is especially
1148 important if the @code{FTW_CHDIR} flag was set in calling @code{nftw}
1149 since then the current directory is the one the current item is found
1152 Whilst processing, the code tracks how many directories down it has gone
1153 to find the current file. This nesting level starts at @math{0} for
1154 files in the initial directory (or is zero for the initial file if a
1160 @deftypefun int ftw (const char *@var{filename}, __ftw_func_t @var{func}, int @var{descriptors})
1161 @standards{SVID, ftw.h}
1162 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
1163 @c see nftw for safety details
1164 The @code{ftw} function calls the callback function given in the
1165 parameter @var{func} for every item which is found in the directory
1166 specified by @var{filename} and all directories below. The function
1167 follows symbolic links if necessary but does not process an item twice.
1168 If @var{filename} is not a directory then it itself is the only object
1169 returned to the callback function.
1171 The file name passed to the callback function is constructed by taking
1172 the @var{filename} parameter and appending the names of all passed
1173 directories and then the local file name. So the callback function can
1174 use this parameter to access the file. @code{ftw} also calls
1175 @code{stat} for the file and passes that information on to the callback
1176 function. If this @code{stat} call is not successful the failure is
1177 indicated by setting the third argument of the callback function to
1178 @code{FTW_NS}. Otherwise it is set according to the description given
1179 in the account of @code{__ftw_func_t} above.
1181 The callback function is expected to return @math{0} to indicate that no
1182 error occurred and that processing should continue. If an error
1183 occurred in the callback function or it wants @code{ftw} to return
1184 immediately, the callback function can return a value other than
1185 @math{0}. This is the only correct way to stop the function. The
1186 program must not use @code{setjmp} or similar techniques to continue
1187 from another place. This would leave resources allocated by the
1188 @code{ftw} function unfreed.
1190 The @var{descriptors} parameter to @code{ftw} specifies how many file
1191 descriptors it is allowed to consume. The function runs faster the more
1192 descriptors it can use. For each level in the directory hierarchy at
1193 most one descriptor is used, but for very deep ones any limit on open
1194 file descriptors for the process or the system may be exceeded.
1195 Moreover, file descriptor limits in a multi-threaded program apply to
1196 all the threads as a group, and therefore it is a good idea to supply a
1197 reasonable limit to the number of open descriptors.
1199 The return value of the @code{ftw} function is @math{0} if all callback
1200 function calls returned @math{0} and all actions performed by the
1201 @code{ftw} succeeded. If a function call failed (other than calling
1202 @code{stat} on an item) the function returns @math{-1}. If a callback
1203 function returns a value other than @math{0} this value is returned as
1204 the return value of @code{ftw}.
1206 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1207 32-bit system this function is in fact @code{ftw64}, i.e., the LFS
1208 interface transparently replaces the old interface.
1211 @deftypefun int ftw64 (const char *@var{filename}, __ftw64_func_t @var{func}, int @var{descriptors})
1212 @standards{Unix98, ftw.h}
1213 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
1214 This function is similar to @code{ftw} but it can work on filesystems
1215 with large files. File information is reported using a variable of type
1216 @code{struct stat64} which is passed by reference to the callback
1219 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1220 32-bit system this function is available under the name @code{ftw} and
1221 transparently replaces the old implementation.
1224 @deftypefun int nftw (const char *@var{filename}, __nftw_func_t @var{func}, int @var{descriptors}, int @var{flag})
1225 @standards{XPG4.2, ftw.h}
1226 @safety{@prelim{}@mtsafe{@mtasscwd{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{} @acscwd{}}}
1227 @c ftw_startup calls alloca, malloc, free, xstat/lxstat, tdestroy, and ftw_dir
1228 @c if FTW_CHDIR, call open, and fchdir, or chdir and getcwd
1229 @c ftw_dir calls open_dir_stream, readdir64, process_entry, closedir
1230 @c if FTW_CHDIR, also calls fchdir
1231 @c open_dir_stream calls malloc, realloc, readdir64, free, closedir,
1232 @c then openat64_not_cancel_3 and fdopendir or opendir, then dirfd.
1233 @c process_entry may cal realloc, fxstatat/lxstat/xstat, ftw_dir, and
1234 @c find_object (tsearch) and add_object (tfind).
1235 @c Since each invocation of *ftw uses its own private search tree, none
1236 @c of the search tree concurrency issues apply.
1237 The @code{nftw} function works like the @code{ftw} functions. They call
1238 the callback function @var{func} for all items found in the directory
1239 @var{filename} and below. At most @var{descriptors} file descriptors
1240 are consumed during the @code{nftw} call.
1242 One difference is that the callback function is of a different type. It
1243 is of type @w{@code{struct FTW *}} and provides the callback function
1244 with the extra information described above.
1246 A second difference is that @code{nftw} takes a fourth argument, which
1247 is @math{0} or a bitwise-OR combination of any of the following values.
1251 While traversing the directory symbolic links are not followed. Instead
1252 symbolic links are reported using the @code{FTW_SL} value for the type
1253 parameter to the callback function. If the file referenced by a
1254 symbolic link does not exist @code{FTW_SLN} is returned instead.
1256 The callback function is only called for items which are on the same
1257 mounted filesystem as the directory given by the @var{filename}
1258 parameter to @code{nftw}.
1260 If this flag is given the current working directory is changed to the
1261 directory of the reported object before the callback function is called.
1262 When @code{ntfw} finally returns the current directory is restored to
1265 If this option is specified then all subdirectories and files within
1266 them are processed before processing the top directory itself
1267 (depth-first processing). This also means the type flag given to the
1268 callback function is @code{FTW_DP} and not @code{FTW_D}.
1269 @item FTW_ACTIONRETVAL
1270 If this option is specified then return values from callbacks
1271 are handled differently. If the callback returns @code{FTW_CONTINUE},
1272 walking continues normally. @code{FTW_STOP} means walking stops
1273 and @code{FTW_STOP} is returned to the caller. If @code{FTW_SKIP_SUBTREE}
1274 is returned by the callback with @code{FTW_D} argument, the subtree
1275 is skipped and walking continues with next sibling of the directory.
1276 If @code{FTW_SKIP_SIBLINGS} is returned by the callback, all siblings
1277 of the current entry are skipped and walking continues in its parent.
1278 No other return values should be returned from the callbacks if
1279 this option is set. This option is a GNU extension.
1282 The return value is computed in the same way as for @code{ftw}.
1283 @code{nftw} returns @math{0} if no failures occurred and all callback
1284 functions returned @math{0}. In case of internal errors, such as memory
1285 problems, the return value is @math{-1} and @code{errno} is set
1286 accordingly. If the return value of a callback invocation was non-zero
1287 then that value is returned.
1289 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1290 32-bit system this function is in fact @code{nftw64}, i.e., the LFS
1291 interface transparently replaces the old interface.
1294 @deftypefun int nftw64 (const char *@var{filename}, __nftw64_func_t @var{func}, int @var{descriptors}, int @var{flag})
1295 @standards{Unix98, ftw.h}
1296 @safety{@prelim{}@mtsafe{@mtasscwd{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{} @acscwd{}}}
1297 This function is similar to @code{nftw} but it can work on filesystems
1298 with large files. File information is reported using a variable of type
1299 @code{struct stat64} which is passed by reference to the callback
1302 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
1303 32-bit system this function is available under the name @code{nftw} and
1304 transparently replaces the old implementation.
1312 @cindex multiple names for one file
1313 @cindex file names, multiple
1315 In POSIX systems, one file can have many names at the same time. All of
1316 the names are equally real, and no one of them is preferred to the
1319 To add a name to a file, use the @code{link} function. (The new name is
1320 also called a @dfn{hard link} to the file.) Creating a new link to a
1321 file does not copy the contents of the file; it simply makes a new name
1322 by which the file can be known, in addition to the file's existing name
1325 One file can have names in several directories, so the organization
1326 of the file system is not a strict hierarchy or tree.
1328 In most implementations, it is not possible to have hard links to the
1329 same file in multiple file systems. @code{link} reports an error if you
1330 try to make a hard link to the file from another file system when this
1333 The prototype for the @code{link} function is declared in the header
1334 file @file{unistd.h}.
1337 @deftypefun int link (const char *@var{oldname}, const char *@var{newname})
1338 @standards{POSIX.1, unistd.h}
1339 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1340 The @code{link} function makes a new link to the existing file named by
1341 @var{oldname}, under the new name @var{newname}.
1343 This function returns a value of @code{0} if it is successful and
1344 @code{-1} on failure. In addition to the usual file name errors
1345 (@pxref{File Name Errors}) for both @var{oldname} and @var{newname}, the
1346 following @code{errno} error conditions are defined for this function:
1350 You are not allowed to write to the directory in which the new link is
1353 Some implementations also require that the existing file be accessible
1354 by the caller, and use this error to report failure for that reason.
1358 There is already a file named @var{newname}. If you want to replace
1359 this link with a new link, you must remove the old link explicitly first.
1362 There are already too many links to the file named by @var{oldname}.
1363 (The maximum number of links to a file is @w{@code{LINK_MAX}}; see
1364 @ref{Limits for Files}.)
1367 The file named by @var{oldname} doesn't exist. You can't make a link to
1368 a file that doesn't exist.
1371 The directory or file system that would contain the new link is full
1372 and cannot be extended.
1375 On @gnulinuxhurdsystems{} and some others, you cannot make links to
1377 Many systems allow only privileged users to do so. This error
1378 is used to report the problem.
1381 The directory containing the new link can't be modified because it's on
1382 a read-only file system.
1385 The directory specified in @var{newname} is on a different file system
1386 than the existing file.
1389 A hardware error occurred while trying to read or write the to filesystem.
1393 @deftypefun int linkat (int oldfd, const char *@var{oldname}, int newfd, const char *@var{newname}, int flags)
1394 @standards{POSIX.1, unistd.h}
1395 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1397 The @code{linkat} function is analogous to the @code{link} function,
1398 except that it identifies its source and target using a combination of a
1399 file descriptor (referring to a directory) and a file name.
1400 @xref{Descriptor-Relative Access}. For @code{linkat}, if a file name is
1401 not absolute, it is resolved relative to the corresponding file
1402 descriptor. As usual, the special value @code{AT_FDCWD} denotes the
1405 The @var{flags} argument is a combination of the following flags:
1408 @item AT_SYMLINK_FOLLOW
1409 If the source path identified by @var{oldfd} and @var{oldname} is a
1410 symbolic link, @code{linkat} follows the symbolic link and creates a
1411 link to its target. If the flag is not set, a link for the symbolic
1412 link itself is created; this is not supported by all file systems and
1413 @code{linkat} can fail in this case.
1416 If this flag is specified, @var{oldname} can be an empty string. In
1417 this case, a new link to the file denoted by the descriptor @var{oldfd}
1418 is created, which may have been opened with @code{O_PATH} or
1419 @code{O_TMPFILE}. This flag is a GNU extension.
1423 @node Symbolic Links
1424 @section Symbolic Links
1427 @cindex symbolic link
1428 @cindex link, symbolic
1430 @gnusystems{} support @dfn{soft links} or @dfn{symbolic links}. This
1431 is a kind of ``file'' that is essentially a pointer to another file
1432 name. Unlike hard links, symbolic links can be made to directories or
1433 across file systems with no restrictions. You can also make a symbolic
1434 link to a name which is not the name of any file. (Opening this link
1435 will fail until a file by that name is created.) Likewise, if the
1436 symbolic link points to an existing file which is later deleted, the
1437 symbolic link continues to point to the same file name even though the
1438 name no longer names any file.
1440 The reason symbolic links work the way they do is that special things
1441 happen when you try to open the link. The @code{open} function realizes
1442 you have specified the name of a link, reads the file name contained in
1443 the link, and opens that file name instead. The @code{stat} function
1444 likewise operates on the file that the symbolic link points to, instead
1445 of on the link itself.
1447 By contrast, other operations such as deleting or renaming the file
1448 operate on the link itself. The functions @code{readlink} and
1449 @code{lstat} also refrain from following symbolic links, because their
1450 purpose is to obtain information about the link. @code{link}, the
1451 function that makes a hard link, does too. It makes a hard link to the
1452 symbolic link, which one rarely wants.
1454 Some systems have, for some functions operating on files, a limit on
1455 how many symbolic links are followed when resolving a path name. The
1456 limit if it exists is published in the @file{sys/param.h} header file.
1458 @deftypevr Macro int MAXSYMLINKS
1459 @standards{BSD, sys/param.h}
1461 The macro @code{MAXSYMLINKS} specifies how many symlinks some function
1462 will follow before returning @code{ELOOP}. Not all functions behave the
1463 same and this value is not the same as that returned for
1464 @code{_SC_SYMLOOP} by @code{sysconf}. In fact, the @code{sysconf}
1465 result can indicate that there is no fixed limit although
1466 @code{MAXSYMLINKS} exists and has a finite value.
1469 Prototypes for most of the functions listed in this section are in
1473 @deftypefun int symlink (const char *@var{oldname}, const char *@var{newname})
1474 @standards{BSD, unistd.h}
1475 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1476 The @code{symlink} function makes a symbolic link to @var{oldname} named
1479 The normal return value from @code{symlink} is @code{0}. A return value
1480 of @code{-1} indicates an error. In addition to the usual file name
1481 syntax errors (@pxref{File Name Errors}), the following @code{errno}
1482 error conditions are defined for this function:
1486 There is already an existing file named @var{newname}.
1489 The file @var{newname} would exist on a read-only file system.
1492 The directory or file system cannot be extended to make the new link.
1495 A hardware error occurred while reading or writing data on the disk.
1497 @comment not sure about these
1500 There are too many levels of indirection. This can be the result of
1501 circular symbolic links to directories.
1504 The new link can't be created because the user's disk quota has been
1510 @deftypefun ssize_t readlink (const char *@var{filename}, char *@var{buffer}, size_t @var{size})
1511 @standards{BSD, unistd.h}
1512 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1513 The @code{readlink} function gets the value of the symbolic link
1514 @var{filename}. The file name that the link points to is copied into
1515 @var{buffer}. This file name string is @emph{not} null-terminated;
1516 @code{readlink} normally returns the number of characters copied. The
1517 @var{size} argument specifies the maximum number of characters to copy,
1518 usually the allocation size of @var{buffer}.
1520 If the return value equals @var{size}, you cannot tell whether or not
1521 there was room to return the entire name. So make a bigger buffer and
1522 call @code{readlink} again. Here is an example:
1526 readlink_malloc (const char *filename)
1529 char *buffer = NULL;
1533 buffer = xreallocarray (buffer, size, 2);
1535 ssize_t nchars = readlink (filename, buffer, size);
1547 @c @group Invalid outside example.
1548 A value of @code{-1} is returned in case of error. In addition to the
1549 usual file name errors (@pxref{File Name Errors}), the following
1550 @code{errno} error conditions are defined for this function:
1554 The named file is not a symbolic link.
1557 A hardware error occurred while reading or writing data on the disk.
1562 In some situations it is desirable to resolve all the
1563 symbolic links to get the real
1564 name of a file where no prefix names a symbolic link which is followed
1565 and no filename in the path is @code{.} or @code{..}. This is for
1566 instance desirable if files have to be compared in which case different
1567 names can refer to the same inode.
1569 @deftypefun {char *} canonicalize_file_name (const char *@var{name})
1570 @standards{GNU, stdlib.h}
1571 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
1574 The @code{canonicalize_file_name} function returns the absolute name of
1575 the file named by @var{name} which contains no @code{.}, @code{..}
1576 components nor any repeated path separators (@code{/}) or symlinks. The
1577 result is passed back as the return value of the function in a block of
1578 memory allocated with @code{malloc}. If the result is not used anymore
1579 the memory should be freed with a call to @code{free}.
1581 If any of the path components are missing the function returns a NULL
1582 pointer. This is also what is returned if the length of the path
1583 reaches or exceeds @code{PATH_MAX} characters. In any case
1584 @code{errno} is set accordingly.
1588 The resulting path is too long. This error only occurs on systems which
1589 have a limit on the file name length.
1592 At least one of the path components is not readable.
1595 The input file name is empty.
1598 At least one of the path components does not exist.
1601 More than @code{MAXSYMLINKS} many symlinks have been followed.
1604 This function is a GNU extension and is declared in @file{stdlib.h}.
1607 The Unix standard includes a similar function which differs from
1608 @code{canonicalize_file_name} in that the user has to provide the buffer
1609 where the result is placed in.
1611 @deftypefun {char *} realpath (const char *restrict @var{name}, char *restrict @var{resolved})
1612 @standards{XPG, stdlib.h}
1613 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{} @acsfd{}}}
1614 @c Calls malloc, realloc, getcwd, lxstat64, readlink, alloca.
1616 A call to @code{realpath} where the @var{resolved} parameter is
1617 @code{NULL} behaves exactly like @code{canonicalize_file_name}. The
1618 function allocates a buffer for the file name and returns a pointer to
1619 it. If @var{resolved} is not @code{NULL} it points to a buffer into
1620 which the result is copied. It is the callers responsibility to
1621 allocate a buffer which is large enough. On systems which define
1622 @code{PATH_MAX} this means the buffer must be large enough for a
1623 pathname of this size. For systems without limitations on the pathname
1624 length the requirement cannot be met and programs should not call
1625 @code{realpath} with anything but @code{NULL} for the second parameter.
1627 One other difference is that the buffer @var{resolved} (if nonzero) will
1628 contain the part of the path component which does not exist or is not
1629 readable if the function returns @code{NULL} and @code{errno} is set to
1630 @code{EACCES} or @code{ENOENT}.
1632 This function is declared in @file{stdlib.h}.
1635 The advantage of using this function is that it is more widely
1636 available. The drawback is that it reports failures for long paths on
1637 systems which have no limits on the file name length.
1639 @node Deleting Files
1640 @section Deleting Files
1641 @cindex deleting a file
1642 @cindex removing a file
1643 @cindex unlinking a file
1645 You can delete a file with @code{unlink} or @code{remove}.
1647 Deletion actually deletes a file name. If this is the file's only name,
1648 then the file is deleted as well. If the file has other remaining names
1649 (@pxref{Hard Links}), it remains accessible under those names.
1651 @deftypefun int unlink (const char *@var{filename})
1652 @standards{POSIX.1, unistd.h}
1653 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1654 The @code{unlink} function deletes the file name @var{filename}. If
1655 this is a file's sole name, the file itself is also deleted. (Actually,
1656 if any process has the file open when this happens, deletion is
1657 postponed until all processes have closed the file.)
1660 The function @code{unlink} is declared in the header file @file{unistd.h}.
1662 This function returns @code{0} on successful completion, and @code{-1}
1663 on error. In addition to the usual file name errors
1664 (@pxref{File Name Errors}), the following @code{errno} error conditions are
1665 defined for this function:
1669 Write permission is denied for the directory from which the file is to be
1670 removed, or the directory has the sticky bit set and you do not own the file.
1673 This error indicates that the file is being used by the system in such a
1674 way that it can't be unlinked. For example, you might see this error if
1675 the file name specifies the root directory or a mount point for a file
1679 The file name to be deleted doesn't exist.
1682 On some systems @code{unlink} cannot be used to delete the name of a
1683 directory, or at least can only be used this way by a privileged user.
1684 To avoid such problems, use @code{rmdir} to delete directories. (On
1685 @gnulinuxhurdsystems{} @code{unlink} can never delete the name of a directory.)
1688 The directory containing the file name to be deleted is on a read-only
1689 file system and can't be modified.
1693 @deftypefun int rmdir (const char *@var{filename})
1694 @standards{POSIX.1, unistd.h}
1695 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1696 @cindex directories, deleting
1697 @cindex deleting a directory
1698 The @code{rmdir} function deletes a directory. The directory must be
1699 empty before it can be removed; in other words, it can only contain
1700 entries for @file{.} and @file{..}.
1702 In most other respects, @code{rmdir} behaves like @code{unlink}. There
1703 are two additional @code{errno} error conditions defined for
1709 The directory to be deleted is not empty.
1712 These two error codes are synonymous; some systems use one, and some use
1713 the other. @gnulinuxhurdsystems{} always use @code{ENOTEMPTY}.
1715 The prototype for this function is declared in the header file
1720 @deftypefun int remove (const char *@var{filename})
1721 @standards{ISO, stdio.h}
1722 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1723 @c Calls unlink and rmdir.
1724 This is the @w{ISO C} function to remove a file. It works like
1725 @code{unlink} for files and like @code{rmdir} for directories.
1726 @code{remove} is declared in @file{stdio.h}.
1730 @node Renaming Files
1731 @section Renaming Files
1733 The @code{rename} function is used to change a file's name.
1735 @cindex renaming a file
1736 @deftypefun int rename (const char *@var{oldname}, const char *@var{newname})
1737 @standards{ISO, stdio.h}
1738 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1739 @c In the absence of a rename syscall, there's an emulation with link
1740 @c and unlink, but it's racy, even more so if newname exists and is
1742 The @code{rename} function renames the file @var{oldname} to
1743 @var{newname}. The file formerly accessible under the name
1744 @var{oldname} is afterwards accessible as @var{newname} instead. (If
1745 the file had any other names aside from @var{oldname}, it continues to
1748 The directory containing the name @var{newname} must be on the same file
1749 system as the directory containing the name @var{oldname}.
1751 One special case for @code{rename} is when @var{oldname} and
1752 @var{newname} are two names for the same file. The consistent way to
1753 handle this case is to delete @var{oldname}. However, in this case
1754 POSIX requires that @code{rename} do nothing and report success---which
1755 is inconsistent. We don't know what your operating system will do.
1757 If @var{oldname} is not a directory, then any existing file named
1758 @var{newname} is removed during the renaming operation. However, if
1759 @var{newname} is the name of a directory, @code{rename} fails in this
1762 If @var{oldname} is a directory, then either @var{newname} must not
1763 exist or it must name a directory that is empty. In the latter case,
1764 the existing directory named @var{newname} is deleted first. The name
1765 @var{newname} must not specify a subdirectory of the directory
1766 @code{oldname} which is being renamed.
1768 One useful feature of @code{rename} is that the meaning of @var{newname}
1769 changes ``atomically'' from any previously existing file by that name to
1770 its new meaning (i.e., the file that was called @var{oldname}). There is
1771 no instant at which @var{newname} is non-existent ``in between'' the old
1772 meaning and the new meaning. If there is a system crash during the
1773 operation, it is possible for both names to still exist; but
1774 @var{newname} will always be intact if it exists at all.
1776 If @code{rename} fails, it returns @code{-1}. In addition to the usual
1777 file name errors (@pxref{File Name Errors}), the following
1778 @code{errno} error conditions are defined for this function:
1782 One of the directories containing @var{newname} or @var{oldname}
1783 refuses write permission; or @var{newname} and @var{oldname} are
1784 directories and write permission is refused for one of them.
1787 A directory named by @var{oldname} or @var{newname} is being used by
1788 the system in a way that prevents the renaming from working. This includes
1789 directories that are mount points for filesystems, and directories
1790 that are the current working directories of processes.
1794 The directory @var{newname} isn't empty. @gnulinuxhurdsystems{} always return
1795 @code{ENOTEMPTY} for this, but some other systems return @code{EEXIST}.
1798 @var{oldname} is a directory that contains @var{newname}.
1801 @var{newname} is a directory but the @var{oldname} isn't.
1804 The parent directory of @var{newname} would have too many links
1808 The file @var{oldname} doesn't exist.
1811 The directory that would contain @var{newname} has no room for another
1812 entry, and there is no space left in the file system to expand it.
1815 The operation would involve writing to a directory on a read-only file
1819 The two file names @var{newname} and @var{oldname} are on different
1824 @node Creating Directories
1825 @section Creating Directories
1826 @cindex creating a directory
1827 @cindex directories, creating
1830 Directories are created with the @code{mkdir} function. (There is also
1831 a shell command @code{mkdir} which does the same thing.)
1834 @deftypefun int mkdir (const char *@var{filename}, mode_t @var{mode})
1835 @standards{POSIX.1, sys/stat.h}
1836 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1837 The @code{mkdir} function creates a new, empty directory with name
1840 The argument @var{mode} specifies the file permissions for the new
1841 directory file. @xref{Permission Bits}, for more information about
1844 A return value of @code{0} indicates successful completion, and
1845 @code{-1} indicates failure. In addition to the usual file name syntax
1846 errors (@pxref{File Name Errors}), the following @code{errno} error
1847 conditions are defined for this function:
1851 Write permission is denied for the parent directory in which the new
1852 directory is to be added.
1855 A file named @var{filename} already exists.
1858 The parent directory has too many links (entries).
1860 Well-designed file systems never report this error, because they permit
1861 more links than your disk could possibly hold. However, you must still
1862 take account of the possibility of this error, as it could result from
1863 network access to a file system on another machine.
1866 The file system doesn't have enough room to create the new directory.
1869 The parent directory of the directory being created is on a read-only
1870 file system and cannot be modified.
1873 To use this function, your program should include the header file
1878 @node File Attributes
1879 @section File Attributes
1882 When you issue an @samp{ls -l} shell command on a file, it gives you
1883 information about the size of the file, who owns it, when it was last
1884 modified, etc. These are called the @dfn{file attributes}, and are
1885 associated with the file itself and not a particular one of its names.
1887 This section contains information about how you can inquire about and
1888 modify the attributes of a file.
1891 * Attribute Meanings:: The names of the file attributes,
1892 and what their values mean.
1893 * Reading Attributes:: How to read the attributes of a file.
1894 * Testing File Type:: Distinguishing ordinary files,
1895 directories, links@dots{}
1896 * File Owner:: How ownership for new files is determined,
1897 and how to change it.
1898 * Permission Bits:: How information about a file's access
1900 * Access Permission:: How the system decides who can access a file.
1901 * Setting Permissions:: How permissions for new files are assigned,
1902 and how to change them.
1903 * Testing File Access:: How to find out if your process can
1905 * File Times:: About the time attributes of a file.
1906 * File Size:: Manually changing the size of a file.
1907 * Storage Allocation:: Allocate backing storage for files.
1910 @node Attribute Meanings
1911 @subsection The meaning of the File Attributes
1912 @cindex status of a file
1913 @cindex attributes of a file
1914 @cindex file attributes
1916 When you read the attributes of a file, they come back in a structure
1917 called @code{struct stat}. This section describes the names of the
1918 attributes, their data types, and what they mean. For the functions
1919 to read the attributes of a file, see @ref{Reading Attributes}.
1921 The header file @file{sys/stat.h} declares all the symbols defined
1925 @deftp {Data Type} {struct stat}
1926 @standards{POSIX.1, sys/stat.h}
1927 The @code{stat} structure type is used to return information about the
1928 attributes of a file. It contains at least the following members:
1931 @item mode_t st_mode
1932 Specifies the mode of the file. This includes file type information
1933 (@pxref{Testing File Type}) and the file permission bits
1934 (@pxref{Permission Bits}).
1937 The file serial number, which distinguishes this file from all other
1938 files on the same device.
1941 Identifies the device containing the file. The @code{st_ino} and
1942 @code{st_dev}, taken together, uniquely identify the file. The
1943 @code{st_dev} value is not necessarily consistent across reboots or
1944 system crashes, however.
1946 @item nlink_t st_nlink
1947 The number of hard links to the file. This count keeps track of how
1948 many directories have entries for this file. If the count is ever
1949 decremented to zero, then the file itself is discarded as soon as no
1950 process still holds it open. Symbolic links are not counted in the
1954 The user ID of the file's owner. @xref{File Owner}.
1957 The group ID of the file. @xref{File Owner}.
1960 This specifies the size of a regular file in bytes. For files that are
1961 really devices this field isn't usually meaningful. For symbolic links
1962 this specifies the length of the file name the link refers to.
1964 @item time_t st_atime
1965 This is the last access time for the file. @xref{File Times}.
1967 @item unsigned long int st_atime_usec
1968 This is the fractional part of the last access time for the file.
1971 @item time_t st_mtime
1972 This is the time of the last modification to the contents of the file.
1975 @item unsigned long int st_mtime_usec
1976 This is the fractional part of the time of the last modification to the
1977 contents of the file. @xref{File Times}.
1979 @item time_t st_ctime
1980 This is the time of the last modification to the attributes of the file.
1983 @item unsigned long int st_ctime_usec
1984 This is the fractional part of the time of the last modification to the
1985 attributes of the file. @xref{File Times}.
1988 @item blkcnt_t st_blocks
1989 This is the amount of disk space that the file occupies, measured in
1990 units of 512-byte blocks.
1992 The number of disk blocks is not strictly proportional to the size of
1993 the file, for two reasons: the file system may use some blocks for
1994 internal record keeping; and the file may be sparse---it may have
1995 ``holes'' which contain zeros but do not actually take up space on the
1998 You can tell (approximately) whether a file is sparse by comparing this
1999 value with @code{st_size}, like this:
2002 (st.st_blocks * 512 < st.st_size)
2005 This test is not perfect because a file that is just slightly sparse
2006 might not be detected as sparse at all. For practical applications,
2007 this is not a problem.
2009 @item unsigned int st_blksize
2010 The optimal block size for reading or writing this file, in bytes. You
2011 might use this size for allocating the buffer space for reading or
2012 writing the file. (This is unrelated to @code{st_blocks}.)
2016 The extensions for the Large File Support (LFS) require, even on 32-bit
2017 machines, types which can handle file sizes up to @twoexp{63}.
2018 Therefore a new definition of @code{struct stat} is necessary.
2020 @deftp {Data Type} {struct stat64}
2021 @standards{LFS, sys/stat.h}
2022 The members of this type are the same and have the same names as those
2023 in @code{struct stat}. The only difference is that the members
2024 @code{st_ino}, @code{st_size}, and @code{st_blocks} have a different
2025 type to support larger values.
2028 @item mode_t st_mode
2029 Specifies the mode of the file. This includes file type information
2030 (@pxref{Testing File Type}) and the file permission bits
2031 (@pxref{Permission Bits}).
2033 @item ino64_t st_ino
2034 The file serial number, which distinguishes this file from all other
2035 files on the same device.
2038 Identifies the device containing the file. The @code{st_ino} and
2039 @code{st_dev}, taken together, uniquely identify the file. The
2040 @code{st_dev} value is not necessarily consistent across reboots or
2041 system crashes, however.
2043 @item nlink_t st_nlink
2044 The number of hard links to the file. This count keeps track of how
2045 many directories have entries for this file. If the count is ever
2046 decremented to zero, then the file itself is discarded as soon as no
2047 process still holds it open. Symbolic links are not counted in the
2051 The user ID of the file's owner. @xref{File Owner}.
2054 The group ID of the file. @xref{File Owner}.
2056 @item off64_t st_size
2057 This specifies the size of a regular file in bytes. For files that are
2058 really devices this field isn't usually meaningful. For symbolic links
2059 this specifies the length of the file name the link refers to.
2061 @item time_t st_atime
2062 This is the last access time for the file. @xref{File Times}.
2064 @item unsigned long int st_atime_usec
2065 This is the fractional part of the last access time for the file.
2068 @item time_t st_mtime
2069 This is the time of the last modification to the contents of the file.
2072 @item unsigned long int st_mtime_usec
2073 This is the fractional part of the time of the last modification to the
2074 contents of the file. @xref{File Times}.
2076 @item time_t st_ctime
2077 This is the time of the last modification to the attributes of the file.
2080 @item unsigned long int st_ctime_usec
2081 This is the fractional part of the time of the last modification to the
2082 attributes of the file. @xref{File Times}.
2085 @item blkcnt64_t st_blocks
2086 This is the amount of disk space that the file occupies, measured in
2087 units of 512-byte blocks.
2089 @item unsigned int st_blksize
2090 The optimal block size for reading of writing this file, in bytes. You
2091 might use this size for allocating the buffer space for reading of
2092 writing the file. (This is unrelated to @code{st_blocks}.)
2096 Some of the file attributes have special data type names which exist
2097 specifically for those attributes. (They are all aliases for well-known
2098 integer types that you know and love.) These typedef names are defined
2099 in the header file @file{sys/types.h} as well as in @file{sys/stat.h}.
2100 Here is a list of them.
2102 @deftp {Data Type} mode_t
2103 @standards{POSIX.1, sys/types.h}
2104 This is an integer data type used to represent file modes. In
2105 @theglibc{}, this is an unsigned type no narrower than @code{unsigned
2109 @cindex inode number
2110 @deftp {Data Type} ino_t
2111 @standards{POSIX.1, sys/types.h}
2112 This is an unsigned integer type used to represent file serial numbers.
2113 (In Unix jargon, these are sometimes called @dfn{inode numbers}.)
2114 In @theglibc{}, this type is no narrower than @code{unsigned int}.
2116 If the source is compiled with @code{_FILE_OFFSET_BITS == 64} this type
2117 is transparently replaced by @code{ino64_t}.
2120 @deftp {Data Type} ino64_t
2121 @standards{Unix98, sys/types.h}
2122 This is an unsigned integer type used to represent file serial numbers
2123 for the use in LFS. In @theglibc{}, this type is no narrower than
2124 @code{unsigned int}.
2126 When compiling with @code{_FILE_OFFSET_BITS == 64} this type is
2127 available under the name @code{ino_t}.
2130 @deftp {Data Type} dev_t
2131 @standards{POSIX.1, sys/types.h}
2132 This is an arithmetic data type used to represent file device numbers.
2133 In @theglibc{}, this is an integer type no narrower than @code{int}.
2136 @deftp {Data Type} nlink_t
2137 @standards{POSIX.1, sys/types.h}
2138 This is an integer type used to represent file link counts.
2141 @deftp {Data Type} blkcnt_t
2142 @standards{Unix98, sys/types.h}
2143 This is a signed integer type used to represent block counts.
2144 In @theglibc{}, this type is no narrower than @code{int}.
2146 If the source is compiled with @code{_FILE_OFFSET_BITS == 64} this type
2147 is transparently replaced by @code{blkcnt64_t}.
2150 @deftp {Data Type} blkcnt64_t
2151 @standards{Unix98, sys/types.h}
2152 This is a signed integer type used to represent block counts for the
2153 use in LFS. In @theglibc{}, this type is no narrower than @code{int}.
2155 When compiling with @code{_FILE_OFFSET_BITS == 64} this type is
2156 available under the name @code{blkcnt_t}.
2159 @node Reading Attributes
2160 @subsection Reading the Attributes of a File
2162 To examine the attributes of files, use the functions @code{stat},
2163 @code{fstat} and @code{lstat}. They return the attribute information in
2164 a @code{struct stat} object. All three functions are declared in the
2165 header file @file{sys/stat.h}.
2167 @deftypefun int stat (const char *@var{filename}, struct stat *@var{buf})
2168 @standards{POSIX.1, sys/stat.h}
2169 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2170 The @code{stat} function returns information about the attributes of the
2171 file named by @w{@var{filename}} in the structure pointed to by @var{buf}.
2173 If @var{filename} is the name of a symbolic link, the attributes you get
2174 describe the file that the link points to. If the link points to a
2175 nonexistent file name, then @code{stat} fails reporting a nonexistent
2178 The return value is @code{0} if the operation is successful, or
2179 @code{-1} on failure. In addition to the usual file name errors
2180 (@pxref{File Name Errors}, the following @code{errno} error conditions
2181 are defined for this function:
2185 The file named by @var{filename} doesn't exist.
2188 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2189 function is in fact @code{stat64} since the LFS interface transparently
2190 replaces the normal implementation.
2193 @deftypefun int stat64 (const char *@var{filename}, struct stat64 *@var{buf})
2194 @standards{Unix98, sys/stat.h}
2195 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2196 This function is similar to @code{stat} but it is also able to work on
2197 files larger than @twoexp{31} bytes on 32-bit systems. To be able to do
2198 this the result is stored in a variable of type @code{struct stat64} to
2199 which @var{buf} must point.
2201 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2202 function is available under the name @code{stat} and so transparently
2203 replaces the interface for small files on 32-bit machines.
2206 @deftypefun int fstat (int @var{filedes}, struct stat *@var{buf})
2207 @standards{POSIX.1, sys/stat.h}
2208 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2209 The @code{fstat} function is like @code{stat}, except that it takes an
2210 open file descriptor as an argument instead of a file name.
2211 @xref{Low-Level I/O}.
2213 Like @code{stat}, @code{fstat} returns @code{0} on success and @code{-1}
2214 on failure. The following @code{errno} error conditions are defined for
2219 The @var{filedes} argument is not a valid file descriptor.
2222 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2223 function is in fact @code{fstat64} since the LFS interface transparently
2224 replaces the normal implementation.
2227 @deftypefun int fstat64 (int @var{filedes}, struct stat64 *@var{buf})
2228 @standards{Unix98, sys/stat.h}
2229 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2230 This function is similar to @code{fstat} but is able to work on large
2231 files on 32-bit platforms. For large files the file descriptor
2232 @var{filedes} should be obtained by @code{open64} or @code{creat64}.
2233 The @var{buf} pointer points to a variable of type @code{struct stat64}
2234 which is able to represent the larger values.
2236 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2237 function is available under the name @code{fstat} and so transparently
2238 replaces the interface for small files on 32-bit machines.
2241 @deftypefun int fstatat (int @var{filedes}, const char *@var{filename}, struct stat *@var{buf}, int @var{flags})
2242 @standards{POSIX.1, sys/stat.h}
2243 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2244 This function is a descriptor-relative version of the @code{fstat}
2245 function above. @xref{Descriptor-Relative Access}. The @var{flags}
2246 argument can contain a combination of the flags @code{AT_EMPTY_PATH},
2247 @code{AT_NO_AUTOMOUNT}, @code{AT_SYMLINK_NOFOLLOW}.
2249 Compared to @code{fstat}, the following additional error conditions can
2254 The @var{filedes} argument is not a valid file descriptor.
2257 The @var{flags} argument is not valid for this function.
2260 The descriptor @var{filedes} is not associated with a directory, and
2261 @var{filename} is a relative file name.
2264 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2265 function is in fact @code{fstatat64} since the LFS interface transparently
2266 replaces the normal implementation.
2269 @deftypefun int fstatat64 (int @var{filedes}, const char *@var{filename}, struct stat64 *@var{buf}, int @var{flags})
2270 @standards{GNU, sys/stat.h}
2271 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2272 This function is the large-file variant of @code{fstatat}, similar to
2273 how @code{fstat64} is the variant of @code{fstat}.
2275 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2276 function is available under the name @code{fstatat} and so transparently
2277 replaces the interface for small files on 32-bit machines.
2280 @deftypefun int lstat (const char *@var{filename}, struct stat *@var{buf})
2281 @standards{BSD, sys/stat.h}
2282 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2283 @c Direct system call through lxstat, sometimes with an xstat conv call
2285 The @code{lstat} function is like @code{stat}, except that it does not
2286 follow symbolic links. If @var{filename} is the name of a symbolic
2287 link, @code{lstat} returns information about the link itself; otherwise
2288 @code{lstat} works like @code{stat}. @xref{Symbolic Links}.
2290 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2291 function is in fact @code{lstat64} since the LFS interface transparently
2292 replaces the normal implementation.
2295 @deftypefun int lstat64 (const char *@var{filename}, struct stat64 *@var{buf})
2296 @standards{Unix98, sys/stat.h}
2297 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2298 @c Direct system call through lxstat64, sometimes with an xstat conv
2300 This function is similar to @code{lstat} but it is also able to work on
2301 files larger than @twoexp{31} bytes on 32-bit systems. To be able to do
2302 this the result is stored in a variable of type @code{struct stat64} to
2303 which @var{buf} must point.
2305 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} this
2306 function is available under the name @code{lstat} and so transparently
2307 replaces the interface for small files on 32-bit machines.
2310 @node Testing File Type
2311 @subsection Testing the Type of a File
2313 The @dfn{file mode}, stored in the @code{st_mode} field of the file
2314 attributes, contains two kinds of information: the file type code, and
2315 the access permission bits. This section discusses only the type code,
2316 which you can use to tell whether the file is a directory, socket,
2317 symbolic link, and so on. For details about access permissions see
2318 @ref{Permission Bits}.
2320 There are two ways you can access the file type information in a file
2321 mode. Firstly, for each file type there is a @dfn{predicate macro}
2322 which examines a given file mode and returns whether it is of that type
2323 or not. Secondly, you can mask out the rest of the file mode to leave
2324 just the file type code, and compare this against constants for each of
2325 the supported file types.
2327 All of the symbols listed in this section are defined in the header file
2331 The following predicate macros test the type of a file, given the value
2332 @var{m} which is the @code{st_mode} field returned by @code{stat} on
2335 @deftypefn Macro int S_ISDIR (mode_t @var{m})
2336 @standards{POSIX, sys/stat.h}
2337 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2338 This macro returns non-zero if the file is a directory.
2341 @deftypefn Macro int S_ISCHR (mode_t @var{m})
2342 @standards{POSIX, sys/stat.h}
2343 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2344 This macro returns non-zero if the file is a character special file (a
2345 device like a terminal).
2348 @deftypefn Macro int S_ISBLK (mode_t @var{m})
2349 @standards{POSIX, sys/stat.h}
2350 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2351 This macro returns non-zero if the file is a block special file (a device
2355 @deftypefn Macro int S_ISREG (mode_t @var{m})
2356 @standards{POSIX, sys/stat.h}
2357 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2358 This macro returns non-zero if the file is a regular file.
2361 @deftypefn Macro int S_ISFIFO (mode_t @var{m})
2362 @standards{POSIX, sys/stat.h}
2363 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2364 This macro returns non-zero if the file is a FIFO special file, or a
2365 pipe. @xref{Pipes and FIFOs}.
2368 @deftypefn Macro int S_ISLNK (mode_t @var{m})
2369 @standards{GNU, sys/stat.h}
2370 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2371 This macro returns non-zero if the file is a symbolic link.
2372 @xref{Symbolic Links}.
2375 @deftypefn Macro int S_ISSOCK (mode_t @var{m})
2376 @standards{GNU, sys/stat.h}
2377 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2378 This macro returns non-zero if the file is a socket. @xref{Sockets}.
2381 An alternate non-POSIX method of testing the file type is supported for
2382 compatibility with BSD. The mode can be bitwise AND-ed with
2383 @code{S_IFMT} to extract the file type code, and compared to the
2384 appropriate constant. For example,
2387 S_ISCHR (@var{mode})
2394 ((@var{mode} & S_IFMT) == S_IFCHR)
2397 @deftypevr Macro int S_IFMT
2398 @standards{BSD, sys/stat.h}
2399 This is a bit mask used to extract the file type code from a mode value.
2402 These are the symbolic names for the different file type codes:
2406 @standards{BSD, sys/stat.h}
2407 This is the file type constant of a directory file.
2410 @standards{BSD, sys/stat.h}
2411 This is the file type constant of a character-oriented device file.
2414 @standards{BSD, sys/stat.h}
2415 This is the file type constant of a block-oriented device file.
2418 @standards{BSD, sys/stat.h}
2419 This is the file type constant of a regular file.
2422 @standards{BSD, sys/stat.h}
2423 This is the file type constant of a symbolic link.
2426 @standards{BSD, sys/stat.h}
2427 This is the file type constant of a socket.
2430 @standards{BSD, sys/stat.h}
2431 This is the file type constant of a FIFO or pipe.
2434 The POSIX.1b standard introduced a few more objects which possibly can
2435 be implemented as objects in the filesystem. These are message queues,
2436 semaphores, and shared memory objects. To allow differentiating these
2437 objects from other files the POSIX standard introduced three new test
2438 macros. But unlike the other macros they do not take the value of the
2439 @code{st_mode} field as the parameter. Instead they expect a pointer to
2440 the whole @code{struct stat} structure.
2442 @deftypefn Macro int S_TYPEISMQ (struct stat *@var{s})
2443 @standards{POSIX, sys/stat.h}
2444 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2445 If the system implements POSIX message queues as distinct objects and the
2446 file is a message queue object, this macro returns a non-zero value.
2447 In all other cases the result is zero.
2450 @deftypefn Macro int S_TYPEISSEM (struct stat *@var{s})
2451 @standards{POSIX, sys/stat.h}
2452 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2453 If the system implements POSIX semaphores as distinct objects and the
2454 file is a semaphore object, this macro returns a non-zero value.
2455 In all other cases the result is zero.
2458 @deftypefn Macro int S_TYPEISSHM (struct stat *@var{s})
2459 @standards{POSIX, sys/stat.h}
2460 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2461 If the system implements POSIX shared memory objects as distinct objects
2462 and the file is a shared memory object, this macro returns a non-zero
2463 value. In all other cases the result is zero.
2467 @subsection File Owner
2469 @cindex owner of a file
2470 @cindex group owner of a file
2472 Every file has an @dfn{owner} which is one of the registered user names
2473 defined on the system. Each file also has a @dfn{group} which is one of
2474 the defined groups. The file owner can often be useful for showing you
2475 who edited the file (especially when you edit with GNU Emacs), but its
2476 main purpose is for access control.
2478 The file owner and group play a role in determining access because the
2479 file has one set of access permission bits for the owner, another set
2480 that applies to users who belong to the file's group, and a third set of
2481 bits that applies to everyone else. @xref{Access Permission}, for the
2482 details of how access is decided based on this data.
2484 When a file is created, its owner is set to the effective user ID of the
2485 process that creates it (@pxref{Process Persona}). The file's group ID
2486 may be set to either the effective group ID of the process, or the group
2487 ID of the directory that contains the file, depending on the system
2488 where the file is stored. When you access a remote file system, it
2489 behaves according to its own rules, not according to the system your
2490 program is running on. Thus, your program must be prepared to encounter
2491 either kind of behavior no matter what kind of system you run it on.
2495 You can change the owner and/or group owner of an existing file using
2496 the @code{chown} function. This is the primitive for the @code{chown}
2497 and @code{chgrp} shell commands.
2500 The prototype for this function is declared in @file{unistd.h}.
2502 @deftypefun int chown (const char *@var{filename}, uid_t @var{owner}, gid_t @var{group})
2503 @standards{POSIX.1, unistd.h}
2504 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2505 The @code{chown} function changes the owner of the file @var{filename} to
2506 @var{owner}, and its group owner to @var{group}.
2508 Changing the owner of the file on certain systems clears the set-user-ID
2509 and set-group-ID permission bits. (This is because those bits may not
2510 be appropriate for the new owner.) Other file permission bits are not
2513 The return value is @code{0} on success and @code{-1} on failure.
2514 In addition to the usual file name errors (@pxref{File Name Errors}),
2515 the following @code{errno} error conditions are defined for this function:
2519 This process lacks permission to make the requested change.
2521 Only privileged users or the file's owner can change the file's group.
2522 On most file systems, only privileged users can change the file owner;
2523 some file systems allow you to change the owner if you are currently the
2524 owner. When you access a remote file system, the behavior you encounter
2525 is determined by the system that actually holds the file, not by the
2526 system your program is running on.
2528 @xref{Options for Files}, for information about the
2529 @code{_POSIX_CHOWN_RESTRICTED} macro.
2532 The file is on a read-only file system.
2536 @deftypefun int fchown (int @var{filedes}, uid_t @var{owner}, gid_t @var{group})
2537 @standards{BSD, unistd.h}
2538 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2539 This is like @code{chown}, except that it changes the owner of the open
2540 file with descriptor @var{filedes}.
2542 The return value from @code{fchown} is @code{0} on success and @code{-1}
2543 on failure. The following @code{errno} error codes are defined for this
2548 The @var{filedes} argument is not a valid file descriptor.
2551 The @var{filedes} argument corresponds to a pipe or socket, not an ordinary
2555 This process lacks permission to make the requested change. For details
2556 see @code{chmod} above.
2559 The file resides on a read-only file system.
2563 @node Permission Bits
2564 @subsection The Mode Bits for Access Permission
2566 The @dfn{file mode}, stored in the @code{st_mode} field of the file
2567 attributes, contains two kinds of information: the file type code, and
2568 the access permission bits. This section discusses only the access
2569 permission bits, which control who can read or write the file.
2570 @xref{Testing File Type}, for information about the file type code.
2572 All of the symbols listed in this section are defined in the header file
2576 @cindex file permission bits
2577 These symbolic constants are defined for the file mode bits that control
2578 access permission for the file:
2583 @standards{POSIX.1, sys/stat.h}
2584 @standardsx{S_IREAD, BSD, sys/stat.h}
2585 Read permission bit for the owner of the file. On many systems this bit
2586 is 0400. @code{S_IREAD} is an obsolete synonym provided for BSD
2591 @standards{POSIX.1, sys/stat.h}
2592 @standardsx{S_IWRITE, BSD, sys/stat.h}
2593 Write permission bit for the owner of the file. Usually 0200.
2594 @w{@code{S_IWRITE}} is an obsolete synonym provided for BSD compatibility.
2598 @standards{POSIX.1, sys/stat.h}
2599 @standardsx{S_IEXEC, BSD, sys/stat.h}
2600 Execute (for ordinary files) or search (for directories) permission bit
2601 for the owner of the file. Usually 0100. @code{S_IEXEC} is an obsolete
2602 synonym provided for BSD compatibility.
2605 @standards{POSIX.1, sys/stat.h}
2606 This is equivalent to @samp{(S_IRUSR | S_IWUSR | S_IXUSR)}.
2609 @standards{POSIX.1, sys/stat.h}
2610 Read permission bit for the group owner of the file. Usually 040.
2613 @standards{POSIX.1, sys/stat.h}
2614 Write permission bit for the group owner of the file. Usually 020.
2617 @standards{POSIX.1, sys/stat.h}
2618 Execute or search permission bit for the group owner of the file.
2622 @standards{POSIX.1, sys/stat.h}
2623 This is equivalent to @samp{(S_IRGRP | S_IWGRP | S_IXGRP)}.
2626 @standards{POSIX.1, sys/stat.h}
2627 Read permission bit for other users. Usually 04.
2630 @standards{POSIX.1, sys/stat.h}
2631 Write permission bit for other users. Usually 02.
2634 @standards{POSIX.1, sys/stat.h}
2635 Execute or search permission bit for other users. Usually 01.
2638 @standards{POSIX.1, sys/stat.h}
2639 This is equivalent to @samp{(S_IROTH | S_IWOTH | S_IXOTH)}.
2642 @standards{POSIX, sys/stat.h}
2643 This is the set-user-ID on execute bit, usually 04000.
2644 @xref{How Change Persona}.
2647 @standards{POSIX, sys/stat.h}
2648 This is the set-group-ID on execute bit, usually 02000.
2649 @xref{How Change Persona}.
2653 @standards{BSD, sys/stat.h}
2654 This is the @dfn{sticky} bit, usually 01000.
2656 For a directory it gives permission to delete a file in that directory
2657 only if you own that file. Ordinarily, a user can either delete all the
2658 files in a directory or cannot delete any of them (based on whether the
2659 user has write permission for the directory). The same restriction
2660 applies---you must have both write permission for the directory and own
2661 the file you want to delete. The one exception is that the owner of the
2662 directory can delete any file in the directory, no matter who owns it
2663 (provided the owner has given himself write permission for the
2664 directory). This is commonly used for the @file{/tmp} directory, where
2665 anyone may create files but not delete files created by other users.
2667 Originally the sticky bit on an executable file modified the swapping
2668 policies of the system. Normally, when a program terminated, its pages
2669 in core were immediately freed and reused. If the sticky bit was set on
2670 the executable file, the system kept the pages in core for a while as if
2671 the program were still running. This was advantageous for a program
2672 likely to be run many times in succession. This usage is obsolete in
2673 modern systems. When a program terminates, its pages always remain in
2674 core as long as there is no shortage of memory in the system. When the
2675 program is next run, its pages will still be in core if no shortage
2676 arose since the last run.
2678 On some modern systems where the sticky bit has no useful meaning for an
2679 executable file, you cannot set the bit at all for a non-directory.
2680 If you try, @code{chmod} fails with @code{EFTYPE};
2681 @pxref{Setting Permissions}.
2683 Some systems (particularly SunOS) have yet another use for the sticky
2684 bit. If the sticky bit is set on a file that is @emph{not} executable,
2685 it means the opposite: never cache the pages of this file at all. The
2686 main use of this is for the files on an NFS server machine which are
2687 used as the swap area of diskless client machines. The idea is that the
2688 pages of the file will be cached in the client's memory, so it is a
2689 waste of the server's memory to cache them a second time. With this
2690 usage the sticky bit also implies that the filesystem may fail to record
2691 the file's modification time onto disk reliably (the idea being that
2692 no-one cares for a swap file).
2694 This bit is only available on BSD systems (and those derived from
2695 them). Therefore one has to use the @code{_GNU_SOURCE} feature select
2696 macro, or not define any feature test macros, to get the definition
2697 (@pxref{Feature Test Macros}).
2700 The actual bit values of the symbols are listed in the table above
2701 so you can decode file mode values when debugging your programs.
2702 These bit values are correct for most systems, but they are not
2705 @strong{Warning:} Writing explicit numbers for file permissions is bad
2706 practice. Not only is it not portable, it also requires everyone who
2707 reads your program to remember what the bits mean. To make your program
2708 clean use the symbolic names.
2710 @node Access Permission
2711 @subsection How Your Access to a File is Decided
2712 @cindex permission to access a file
2713 @cindex access permission for a file
2714 @cindex file access permission
2716 Recall that the operating system normally decides access permission for
2717 a file based on the effective user and group IDs of the process and its
2718 supplementary group IDs, together with the file's owner, group and
2719 permission bits. These concepts are discussed in detail in @ref{Process
2722 If the effective user ID of the process matches the owner user ID of the
2723 file, then permissions for read, write, and execute/search are
2724 controlled by the corresponding ``user'' (or ``owner'') bits. Likewise,
2725 if any of the effective group ID or supplementary group IDs of the
2726 process matches the group owner ID of the file, then permissions are
2727 controlled by the ``group'' bits. Otherwise, permissions are controlled
2728 by the ``other'' bits.
2730 Privileged users, like @samp{root}, can access any file regardless of
2731 its permission bits. As a special case, for a file to be executable
2732 even by a privileged user, at least one of its execute bits must be set.
2734 @node Setting Permissions
2735 @subsection Assigning File Permissions
2737 @cindex file creation mask
2739 The primitive functions for creating files (for example, @code{open} or
2740 @code{mkdir}) take a @var{mode} argument, which specifies the file
2741 permissions to give the newly created file. This mode is modified by
2742 the process's @dfn{file creation mask}, or @dfn{umask}, before it is
2745 The bits that are set in the file creation mask identify permissions
2746 that are always to be disabled for newly created files. For example, if
2747 you set all the ``other'' access bits in the mask, then newly created
2748 files are not accessible at all to processes in the ``other'' category,
2749 even if the @var{mode} argument passed to the create function would
2750 permit such access. In other words, the file creation mask is the
2751 complement of the ordinary access permissions you want to grant.
2753 Programs that create files typically specify a @var{mode} argument that
2754 includes all the permissions that make sense for the particular file.
2755 For an ordinary file, this is typically read and write permission for
2756 all classes of users. These permissions are then restricted as
2757 specified by the individual user's own file creation mask.
2760 To change the permission of an existing file given its name, call
2761 @code{chmod}. This function uses the specified permission bits and
2762 ignores the file creation mask.
2765 In normal use, the file creation mask is initialized by the user's login
2766 shell (using the @code{umask} shell command), and inherited by all
2767 subprocesses. Application programs normally don't need to worry about
2768 the file creation mask. It will automatically do what it is supposed to
2771 When your program needs to create a file and bypass the umask for its
2772 access permissions, the easiest way to do this is to use @code{fchmod}
2773 after opening the file, rather than changing the umask. In fact,
2774 changing the umask is usually done only by shells. They use the
2775 @code{umask} function.
2777 The functions in this section are declared in @file{sys/stat.h}.
2780 @deftypefun mode_t umask (mode_t @var{mask})
2781 @standards{POSIX.1, sys/stat.h}
2782 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2783 The @code{umask} function sets the file creation mask of the current
2784 process to @var{mask}, and returns the previous value of the file
2787 Here is an example showing how to read the mask with @code{umask}
2788 without changing it permanently:
2794 mode_t mask = umask (0);
2801 However, on @gnuhurdsystems{} it is better to use @code{getumask} if
2802 you just want to read the mask value, because it is reentrant.
2805 @deftypefun mode_t getumask (void)
2806 @standards{GNU, sys/stat.h}
2807 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2808 Return the current value of the file creation mask for the current
2809 process. This function is a GNU extension and is only available on
2813 @deftypefun int chmod (const char *@var{filename}, mode_t @var{mode})
2814 @standards{POSIX.1, sys/stat.h}
2815 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2816 The @code{chmod} function sets the access permission bits for the file
2817 named by @var{filename} to @var{mode}.
2819 If @var{filename} is a symbolic link, @code{chmod} changes the
2820 permissions of the file pointed to by the link, not those of the link
2823 This function returns @code{0} if successful and @code{-1} if not. In
2824 addition to the usual file name errors (@pxref{File Name
2825 Errors}), the following @code{errno} error conditions are defined for
2830 The named file doesn't exist.
2833 This process does not have permission to change the access permissions
2834 of this file. Only the file's owner (as judged by the effective user ID
2835 of the process) or a privileged user can change them.
2838 The file resides on a read-only file system.
2841 @var{mode} has the @code{S_ISVTX} bit (the ``sticky bit'') set,
2842 and the named file is not a directory. Some systems do not allow setting the
2843 sticky bit on non-directory files, and some do (and only some of those
2844 assign a useful meaning to the bit for non-directory files).
2846 You only get @code{EFTYPE} on systems where the sticky bit has no useful
2847 meaning for non-directory files, so it is always safe to just clear the
2848 bit in @var{mode} and call @code{chmod} again. @xref{Permission Bits},
2849 for full details on the sticky bit.
2853 @deftypefun int fchmod (int @var{filedes}, mode_t @var{mode})
2854 @standards{BSD, sys/stat.h}
2855 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2856 This is like @code{chmod}, except that it changes the permissions of the
2857 currently open file given by @var{filedes}.
2859 The return value from @code{fchmod} is @code{0} on success and @code{-1}
2860 on failure. The following @code{errno} error codes are defined for this
2865 The @var{filedes} argument is not a valid file descriptor.
2868 The @var{filedes} argument corresponds to a pipe or socket, or something
2869 else that doesn't really have access permissions.
2872 This process does not have permission to change the access permissions
2873 of this file. Only the file's owner (as judged by the effective user ID
2874 of the process) or a privileged user can change them.
2877 The file resides on a read-only file system.
2881 @node Testing File Access
2882 @subsection Testing Permission to Access a File
2883 @cindex testing access permission
2884 @cindex access, testing for
2885 @cindex setuid programs and file access
2887 In some situations it is desirable to allow programs to access files or
2888 devices even if this is not possible with the permissions granted to the
2889 user. One possible solution is to set the setuid-bit of the program
2890 file. If such a program is started the @emph{effective} user ID of the
2891 process is changed to that of the owner of the program file. So to
2892 allow write access to files like @file{/etc/passwd}, which normally can
2893 be written only by the super-user, the modifying program will have to be
2894 owned by @code{root} and the setuid-bit must be set.
2896 But besides the files the program is intended to change the user should
2897 not be allowed to access any file to which s/he would not have access
2898 anyway. The program therefore must explicitly check whether @emph{the
2899 user} would have the necessary access to a file, before it reads or
2902 To do this, use the function @code{access}, which checks for access
2903 permission based on the process's @emph{real} user ID rather than the
2904 effective user ID. (The setuid feature does not alter the real user ID,
2905 so it reflects the user who actually ran the program.)
2907 There is another way you could check this access, which is easy to
2908 describe, but very hard to use. This is to examine the file mode bits
2909 and mimic the system's own access computation. This method is
2910 undesirable because many systems have additional access control
2911 features; your program cannot portably mimic them, and you would not
2912 want to try to keep track of the diverse features that different systems
2913 have. Using @code{access} is simple and automatically does whatever is
2914 appropriate for the system you are using.
2916 @code{access} is @emph{only} appropriate to use in setuid programs.
2917 A non-setuid program will always use the effective ID rather than the
2921 The symbols in this section are declared in @file{unistd.h}.
2923 @deftypefun int access (const char *@var{filename}, int @var{how})
2924 @standards{POSIX.1, unistd.h}
2925 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
2926 The @code{access} function checks to see whether the file named by
2927 @var{filename} can be accessed in the way specified by the @var{how}
2928 argument. The @var{how} argument either can be the bitwise OR of the
2929 flags @code{R_OK}, @code{W_OK}, @code{X_OK}, or the existence test
2932 This function uses the @emph{real} user and group IDs of the calling
2933 process, rather than the @emph{effective} IDs, to check for access
2934 permission. As a result, if you use the function from a @code{setuid}
2935 or @code{setgid} program (@pxref{How Change Persona}), it gives
2936 information relative to the user who actually ran the program.
2938 The return value is @code{0} if the access is permitted, and @code{-1}
2939 otherwise. (In other words, treated as a predicate function,
2940 @code{access} returns true if the requested access is @emph{denied}.)
2942 In addition to the usual file name errors (@pxref{File Name
2943 Errors}), the following @code{errno} error conditions are defined for
2948 The access specified by @var{how} is denied.
2951 The file doesn't exist.
2954 Write permission was requested for a file on a read-only file system.
2958 These macros are defined in the header file @file{unistd.h} for use
2959 as the @var{how} argument to the @code{access} function. The values
2960 are integer constants.
2963 @deftypevr Macro int R_OK
2964 @standards{POSIX.1, unistd.h}
2965 Flag meaning test for read permission.
2968 @deftypevr Macro int W_OK
2969 @standards{POSIX.1, unistd.h}
2970 Flag meaning test for write permission.
2973 @deftypevr Macro int X_OK
2974 @standards{POSIX.1, unistd.h}
2975 Flag meaning test for execute/search permission.
2978 @deftypevr Macro int F_OK
2979 @standards{POSIX.1, unistd.h}
2980 Flag meaning test for existence of the file.
2984 @subsection File Times
2986 @cindex file access time
2987 @cindex file modification time
2988 @cindex file attribute modification time
2989 Each file has three time stamps associated with it: its access time,
2990 its modification time, and its attribute modification time. These
2991 correspond to the @code{st_atime}, @code{st_mtime}, and @code{st_ctime}
2992 members of the @code{stat} structure; see @ref{File Attributes}.
2994 All of these times are represented in calendar time format, as
2995 @code{time_t} objects. This data type is defined in @file{time.h}.
2996 For more information about representation and manipulation of time
2997 values, see @ref{Calendar Time}.
3000 Reading from a file updates its access time attribute, and writing
3001 updates its modification time. When a file is created, all three
3002 time stamps for that file are set to the current time. In addition, the
3003 attribute change time and modification time fields of the directory that
3004 contains the new entry are updated.
3006 Adding a new name for a file with the @code{link} function updates the
3007 attribute change time field of the file being linked, and both the
3008 attribute change time and modification time fields of the directory
3009 containing the new name. These same fields are affected if a file name
3010 is deleted with @code{unlink}, @code{remove} or @code{rmdir}. Renaming
3011 a file with @code{rename} affects only the attribute change time and
3012 modification time fields of the two parent directories involved, and not
3013 the times for the file being renamed.
3015 Changing the attributes of a file (for example, with @code{chmod})
3016 updates its attribute change time field.
3018 You can also change some of the time stamps of a file explicitly using
3019 the @code{utime} function---all except the attribute change time. You
3020 need to include the header file @file{utime.h} to use this facility.
3023 @deftp {Data Type} {struct utimbuf}
3024 @standards{POSIX.1, utime.h}
3025 The @code{utimbuf} structure is used with the @code{utime} function to
3026 specify new access and modification times for a file. It contains the
3031 This is the access time for the file.
3033 @item time_t modtime
3034 This is the modification time for the file.
3038 @deftypefun int utime (const char *@var{filename}, const struct utimbuf *@var{times})
3039 @standards{POSIX.1, utime.h}
3040 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3041 @c In the absence of a utime syscall, it non-atomically converts times
3042 @c to a struct timeval and calls utimes.
3043 This function is used to modify the file times associated with the file
3044 named @var{filename}.
3046 If @var{times} is a null pointer, then the access and modification times
3047 of the file are set to the current time. Otherwise, they are set to the
3048 values from the @code{actime} and @code{modtime} members (respectively)
3049 of the @code{utimbuf} structure pointed to by @var{times}.
3051 The attribute modification time for the file is set to the current time
3052 in either case (since changing the time stamps is itself a modification
3053 of the file attributes).
3055 The @code{utime} function returns @code{0} if successful and @code{-1}
3056 on failure. In addition to the usual file name errors
3057 (@pxref{File Name Errors}), the following @code{errno} error conditions
3058 are defined for this function:
3062 There is a permission problem in the case where a null pointer was
3063 passed as the @var{times} argument. In order to update the time stamp on
3064 the file, you must either be the owner of the file, have write
3065 permission for the file, or be a privileged user.
3068 The file doesn't exist.
3071 If the @var{times} argument is not a null pointer, you must either be
3072 the owner of the file or be a privileged user.
3075 The file lives on a read-only file system.
3079 Each of the three time stamps has a corresponding microsecond part,
3080 which extends its resolution. These fields are called
3081 @code{st_atime_usec}, @code{st_mtime_usec}, and @code{st_ctime_usec};
3082 each has a value between 0 and 999,999, which indicates the time in
3083 microseconds. They correspond to the @code{tv_usec} field of a
3084 @code{timeval} structure; see @ref{Time Types}.
3086 The @code{utimes} function is like @code{utime}, but also lets you specify
3087 the fractional part of the file times. The prototype for this function is
3088 in the header file @file{sys/time.h}.
3091 @deftypefun int utimes (const char *@var{filename}, const struct timeval @var{tvp}@t{[2]})
3092 @standards{BSD, sys/time.h}
3093 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3094 @c In the absence of a utimes syscall, it non-atomically converts tvp
3095 @c to struct timespec array and issues a utimensat syscall, or to
3096 @c struct utimbuf and calls utime.
3097 This function sets the file access and modification times of the file
3098 @var{filename}. The new file access time is specified by
3099 @code{@var{tvp}[0]}, and the new modification time by
3100 @code{@var{tvp}[1]}. Similar to @code{utime}, if @var{tvp} is a null
3101 pointer then the access and modification times of the file are set to
3102 the current time. This function comes from BSD.
3104 The return values and error conditions are the same as for the @code{utime}
3108 @deftypefun int lutimes (const char *@var{filename}, const struct timeval @var{tvp}@t{[2]})
3109 @standards{BSD, sys/time.h}
3110 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3111 @c Since there's no lutimes syscall, it non-atomically converts tvp
3112 @c to struct timespec array and issues a utimensat syscall.
3113 This function is like @code{utimes}, except that it does not follow
3114 symbolic links. If @var{filename} is the name of a symbolic link,
3115 @code{lutimes} sets the file access and modification times of the
3116 symbolic link special file itself (as seen by @code{lstat};
3117 @pxref{Symbolic Links}) while @code{utimes} sets the file access and
3118 modification times of the file the symbolic link refers to. This
3119 function comes from FreeBSD, and is not available on all platforms (if
3120 not available, it will fail with @code{ENOSYS}).
3122 The return values and error conditions are the same as for the @code{utime}
3126 @deftypefun int futimes (int @var{fd}, const struct timeval @var{tvp}@t{[2]})
3127 @standards{BSD, sys/time.h}
3128 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3129 @c Since there's no futimes syscall, it non-atomically converts tvp
3130 @c to struct timespec array and issues a utimensat syscall, falling back
3131 @c to utimes on a /proc/self/fd symlink.
3132 This function is like @code{utimes}, except that it takes an open file
3133 descriptor as an argument instead of a file name. @xref{Low-Level
3134 I/O}. This function comes from FreeBSD, and is not available on all
3135 platforms (if not available, it will fail with @code{ENOSYS}).
3137 Like @code{utimes}, @code{futimes} returns @code{0} on success and @code{-1}
3138 on failure. The following @code{errno} error conditions are defined for
3143 There is a permission problem in the case where a null pointer was
3144 passed as the @var{times} argument. In order to update the time stamp on
3145 the file, you must either be the owner of the file, have write
3146 permission for the file, or be a privileged user.
3149 The @var{filedes} argument is not a valid file descriptor.
3152 If the @var{times} argument is not a null pointer, you must either be
3153 the owner of the file or be a privileged user.
3156 The file lives on a read-only file system.
3161 @subsection File Size
3163 Normally file sizes are maintained automatically. A file begins with a
3164 size of @math{0} and is automatically extended when data is written past
3165 its end. It is also possible to empty a file completely by an
3166 @code{open} or @code{fopen} call.
3168 However, sometimes it is necessary to @emph{reduce} the size of a file.
3169 This can be done with the @code{truncate} and @code{ftruncate} functions.
3170 They were introduced in BSD Unix. @code{ftruncate} was later added to
3173 Some systems allow you to extend a file (creating holes) with these
3174 functions. This is useful when using memory-mapped I/O
3175 (@pxref{Memory-mapped I/O}), where files are not automatically extended.
3176 However, it is not portable but must be implemented if @code{mmap}
3177 allows mapping of files (i.e., @code{_POSIX_MAPPED_FILES} is defined).
3179 Using these functions on anything other than a regular file gives
3180 @emph{undefined} results. On many systems, such a call will appear to
3181 succeed, without actually accomplishing anything.
3183 @deftypefun int truncate (const char *@var{filename}, off_t @var{length})
3184 @standards{X/Open, unistd.h}
3185 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3186 @c In the absence of a truncate syscall, we use open and ftruncate.
3188 The @code{truncate} function changes the size of @var{filename} to
3189 @var{length}. If @var{length} is shorter than the previous length, data
3190 at the end will be lost. The file must be writable by the user to
3191 perform this operation.
3193 If @var{length} is longer, holes will be added to the end. However, some
3194 systems do not support this feature and will leave the file unchanged.
3196 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} the
3197 @code{truncate} function is in fact @code{truncate64} and the type
3198 @code{off_t} has 64 bits which makes it possible to handle files up to
3199 @twoexp{63} bytes in length.
3201 The return value is @math{0} for success, or @math{-1} for an error. In
3202 addition to the usual file name errors, the following errors may occur:
3207 The file is a directory or not writable.
3210 @var{length} is negative.
3213 The operation would extend the file beyond the limits of the operating system.
3216 A hardware I/O error occurred.
3219 The file is "append-only" or "immutable".
3222 The operation was interrupted by a signal.
3228 @deftypefun int truncate64 (const char *@var{name}, off64_t @var{length})
3229 @standards{Unix98, unistd.h}
3230 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3231 @c In the absence of a syscall, try truncate if length fits.
3232 This function is similar to the @code{truncate} function. The
3233 difference is that the @var{length} argument is 64 bits wide even on 32
3234 bits machines, which allows the handling of files with sizes up to
3237 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} on a
3238 32 bits machine this function is actually available under the name
3239 @code{truncate} and so transparently replaces the 32 bits interface.
3242 @deftypefun int ftruncate (int @var{fd}, off_t @var{length})
3243 @standards{POSIX, unistd.h}
3244 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3246 This is like @code{truncate}, but it works on a file descriptor @var{fd}
3247 for an opened file instead of a file name to identify the object. The
3248 file must be opened for writing to successfully carry out the operation.
3250 The POSIX standard leaves it implementation defined what happens if the
3251 specified new @var{length} of the file is bigger than the original size.
3252 The @code{ftruncate} function might simply leave the file alone and do
3253 nothing or it can increase the size to the desired size. In this later
3254 case the extended area should be zero-filled. So using @code{ftruncate}
3255 is no reliable way to increase the file size but if it is possible it is
3256 probably the fastest way. The function also operates on POSIX shared
3257 memory segments if these are implemented by the system.
3259 @code{ftruncate} is especially useful in combination with @code{mmap}.
3260 Since the mapped region must have a fixed size one cannot enlarge the
3261 file by writing something beyond the last mapped page. Instead one has
3262 to enlarge the file itself and then remap the file with the new size.
3263 The example below shows how this works.
3265 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} the
3266 @code{ftruncate} function is in fact @code{ftruncate64} and the type
3267 @code{off_t} has 64 bits which makes it possible to handle files up to
3268 @twoexp{63} bytes in length.
3270 The return value is @math{0} for success, or @math{-1} for an error. The
3271 following errors may occur:
3276 @var{fd} does not correspond to an open file.
3279 @var{fd} is a directory or not open for writing.
3282 @var{length} is negative.
3285 The operation would extend the file beyond the limits of the operating system.
3286 @c or the open() call -- with the not-yet-discussed feature of opening
3287 @c files with extra-large offsets.
3290 A hardware I/O error occurred.
3293 The file is "append-only" or "immutable".
3296 The operation was interrupted by a signal.
3298 @c ENOENT is also possible on Linux --- however it only occurs if the file
3299 @c descriptor has a `file' structure but no `inode' structure. I'm not
3300 @c sure how such an fd could be created. Perhaps it's a bug.
3306 @deftypefun int ftruncate64 (int @var{id}, off64_t @var{length})
3307 @standards{Unix98, unistd.h}
3308 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3309 @c In the absence of a syscall, try ftruncate if length fits.
3310 This function is similar to the @code{ftruncate} function. The
3311 difference is that the @var{length} argument is 64 bits wide even on 32
3312 bits machines which allows the handling of files with sizes up to
3315 When the source file is compiled with @code{_FILE_OFFSET_BITS == 64} on a
3316 32 bits machine this function is actually available under the name
3317 @code{ftruncate} and so transparently replaces the 32 bits interface.
3320 As announced here is a little example of how to use @code{ftruncate} in
3321 combination with @code{mmap}:
3329 add (off_t at, void *block, size_t size)
3331 if (at + size > len)
3333 /* Resize the file and remap. */
3334 size_t ps = sysconf (_SC_PAGESIZE);
3335 size_t ns = (at + size + ps - 1) & ~(ps - 1);
3337 if (ftruncate (fd, ns) < 0)
3339 np = mmap (NULL, ns, PROT_READ|PROT_WRITE, MAP_SHARED, fd, 0);
3340 if (np == MAP_FAILED)
3345 memcpy ((char *) start + at, block, size);
3350 The function @code{add} writes a block of memory at an arbitrary
3351 position in the file. If the current size of the file is too small it
3352 is extended. Note that it is extended by a whole number of pages. This
3353 is a requirement of @code{mmap}. The program has to keep track of the
3354 real size, and when it has finished a final @code{ftruncate} call should
3355 set the real size of the file.
3357 @node Storage Allocation
3358 @subsection Storage Allocation
3359 @cindex allocating file storage
3360 @cindex file allocation
3361 @cindex storage allocating
3363 @cindex file fragmentation
3364 @cindex fragmentation of files
3365 @cindex sparse files
3366 @cindex files, sparse
3367 Most file systems support allocating large files in a non-contiguous
3368 fashion: the file is split into @emph{fragments} which are allocated
3369 sequentially, but the fragments themselves can be scattered across the
3370 disk. File systems generally try to avoid such fragmentation because it
3371 decreases performance, but if a file gradually increases in size, there
3372 might be no other option than to fragment it. In addition, many file
3373 systems support @emph{sparse files} with @emph{holes}: regions of null
3374 bytes for which no backing storage has been allocated by the file
3375 system. When the holes are finally overwritten with data, fragmentation
3378 Explicit allocation of storage for yet-unwritten parts of the file can
3379 help the system to avoid fragmentation. Additionally, if storage
3380 pre-allocation fails, it is possible to report the out-of-disk error
3381 early, often without filling up the entire disk. However, due to
3382 deduplication, copy-on-write semantics, and file compression, such
3383 pre-allocation may not reliably prevent the out-of-disk-space error from
3384 occurring later. Checking for write errors is still required, and
3385 writes to memory-mapped regions created with @code{mmap} can still
3386 result in @code{SIGBUS}.
3388 @deftypefun int posix_fallocate (int @var{fd}, off_t @var{offset}, off_t @var{length})
3389 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3390 @c If the file system does not support allocation,
3391 @c @code{posix_fallocate} has a race with file extension (if
3392 @c @var{length} is zero) or with concurrent writes of non-NUL bytes (if
3393 @c @var{length} is positive).
3395 Allocate backing store for the region of @var{length} bytes starting at
3396 byte @var{offset} in the file for the descriptor @var{fd}. The file
3397 length is increased to @samp{@var{length} + @var{offset}} if necessary.
3399 @var{fd} must be a regular file opened for writing, or @code{EBADF} is
3400 returned. If there is insufficient disk space to fulfill the allocation
3401 request, @code{ENOSPC} is returned.
3403 @strong{Note:} If @code{fallocate} is not available (because the file
3404 system does not support it), @code{posix_fallocate} is emulated, which
3405 has the following drawbacks:
3409 It is very inefficient because all file system blocks in the requested
3410 range need to be examined (even if they have been allocated before) and
3411 potentially rewritten. In contrast, with proper @code{fallocate}
3412 support (see below), the file system can examine the internal file
3413 allocation data structures and eliminate holes directly, maybe even
3414 using unwritten extents (which are pre-allocated but uninitialized on
3418 There is a race condition if another thread or process modifies the
3419 underlying file in the to-be-allocated area. Non-null bytes could be
3420 overwritten with null bytes.
3423 If @var{fd} has been opened with the @code{O_WRONLY} flag, the function
3424 will fail with an @code{errno} value of @code{EBADF}.
3427 If @var{fd} has been opened with the @code{O_APPEND} flag, the function
3428 will fail with an @code{errno} value of @code{EBADF}.
3431 If @var{length} is zero, @code{ftruncate} is used to increase the file
3432 size as requested, without allocating file system blocks. There is a
3433 race condition which means that @code{ftruncate} can accidentally
3434 truncate the file if it has been extended concurrently.
3437 On Linux, if an application does not benefit from emulation or if the
3438 emulation is harmful due to its inherent race conditions, the
3439 application can use the Linux-specific @code{fallocate} function, with a
3440 zero flag argument. For the @code{fallocate} function, @theglibc{} does
3441 not perform allocation emulation if the file system does not support
3442 allocation. Instead, an @code{EOPNOTSUPP} is returned to the caller.
3446 @deftypefun int posix_fallocate64 (int @var{fd}, off64_t @var{offset}, off64_t @var{length})
3447 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3449 This function is a variant of @code{posix_fallocate64} which accepts
3450 64-bit file offsets on all platforms.
3454 @node Making Special Files
3455 @section Making Special Files
3456 @cindex creating special files
3457 @cindex special files
3459 The @code{mknod} function is the primitive for making special files,
3460 such as files that correspond to devices. @Theglibc{} includes
3461 this function for compatibility with BSD.
3463 The prototype for @code{mknod} is declared in @file{sys/stat.h}.
3466 @deftypefun int mknod (const char *@var{filename}, mode_t @var{mode}, dev_t @var{dev})
3467 @standards{BSD, sys/stat.h}
3468 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3469 @c Instead of issuing the syscall directly, we go through xmknod.
3470 @c Although the internal xmknod takes a dev_t*, that could lead to
3471 @c @mtsrace races, it's passed a pointer to mknod's dev.
3472 The @code{mknod} function makes a special file with name @var{filename}.
3473 The @var{mode} specifies the mode of the file, and may include the various
3474 special file bits, such as @code{S_IFCHR} (for a character special file)
3475 or @code{S_IFBLK} (for a block special file). @xref{Testing File Type}.
3477 The @var{dev} argument specifies which device the special file refers to.
3478 Its exact interpretation depends on the kind of special file being created.
3480 The return value is @code{0} on success and @code{-1} on error. In addition
3481 to the usual file name errors (@pxref{File Name Errors}), the
3482 following @code{errno} error conditions are defined for this function:
3486 The calling process is not privileged. Only the superuser can create
3490 The directory or file system that would contain the new file is full
3491 and cannot be extended.
3494 The directory containing the new file can't be modified because it's on
3495 a read-only file system.
3498 There is already a file named @var{filename}. If you want to replace
3499 this file, you must remove the old file explicitly first.
3503 @node Temporary Files
3504 @section Temporary Files
3506 If you need to use a temporary file in your program, you can use the
3507 @code{tmpfile} function to open it. Or you can use the @code{tmpnam}
3508 (better: @code{tmpnam_r}) function to provide a name for a temporary
3509 file and then you can open it in the usual way with @code{fopen}.
3511 The @code{tempnam} function is like @code{tmpnam} but lets you choose
3512 what directory temporary files will go in, and something about what
3513 their file names will look like. Important for multi-threaded programs
3514 is that @code{tempnam} is reentrant, while @code{tmpnam} is not since it
3515 returns a pointer to a static buffer.
3517 These facilities are declared in the header file @file{stdio.h}.
3520 @deftypefun {FILE *} tmpfile (void)
3521 @standards{ISO, stdio.h}
3522 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acsmem{} @acsfd{} @aculock{}}}
3523 @c The unsafety issues are those of fdopen, plus @acsfd because of the
3525 @c __path_search (internal buf, !dir, const pfx, !try_tmpdir) ok
3526 @c libc_secure_genenv only if try_tmpdir
3527 @c xstat64, strlen, strcmp, sprintf
3528 @c __gen_tempname (internal tmpl, __GT_FILE) ok
3529 @c strlen, memcmp, getpid, open/mkdir/lxstat64 ok
3530 @c HP_TIMING_NOW if available ok
3531 @c gettimeofday (!tz) first time, or every time if no HP_TIMING_NOW ok
3532 @c static value is used and modified without synchronization ok
3533 @c but the use is as a source of non-cryptographic randomness
3534 @c with retries in case of collision, so it should be safe
3536 This function creates a temporary binary file for update mode, as if by
3537 calling @code{fopen} with mode @code{"wb+"}. The file is deleted
3538 automatically when it is closed or when the program terminates. (On
3539 some other @w{ISO C} systems the file may fail to be deleted if the program
3540 terminates abnormally).
3542 This function is reentrant.
3544 When the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a
3545 32-bit system this function is in fact @code{tmpfile64}, i.e., the LFS
3546 interface transparently replaces the old interface.
3549 @deftypefun {FILE *} tmpfile64 (void)
3550 @standards{Unix98, stdio.h}
3551 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acsmem{} @acsfd{} @aculock{}}}
3552 This function is similar to @code{tmpfile}, but the stream it returns a
3553 pointer to was opened using @code{tmpfile64}. Therefore this stream can
3554 be used for files larger than @twoexp{31} bytes on 32-bit machines.
3556 Please note that the return type is still @code{FILE *}. There is no
3557 special @code{FILE} type for the LFS interface.
3559 If the sources are compiled with @code{_FILE_OFFSET_BITS == 64} on a 32
3560 bits machine this function is available under the name @code{tmpfile}
3561 and so transparently replaces the old interface.
3564 @deftypefun {char *} tmpnam (char *@var{result})
3565 @standards{ISO, stdio.h}
3566 @safety{@prelim{}@mtunsafe{@mtasurace{:tmpnam/!result}}@asunsafe{}@acsafe{}}
3567 @c The passed-in buffer should not be modified concurrently with the
3569 @c __path_search (static or passed-in buf, !dir, !pfx, !try_tmpdir) ok
3570 @c __gen_tempname (internal tmpl, __GT_NOCREATE) ok
3571 This function constructs and returns a valid file name that does not
3572 refer to any existing file. If the @var{result} argument is a null
3573 pointer, the return value is a pointer to an internal static string,
3574 which might be modified by subsequent calls and therefore makes this
3575 function non-reentrant. Otherwise, the @var{result} argument should be
3576 a pointer to an array of at least @code{L_tmpnam} characters, and the
3577 result is written into that array.
3579 It is possible for @code{tmpnam} to fail if you call it too many times
3580 without removing previously-created files. This is because the limited
3581 length of the temporary file names gives room for only a finite number
3582 of different names. If @code{tmpnam} fails it returns a null pointer.
3584 @strong{Warning:} Between the time the pathname is constructed and the
3585 file is created another process might have created a file with the same
3586 name using @code{tmpnam}, leading to a possible security hole. The
3587 implementation generates names which can hardly be predicted, but when
3588 opening the file you should use the @code{O_EXCL} flag. Using
3589 @code{tmpfile} or @code{mkstemp} is a safe way to avoid this problem.
3592 @deftypefun {char *} tmpnam_r (char *@var{result})
3593 @standards{GNU, stdio.h}
3594 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3595 This function is nearly identical to the @code{tmpnam} function, except
3596 that if @var{result} is a null pointer it returns a null pointer.
3598 This guarantees reentrancy because the non-reentrant situation of
3599 @code{tmpnam} cannot happen here.
3601 @strong{Warning}: This function has the same security problems as
3605 @deftypevr Macro int L_tmpnam
3606 @standards{ISO, stdio.h}
3607 The value of this macro is an integer constant expression that
3608 represents the minimum size of a string large enough to hold a file name
3609 generated by the @code{tmpnam} function.
3612 @deftypevr Macro int TMP_MAX
3613 @standards{ISO, stdio.h}
3614 The macro @code{TMP_MAX} is a lower bound for how many temporary names
3615 you can create with @code{tmpnam}. You can rely on being able to call
3616 @code{tmpnam} at least this many times before it might fail saying you
3617 have made too many temporary file names.
3619 With @theglibc{}, you can create a very large number of temporary
3620 file names. If you actually created the files, you would probably run
3621 out of disk space before you ran out of names. Some other systems have
3622 a fixed, small limit on the number of temporary files. The limit is
3623 never less than @code{25}.
3626 @deftypefun {char *} tempnam (const char *@var{dir}, const char *@var{prefix})
3627 @standards{SVID, stdio.h}
3628 @safety{@prelim{}@mtsafe{@mtsenv{}}@asunsafe{@ascuheap{}}@acunsafe{@acsmem{}}}
3629 @c There's no way (short of being setuid) to avoid getenv("TMPDIR"),
3630 @c even with a non-NULL dir.
3632 @c __path_search (internal buf, dir, pfx, try_tmpdir) unsafe getenv
3633 @c __gen_tempname (internal tmpl, __GT_NOCREATE) ok
3635 This function generates a unique temporary file name. If @var{prefix}
3636 is not a null pointer, up to five characters of this string are used as
3637 a prefix for the file name. The return value is a string newly
3638 allocated with @code{malloc}, so you should release its storage with
3639 @code{free} when it is no longer needed.
3641 Because the string is dynamically allocated this function is reentrant.
3643 The directory prefix for the temporary file name is determined by
3644 testing each of the following in sequence. The directory must exist and
3649 The environment variable @code{TMPDIR}, if it is defined. For security
3650 reasons this only happens if the program is not SUID or SGID enabled.
3653 The @var{dir} argument, if it is not a null pointer.
3656 The value of the @code{P_tmpdir} macro.
3659 The directory @file{/tmp}.
3662 This function is defined for SVID compatibility.
3664 @strong{Warning:} Between the time the pathname is constructed and the
3665 file is created another process might have created a file with the same
3666 name using @code{tempnam}, leading to a possible security hole. The
3667 implementation generates names which can hardly be predicted, but when
3668 opening the file you should use the @code{O_EXCL} flag. Using
3669 @code{tmpfile} or @code{mkstemp} is a safe way to avoid this problem.
3671 @cindex TMPDIR environment variable
3673 @c !!! are we putting SVID/GNU/POSIX.1/BSD in here or not??
3674 @deftypevr {SVID Macro} {char *} P_tmpdir
3675 @standards{SVID, stdio.h}
3676 This macro is the name of the default directory for temporary files.
3679 Older Unix systems did not have the functions just described. Instead
3680 they used @code{mktemp} and @code{mkstemp}. Both of these functions
3681 work by modifying a file name template string you pass. The last six
3682 characters of this string must be @samp{XXXXXX}. These six @samp{X}s
3683 are replaced with six characters which make the whole string a unique
3684 file name. Usually the template string is something like
3685 @samp{/tmp/@var{prefix}XXXXXX}, and each program uses a unique @var{prefix}.
3687 @strong{NB:} Because @code{mktemp} and @code{mkstemp} modify the
3688 template string, you @emph{must not} pass string constants to them.
3689 String constants are normally in read-only storage, so your program
3690 would crash when @code{mktemp} or @code{mkstemp} tried to modify the
3691 string. These functions are declared in the header file @file{stdlib.h}.
3694 @deftypefun {char *} mktemp (char *@var{template})
3695 @standards{Unix, stdlib.h}
3696 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3697 @c __gen_tempname (caller tmpl, __GT_NOCREATE) ok
3698 The @code{mktemp} function generates a unique file name by modifying
3699 @var{template} as described above. If successful, it returns
3700 @var{template} as modified. If @code{mktemp} cannot find a unique file
3701 name, it makes @var{template} an empty string and returns that. If
3702 @var{template} does not end with @samp{XXXXXX}, @code{mktemp} returns a
3705 @strong{Warning:} Between the time the pathname is constructed and the
3706 file is created another process might have created a file with the same
3707 name using @code{mktemp}, leading to a possible security hole. The
3708 implementation generates names which can hardly be predicted, but when
3709 opening the file you should use the @code{O_EXCL} flag. Using
3710 @code{mkstemp} is a safe way to avoid this problem.
3713 @deftypefun int mkstemp (char *@var{template})
3714 @standards{BSD, stdlib.h}
3715 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{@acsfd{}}}
3716 @c __gen_tempname (caller tmpl, __GT_FILE) ok
3717 The @code{mkstemp} function generates a unique file name just as
3718 @code{mktemp} does, but it also opens the file for you with @code{open}
3719 (@pxref{Opening and Closing Files}). If successful, it modifies
3720 @var{template} in place and returns a file descriptor for that file open
3721 for reading and writing. If @code{mkstemp} cannot create a
3722 uniquely-named file, it returns @code{-1}. If @var{template} does not
3723 end with @samp{XXXXXX}, @code{mkstemp} returns @code{-1} and does not
3724 modify @var{template}.
3726 The file is opened using mode @code{0600}. If the file is meant to be
3727 used by other users this mode must be changed explicitly.
3730 Unlike @code{mktemp}, @code{mkstemp} is actually guaranteed to create a
3731 unique file that cannot possibly clash with any other program trying to
3732 create a temporary file. This is because it works by calling
3733 @code{open} with the @code{O_EXCL} flag, which says you want to create a
3734 new file and get an error if the file already exists.
3736 @deftypefun {char *} mkdtemp (char *@var{template})
3737 @standards{BSD, stdlib.h}
3738 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
3739 @c __gen_tempname (caller tmpl, __GT_DIR) ok
3740 The @code{mkdtemp} function creates a directory with a unique name. If
3741 it succeeds, it overwrites @var{template} with the name of the
3742 directory, and returns @var{template}. As with @code{mktemp} and
3743 @code{mkstemp}, @var{template} should be a string ending with
3746 If @code{mkdtemp} cannot create an uniquely named directory, it returns
3747 @code{NULL} and sets @code{errno} appropriately. If @var{template} does
3748 not end with @samp{XXXXXX}, @code{mkdtemp} returns @code{NULL} and does
3749 not modify @var{template}. @code{errno} will be set to @code{EINVAL} in
3752 The directory is created using mode @code{0700}.
3755 The directory created by @code{mkdtemp} cannot clash with temporary
3756 files or directories created by other users. This is because directory
3757 creation always works like @code{open} with @code{O_EXCL}.
3758 @xref{Creating Directories}.
3760 The @code{mkdtemp} function comes from OpenBSD.
3762 @c FIXME these are undocumented:
3767 @c fstatat (there's a commented-out safety assessment for this one)
3771 @c name_to_handle_at
3773 @c open_by_handle_at