This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH v3 04/18] Add string vectorized find and detection functions

From: Joseph Myers <joseph at codesourcery dot com>
To: Paul Eggert <eggert at cs dot ucla dot edu>
Cc: Adhemerval Zanella <adhemerval dot zanella at linaro dot org>, <libc-alpha at sourceware dot org>
Date: Fri, 12 Jan 2018 17:08:09 +0000
Subject: Re: [PATCH v3 04/18] Add string vectorized find and detection functions
Authentication-results: sourceware.org; auth=none
References: <1515588482-15744-1-git-send-email-adhemerval.zanella@linaro.org> <1515588482-15744-5-git-send-email-adhemerval.zanella@linaro.org> <b1c55ccd-24d4-1f68-61c8-87f486389b6f@cs.ucla.edu> <69b77164-ba4f-6847-ec93-164fe3f22662@linaro.org> <9fd53518-64b8-6e32-f92f-27b66151e3b1@cs.ucla.edu>

On Thu, 11 Jan 2018, Paul Eggert wrote:

> On 01/11/2018 10:54 AM, Adhemerval Zanella wrote:
> > > The Gnulib integer_length module has a faster implementation, at least for
> > > 32-bit platforms. Do we still care about 32-bit platforms? If so, you
> > > might want to take a look at it.
> > Do you mean the version which uses double to integer, the one with 6
> > comparisons
> > or the naive one?
> 
> I meant the one that converts int to double. It can be branchless since we
> assume the int is nonzero.

Looking at glibc architectures (and architectures with recently proposed 
ports):

* The following have clz patterns in GCC, unconditionally, meaning this 
glibc patch will always use __builtin_clz functions and any fallback code 
is irrelevant: aarch64 i386 ia64 powerpc tilegx x86_64.  (On ia64 the 
pattern uses conversion to floating point.)

* The following have clz patterns in GCC, conditionally: alpha arm m68k 
microblaze mips s390 sparc (and arc).  I have not checked whether in some 
of those cases the conditions might in fact be true for every 
configuration for which glibc can be built.

* The following lack clz patterns in GCC: hppa nios2 sh (and riscv).

If the configuration lacking clz is also soft-float, converting int to 
double is an extremely inefficient way ending up calling the libgcc clz 
implementation (both soft-fp and fp-bit use __builtin_clz).  I think 
that's sufficient reason to avoid an approach involving conversion to 
double unless an architecture has opted in to using it as an efficient 
approach on that architecture.

(For arm, for example, clz is supported if "TARGET_32BIT && arm_arch5", so 
the only configurations without __builtin_clz expanded inline by the 
compiler are armv4t ones - which are also all soft-float, so the expansion 
using double can never make sense for arm.)

-- 
Joseph S. Myers
joseph@codesourcery.com

Follow-Ups:
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Adhemerval Zanella

References:
- [PATCH v3 00/18] Improve generic string routines
  - From: Adhemerval Zanella
- [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Adhemerval Zanella
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Paul Eggert
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Adhemerval Zanella
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Paul Eggert

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]