This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH v3 04/18] Add string vectorized find and detection functions

From: Adhemerval Zanella <adhemerval dot zanella at linaro dot org>
To: Joseph Myers <joseph at codesourcery dot com>, Paul Eggert <eggert at cs dot ucla dot edu>
Cc: libc-alpha at sourceware dot org
Date: Fri, 12 Jan 2018 15:58:52 -0200
Subject: Re: [PATCH v3 04/18] Add string vectorized find and detection functions
Authentication-results: sourceware.org; auth=none
References: <1515588482-15744-1-git-send-email-adhemerval.zanella@linaro.org> <1515588482-15744-5-git-send-email-adhemerval.zanella@linaro.org> <b1c55ccd-24d4-1f68-61c8-87f486389b6f@cs.ucla.edu> <69b77164-ba4f-6847-ec93-164fe3f22662@linaro.org> <9fd53518-64b8-6e32-f92f-27b66151e3b1@cs.ucla.edu> <alpine.DEB.2.20.1801121643200.2172@digraph.polyomino.org.uk>


On 12/01/2018 15:08, Joseph Myers wrote:
> On Thu, 11 Jan 2018, Paul Eggert wrote:
> 
>> On 01/11/2018 10:54 AM, Adhemerval Zanella wrote:
>>>> The Gnulib integer_length module has a faster implementation, at least for
>>>> 32-bit platforms. Do we still care about 32-bit platforms? If so, you
>>>> might want to take a look at it.
>>> Do you mean the version which uses double to integer, the one with 6
>>> comparisons
>>> or the naive one?
>>
>> I meant the one that converts int to double. It can be branchless since we
>> assume the int is nonzero.
> 
> Looking at glibc architectures (and architectures with recently proposed 
> ports):
> 
> * The following have clz patterns in GCC, unconditionally, meaning this 
> glibc patch will always use __builtin_clz functions and any fallback code 
> is irrelevant: aarch64 i386 ia64 powerpc tilegx x86_64.  (On ia64 the 
> pattern uses conversion to floating point.)
> 
> * The following have clz patterns in GCC, conditionally: alpha arm m68k 
> microblaze mips s390 sparc (and arc).  I have not checked whether in some 
> of those cases the conditions might in fact be true for every 
> configuration for which glibc can be built.
> 
> * The following lack clz patterns in GCC: hppa nios2 sh (and riscv).
> 
> If the configuration lacking clz is also soft-float, converting int to 
> double is an extremely inefficient way ending up calling the libgcc clz 
> implementation (both soft-fp and fp-bit use __builtin_clz).  I think 
> that's sufficient reason to avoid an approach involving conversion to 
> double unless an architecture has opted in to using it as an efficient 
> approach on that architecture.

Thanks for remind about soft-float, also for some architectures that does
have hardware floating pointer units the int to/from float is also a costly
operation.

Regarding index_{first,last}_ fallback implementation, maybe simpler 
implementation which just check the mask bits instead of fallback ones for
leading/trailing zero bit should better, I am open to suggestions here.

> 
> (For arm, for example, clz is supported if "TARGET_32BIT && arm_arch5", so 
> the only configurations without __builtin_clz expanded inline by the 
> compiler are armv4t ones - which are also all soft-float, so the expansion 
> using double can never make sense for arm.)
>

References:
- [PATCH v3 00/18] Improve generic string routines
  - From: Adhemerval Zanella
- [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Adhemerval Zanella
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Paul Eggert
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Adhemerval Zanella
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Paul Eggert
- Re: [PATCH v3 04/18] Add string vectorized find and detection functions
  - From: Joseph Myers

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]