This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Re: [PATCH 00/10] Optimized math routines

From: Adhemerval Zanella <adhemerval dot zanella at linaro dot org>
To: Szabolcs Nagy <szabolcs dot nagy at arm dot com>, libc-alpha at sourceware dot org
Cc: nd at arm dot com
Date: Mon, 9 Jul 2018 11:26:56 -0300
Subject: Re: [PATCH 00/10] Optimized math routines
References: <d4963b4e-0013-adaf-df2f-698cf421a487@arm.com> <50b1b8b6-d03c-224f-31ff-8cf9d8268b6d@redhat.com> <b1105e5c-63e8-4edc-82f4-a986d1450961@arm.com> <bea47786-88ad-44dd-7bff-149bbae9145f@redhat.com> <dc44c9c9-f4da-e160-36d7-b644ca64f0b1@arm.com> <25ede414-21e9-bc19-c624-197c3435a346@arm.com> <ea5286c2-5863-44be-716a-f0d2b4b93edf@linaro.org> <414348c2-6538-c9c3-9c4d-2cd922f50cdc@arm.com>

On 09/07/2018 10:34, Szabolcs Nagy wrote:
> On 09/07/18 14:09, Adhemerval Zanella wrote:
>> On 09/07/2018 09:15, Szabolcs Nagy wrote:
>>> built and tested on a power8 machine now, glibc math
>>> tests pass (except for an unrelated fmal failure),
>>> benchmark improvements are consistent with aarch64/x86_64,
>>> but it was a shared access machine so i won't post exact
>>> numbers, sincosf improved a bit too, sinf/cosf didn't
>>> (apparently powerpc has its own implementation).
>>
>> PowerPC sinf/cosf uses the same algorithm used on x86, I presume
>> it would be a gain to generic implementation as well.
>>
> 
> you mean the new implementation would be better or the
> target specific one?
> 
> new implementation has better latency on this particular
> powerpc machine than the target specific code, but
> throughput is worse sometimes (using the default 0
> setting for PREFER_FLOAT_COMPARISON).

I did not measure, but I would expect so.  PowerPC uses a different
implementation for generic code (s_sinf-ppc64.c), so comparing against
it may be misleading (since it still uses the old implementation).
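
To make the latency versus throughput distinction quoted above concrete,
here is a rough sketch of how the two can be timed separately.  It is not
the glibc benchtests code: stand_in is a hypothetical placeholder for the
routine under test (a short polynomial, chosen only so the sketch links
without libm), and the helper names and the use of clock() are my own
assumptions.

```c
#include <stddef.h>
#include <time.h>

/* Hypothetical stand-in for the routine under test (not glibc's sinf):
   a short odd polynomial, so the sketch links without libm.  */
static float
stand_in (float x)
{
  float x2 = x * x;
  return x * (1.0f - x2 * (1.0f / 6.0f - x2 * (1.0f / 120.0f)));
}

/* Latency: every input depends on the previous result, so successive
   calls cannot overlap.  Returns CPU seconds per call.  */
static double
measure_latency (float (*f) (float), size_t n, volatile float *sink)
{
  float x = 0.5f;
  clock_t start = clock ();
  for (size_t i = 0; i < n; i++)
    x = f (x) + 0.25f;          /* dependent chain, stays bounded */
  clock_t end = clock ();
  *sink = x;                    /* keep the result observable */
  return (double) (end - start) / CLOCKS_PER_SEC / (double) n;
}

/* Throughput: inputs are independent of earlier results, so the CPU
   may overlap calls.  Returns CPU seconds per call.  */
static double
measure_throughput (float (*f) (float), size_t n, volatile float *sink)
{
  float sum = 0.0f;
  clock_t start = clock ();
  for (size_t i = 0; i < n; i++)
    sum += f ((float) (i % 64) * 0.01f);
  clock_t end = clock ();
  *sink = sum;                  /* keep the result observable */
  return (double) (end - start) / CLOCKS_PER_SEC / (double) n;
}
```

Calling both helpers with the same function pointer and a large n gives
two per-call figures that can differ noticeably.  An optimizing compiler
may inline or vectorize the stand-in, so a real comparison would call
the library routine across a shared-object boundary instead.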

I am not sure which compiler you used for the evaluation, but at least
the Ubuntu 16.04 one (gcc 5.4) does not use the POWER8 ISA by default,
and even with -mcpu=power8 it generates subpar code.  I will try to
check with GCC 7.1 (but, as in your environment, I am using a shared
machine, although I think I might get slightly better results because
it uses a micro-partition).
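
As a quick way to see which ISA level a given compiler invocation
actually targets, one can look at the _ARCH_PWR* macros that GCC
predefines on powerpc.  The helper below is only an illustrative sketch;
the function name and the returned strings are my own, not anything
from glibc.

```c
/* Reports the newest POWER ISA level the compiler was targeting when
   this file was built, based on GCC's predefined _ARCH_PWR* macros.
   The function name and strings are illustrative only.  */
static const char *
compiled_power_isa (void)
{
#if defined (_ARCH_PWR9)
  return "power9";
#elif defined (_ARCH_PWR8)
  return "power8";
#elif defined (_ARCH_PWR7)
  return "power7";
#elif defined (__powerpc__) || defined (__powerpc64__)
  return "powerpc, older than power7";
#else
  return "not powerpc";
#endif
}
```

Building this once with the distribution defaults and once with
-mcpu=power8 shows whether the default already implies the POWER8 ISA.
The same information is available without compiling anything by dumping
the predefined macros with "gcc -dM -E - < /dev/null" and looking for
_ARCH_PWR8.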

For PREFER_FLOAT_COMPARISON, do we use this in glibc? I think
it is only enabled in optimized-routines, isn't it?
