This is the mail archive of the
mailing list for the glibc project.
Re: [PATCH 00/10] Optimized math routines
On 09/07/18 14:09, Adhemerval Zanella wrote:
On 09/07/2018 09:15, Szabolcs Nagy wrote:
built and tested on a power8 machine now, glibc math
tests pass (except for an unrelated fmal failure),
benchmark improvements are consistent with aarch64/x86_64,
but it was a shared access machine so i won't post exact
numbers, sincosf improved a bit too, sinf/cosf didn't
(apparently powerpc has its own implementation).
PowerPC sinf/cosf uses the same algorithm used on x86, I presume
it would be a gain to generic implementation as well.
you mean the new implementation would be better or the
target specific one?
new implementation has better latency on this particular
powerpc machine than the target specific code, but
throughput is worse sometimes (using the default 0
setting for PREFER_FLOAT_COMPARISON).