sourceware.org Git - glibc.git/commit
x86-64: Add sincosf with vector FMA
author    H.J. Lu <hjl.tools@gmail.com>  Mon, 8 Jan 2018 16:04:26 +0000 (08:04 -0800)
committer H.J. Lu <hjl.tools@gmail.com>  Mon, 8 Jan 2018 16:04:40 +0000 (08:04 -0800)
commit    c70e4e9c9efff9df4c847dd7cfd81bae674219ab
tree      46cbbfb74a8c03e933fc4245c66559def374b1a8
parent    579396ee082565ab5f42ff166a264891223b7b82

Since the x86-64 assembly version of sincosf is highly optimized with
vector instructions, there isn't much room for improvement.  However,
s_sincosf.c, written in C with vector math and intrinsics, can be
optimized by GCC with FMA.
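
As a rough illustration of the kind of C code GCC can optimize with FMA,
consider the sketch below.  It is not taken from s_sincosf.c: the helper
name horner_eval and its arguments are hypothetical.  Each loop iteration
is a multiply followed by an add, which GCC may contract into a single
fused multiply-add when FMA code generation is enabled (for example with
-mfma and the default GNU floating-point contraction mode).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not part of glibc: evaluate
   c[0] + c[1]*x + ... + c[n-1]*x^(n-1) with Horner's rule.  */
static float
horner_eval (const float *c, size_t n, float x)
{
  assert (c != NULL || n == 0);

  float acc = 0.0f;
  for (size_t i = n; i > 0; i--)
    /* One multiply and one add per step: a candidate for a single
       fused multiply-add instruction when the compiler targets FMA.  */
    acc = acc * x + c[i - 1];
  return acc;
}
```

The source is the same whether or not FMA is used; only the compiler
flags differ, which is why a per-file CFLAGS setting is enough to produce
an FMA variant from C code.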

On Skylake, bench-sincosf reports performance improvement:

           Assembly       FMA         improvement
max        104.042       101.008         3%
min        9.426         8.586           10%
mean       20.6209       18.2238         13%

* sysdeps/x86_64/fpu/multiarch/Makefile (libm-sysdep_routines):
Add s_sincosf-sse2 and s_sincosf-fma.
(CFLAGS-s_sincosf-fma.c): New.
* sysdeps/x86_64/fpu/multiarch/s_sincosf-fma.c: New file.
* sysdeps/x86_64/fpu/multiarch/s_sincosf-sse2.S: Likewise.
* sysdeps/x86_64/fpu/multiarch/s_sincosf.c: Likewise.
* sysdeps/x86_64/fpu/s_sincosf.S: Don't add alias if
__sincosf is defined.
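
The multiarch files above provide several builds of one function and
choose between them at run time.  glibc does this with IFUNC resolvers at
load time; the sketch below only approximates the idea in portable C with
a function pointer.  All names in it (madd_generic, madd_fma, select_madd,
madd) are hypothetical, and both variants share one body here, whereas in
a real build the FMA variant would be compiled with different flags.

```c
#include <assert.h>
#include <stddef.h>

typedef float (*madd_fn) (float, float, float);

/* Baseline variant (hypothetical stand-in for an SSE2 build).  */
static float
madd_generic (float a, float b, float c)
{
  return a * b + c;
}

/* Stand-in for a variant built with FMA flags; same body in this sketch.  */
static float
madd_fma (float a, float b, float c)
{
  return a * b + c;
}

/* Pick a variant from CPU features.  The builtins are GCC/Clang
   extensions for x86; other targets fall back to the baseline.  */
static madd_fn
select_madd (void)
{
#if defined (__GNUC__) && (defined (__x86_64__) || defined (__i386__))
  __builtin_cpu_init ();
  if (__builtin_cpu_supports ("fma"))
    return madd_fma;
#endif
  return madd_generic;
}

/* Public entry point: resolve once, then call through the pointer.  */
static float
madd (float a, float b, float c)
{
  static madd_fn impl;
  if (impl == NULL)
    impl = select_madd ();
  assert (impl != NULL);
  return impl (a, b, c);
}
```

Unlike this sketch, an IFUNC resolver binds the symbol once during
dynamic linking, so calls pay no extra check after startup.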
ChangeLog
NEWS
sysdeps/x86_64/fpu/multiarch/Makefile
sysdeps/x86_64/fpu/multiarch/s_sincosf-fma.c [new file with mode: 0644]
sysdeps/x86_64/fpu/multiarch/s_sincosf-sse2.S [new file with mode: 0644]
sysdeps/x86_64/fpu/multiarch/s_sincosf.c [new file with mode: 0644]
sysdeps/x86_64/fpu/s_sincosf.S