This is the mail archive of the
libc-alpha@sourceware.org
mailing list for the glibc project.
Re: [PATCH][PPC] Remove unnecessary local variable mptwo
On Thu, Jan 10, 2013 at 10:50:27AM -0600, Ryan S. Arnold wrote:
> Preventing regressions in performance of mpa is certainly something we
> (IBM) are interested in. In-fact we'd like to see improvements in
> performance of this code if possible.
This is actually a bit tricky on powerpc. My approach so far had been
to try as much as possible to avoid floating point operations (with my
x86 hat on), but a complete fixed-point based function does not do as
well as a complete fp function on POWER regardless of how the code is
partitioned - I'm guessing this has to do with the fact that there are
4 fp units to play with but just 2 fixed point units. Steven pointed
this out to me last week but I had to see it first hand to really
understand it (and the fact that I had my x86 hat still firmly on).
There is scope to improve the multiplication algorithm, but my current
opinion (might change as I learn more about ppc) is that it might be
best to have the ppc code continue storing the mantissa as double,
while x86 is best served with an integer mantissa and all-int
operations. I don't know about other architectures, but AFAICT, most
of them have assembler overrides for most transcendentals.
> I think in order to have a good measure of the performance impacts we
> need to make sure that Siddhesh's performance benchmark suite is in
> place and that it can work with the 2.17 release so that we can get a
> baseline.
Ack, this is not limited to math either; I'm sure a number of non-math
functions could do with a static benchamrk like that at least as a
starting point.
Siddhesh