This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: [PATCH] malloc: Use current (C11-style) atomics for fastbin access


Hi Florian,

> > I see a 16% regression on ppc64le with a simple threaded malloc test
> > case. I guess the C11 atomics aren't as good as what we have in
> > glibc.  
> 
> Uh-oh.  Would you please check if replacing the two
> atomic_load_acquire with atomic_load_relaxed restore the previous
> performance?

As you suspect, doing this does restore the performance. The two lwsync
barrier instructions must be causing the slow down.

> I believe both loads need to be acquire MO under the C11 memory model
> (see the comments why), but the old code did not have them.

Ok, thanks for looking into it.

Thanks,
Anton


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]