This is the mail archive of the libc-alpha@sourceware.org mailing list for the glibc project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: [PATCH] Remove atomic operations from malloc.c

From: Leonhard Holz <leonhard dot holz at web dot de>
To: libc-alpha at sourceware dot org
Date: Thu, 19 Feb 2015 07:54:35 +0100
Subject: Re: [PATCH] Remove atomic operations from malloc.c
Authentication-results: sourceware.org; auth=none
References: <54DB130F dot 9070300 at web dot de> <1423652468 dot 9778 dot 250 dot camel at triegel dot csb> <20150218130009 dot GJ1594 at spoyarek dot pnq dot redhat dot com> <20150219020729 dot GN23507 at brightrain dot aerifal dot cx>


Am 19.02.2015 um 03:07 schrieb Rich Felker:

On Wed, Feb 18, 2015 at 06:30:10PM +0530, Siddhesh Poyarekar wrote:

On Wed, Feb 11, 2015 at 12:01:08PM +0100, Torvald Riegel wrote:

If your machine has just two cores, then at the very least you should
measure for just two threads too; a bigger number of threads is not
putting more contention on any of the synchronization bits, there's just
some more likelihood to having to wait for a thread that isn't running.

Also, to really assess performance, this has to be benchmarked on a
machine with more cores.  Additionally, you could argue why it should
not make a difference, and if that's a compelling argument, we could
follow it instead of the benchmark (which, as Will mentions, is hard to
make representative of real-world workloads).


The default malloc implementation creates 8 * n arenas on a system
with n cores, so for anything up to 8 * n threads, you're just
measuring contention between threads for the CPU since they're all
working on different arenas.

Maybe one way to guarantee such contention is a test with one thread
that allocates on an arena and another thread that frees from the same
arena.  I don't think the current benchmark does that.


I would really like to see more attention to this usage case (allocate
in one thread, free in another). It's an idiomatic msg/data-passing
strategy and probably the least complex in most cases, and it's a
shame if people are avoiding it for performance reasons.

I agree. The benchmark can be changed to use a shared array of allocated blocks, Icould implement it next week maybe.

Follow-Ups:
- Re: [PATCH] Remove atomic operations from malloc.c
  - From: Siddhesh Poyarekar

References:
- [PATCH] Remove atomic operations from malloc.c
  - From: Leonhard Holz
- Re: [PATCH] Remove atomic operations from malloc.c
  - From: Torvald Riegel
- Re: [PATCH] Remove atomic operations from malloc.c
  - From: Siddhesh Poyarekar
- Re: [PATCH] Remove atomic operations from malloc.c
  - From: Rich Felker

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]