The structure new_sem (internaltypes.h, __HAVE_64B_ATOMICS == 0) changed between glibc 2.20 and 2.21 in the way the semaphore value is stored. The value is now kept in the top 31 bits of the field "value"; bit 0 is used to indicate whether there are any waiters. This breaks the code of __old_sem_wait (nptl/sem_wait.c). In particular, the call to "atomic_decrement_if_positive" needs to be replaced by one that understands the new layout of new_sem; otherwise we may end up looping forever, because the futex value may be interpreted as negative. For example, if the user initializes the semaphore value to SEM_VALUE_MAX (2147483647), we actually store 0x7FFFFFFF << 1 == 0xFFFFFFFE, which as a signed int is -2, so "atomic_decrement_if_positive" will never return a value > 0. Not to mention that the decrement also clobbers bit 0.
Does the hang happen on every call to __old_sem_wait?
(In reply to Florian Weimer from comment #1) > Does the hang happen on every call to __old_sem_wait? It is hard to say. In my case the semaphore is initialized via "new_sem_init", but the wait is then done via "old_sem_wait". In that combination there will always be problems, as the two calls assume different layouts for sem_t. Prior to glibc 2.21 this combination worked, because the two structures agreed on the interpretation of the field "value". Normally, I would expect no problems with the combination old_sem_init/old_sem_wait. I should clarify that I ran into this problem using Python 2.7/multiprocessing.
We have two sets of routines: old_sem_init/old_sem_wait and new_sem_init/new_sem_wait. If we use matched pairs, there are no problems. But there is only one sem_open routine, and it uses the new_sem structure. So if we create a semaphore using sem_open, we must use new_sem_wait. That is not guaranteed, and as a matter of fact this mismatch is the cause of the problem I am seeing. So I think there should be two routines, new_sem_open and old_sem_open, as well.