This is the mail archive of the systemtap@sourceware.org mailing list for the systemtap project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: Functions that require interrupts be enabled

From: "Frank Ch. Eigler" <fche at redhat dot com>
To: Mike Mason <mmlnx at us dot ibm dot com>
Cc: systemtap at sources dot redhat dot com
Date: Thu, 17 May 2007 15:46:29 -0400
Subject: Re: Functions that require interrupts be enabled
References: <464A352A.8010608@us.ibm.com> <y0mk5v9h9sk.fsf@ton.toronto.redhat.com> <464B54D6.2070309@us.ibm.com>

Hi -

On Wed, May 16, 2007 at 12:00:38PM -0700, Mike Mason wrote:
> [...]
> >Why must this new routine be permitted to sleep?  We can tolerate
> >paged-out data via soft errors (=> blank strings).
> 
> My script filters based on the arguments, so having the routine randomly 
> fail to return the actual arguments isn't good.  

Yeah, though I wonder how frequently that can happens, considering
that running tools like "top" would keep those pages around.

> Plus I don't think whether the page is paged in or not is an issue.
> I think the routine I'm using handles that (see below).

Yes, via possible sleeps during the other routines.



> >
> >>Now that begin/end probes no longer require that interrupts be
> >>disabled, this function can be used in begin/end probes at least. 
> >
> >AFAIK, interrupts being enabled is not exactly the same thing as being
> >able to sleep.
> 
> Here's the routine I'm using to grab the arguments from user space.  It's a 
> modified version of access_process_vm(), which isn't callable from a 
> module.  It can potentially sleep in two places: down_read() and kmap().  
> These functions do a might_sleep() check and fail if interrupts are 
> disabled.

> I considered using down_read_trylock() and kmap_atomic() (which
> won't sleep) but I don't clearly understand the side-effects of
> doing so.  Any suggestions would be appreciated.

I too only have limited understanding of this.  For the _trylock, that
is not a problem, as one can detect contention and return early rather
than blocking.

For kmap_atomic(), it seems trickier.  You'd need to use one of the
KM_USER[01] slots.  But we're in trouble if we are running this code
within a probe in a kernel routine that is already using that slot.

Then we still have this stuff that might sleep indirectly via a page
fault (unless I'm mistaken):

>        ret = get_user_pages(tsk, mm, addr, 1, 0, 1, &page, &vma);
>        copy_from_user_page(vma, page, addr, buf, maddr + offset, bytes);


So it seems like our options are:
(a) write such a tapset function, permit it to only be called from
    begin/end-like probe contexts, and let it sleep and whatnot
(b) write a related tapset function, which uses very conservative
    atomic routines everywhere, and may return blanks
(c) accept an approximation, such as a deferred result.  It might
    look like this:

# ------------------------------------------------------------------------
# tapset/proc-args.stp
%{
// helper C code
%}

void deferred_lookup(...) %{
     // enqueue a possibly-sleepy lookup of process args using
     //    auxiliary thread or defer_work type callback
     // arrange to put results into _process_args[] when available
     // return right away
%}

global _process_args

function process_args:string (pid) {
  if (pid in _process_args) return _process_args [pid]
  else { deferred_lookup (pid)
         return "" }
}

# ------------------------------------------------------------------------

Then, ordinary user scripts would just call process_args(NNN), and
would have to accept & ignore empty results.  Eventually valid strings
should come by.

- FChE

References:
- Functions that require interrupts be enabled
  - From: Mike Mason
- Re: Functions that require interrupts be enabled
  - From: Frank Ch. Eigler
- Re: Functions that require interrupts be enabled
  - From: Mike Mason

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]