This is the mail archive of the
systemtap@sourceware.org
mailing list for the systemtap project.
Re: Functions that require interrupts be enabled
Hi -
On Wed, May 16, 2007 at 12:00:38PM -0700, Mike Mason wrote:
> [...]
> >Why must this new routine be permitted to sleep? We can tolerate
> >paged-out data via soft errors (=> blank strings).
>
> My script filters based on the arguments, so having the routine randomly
> fail to return the actual arguments isn't good.
Yeah, though I wonder how frequently that can happens, considering
that running tools like "top" would keep those pages around.
> Plus I don't think whether the page is paged in or not is an issue.
> I think the routine I'm using handles that (see below).
Yes, via possible sleeps during the other routines.
> >
> >>Now that begin/end probes no longer require that interrupts be
> >>disabled, this function can be used in begin/end probes at least.
> >
> >AFAIK, interrupts being enabled is not exactly the same thing as being
> >able to sleep.
>
> Here's the routine I'm using to grab the arguments from user space. It's a
> modified version of access_process_vm(), which isn't callable from a
> module. It can potentially sleep in two places: down_read() and kmap().
> These functions do a might_sleep() check and fail if interrupts are
> disabled.
> I considered using down_read_trylock() and kmap_atomic() (which
> won't sleep) but I don't clearly understand the side-effects of
> doing so. Any suggestions would be appreciated.
I too only have limited understanding of this. For the _trylock, that
is not a problem, as one can detect contention and return early rather
than blocking.
For kmap_atomic(), it seems trickier. You'd need to use one of the
KM_USER[01] slots. But we're in trouble if we are running this code
within a probe in a kernel routine that is already using that slot.
Then we still have this stuff that might sleep indirectly via a page
fault (unless I'm mistaken):
> ret = get_user_pages(tsk, mm, addr, 1, 0, 1, &page, &vma);
> copy_from_user_page(vma, page, addr, buf, maddr + offset, bytes);
So it seems like our options are:
(a) write such a tapset function, permit it to only be called from
begin/end-like probe contexts, and let it sleep and whatnot
(b) write a related tapset function, which uses very conservative
atomic routines everywhere, and may return blanks
(c) accept an approximation, such as a deferred result. It might
look like this:
# ------------------------------------------------------------------------
# tapset/proc-args.stp
%{
// helper C code
%}
void deferred_lookup(...) %{
// enqueue a possibly-sleepy lookup of process args using
// auxiliary thread or defer_work type callback
// arrange to put results into _process_args[] when available
// return right away
%}
global _process_args
function process_args:string (pid) {
if (pid in _process_args) return _process_args [pid]
else { deferred_lookup (pid)
return "" }
}
# ------------------------------------------------------------------------
Then, ordinary user scripts would just call process_args(NNN), and
would have to accept & ignore empty results. Eventually valid strings
should come by.
- FChE