David Smith [Tue, 2 Oct 2012 14:56:42 +0000 (09:56 -0500)]
(PR14571 partial fix) Correctly handle maps with limited entries.
* translate.cxx (mapvar::init): Remove hardcoded 'wrap' initialization and
let _stp_map_new() initialize 'wrap'.
* runtime/map.c (_stp_map_init): Set new 'wrap' parameter in map itself.
(_stp_map_new): Pass new 'wrap' parameter down to _stp_map_init().
(_stp_map_tls_object_init): Pass cached 'wrap' field to _stp_map_init().
(_stp_pmap_new): Pass new 'wrap' parameter down to _stp_map_init().
* runtime/map.h: Update function prototypes with new 'wrap' parameter.
* runtime/map-gen.c (KEYSYM(_stp_map_new)): Pass new 'wrap' parameter down
to the correct _stp_map_new* function.
* runtime/pmap-gen.c (KEYSYM(_stp_pmap_new)): Pass new 'wrap' parameter
down to the correct _stp_pmap_new* function
* runtime/map-stat.c (_stp_map_new_hstat_log): Pass new 'wrap' parameter
down to _stp_map_new().
(_stp_map_new_hstat_linear): Ditto.
(_stp_pmap_new_hstat_linear): Ditto.
(_stp_pmap_new_hstat_log): Ditto.
PR14555, replace kernel symbol "_stext" by a macro in runtime/k_syms.h
The macro is used by the runtime as well as the compilation
components. It is not guaranteed that this symbol is always called
"_stext" on all archtitectures. On powerpc64 for example its name is
".__start". Stap will not run on other architectures where this symbol
has a different name because the lookup for "_stext" will fail.
Adjusted by <fche> to leave _stext as the relocation pseudo-section
name as used by relocation basis code, and parametrizing only
symbol names.
Mark Wielaard [Sun, 30 Sep 2012 21:44:28 +0000 (23:44 +0200)]
testsuite stap_run_batch don't add an extra empty argument.
commit 8c94ef made it possible to add extra arguments to stap_run_batch.
But we must make sure we don't accidentially add an extra empty argument.
Some testcases like parseok/fourteen.stp depend on @# being zero.
Mark Wielaard [Sun, 30 Sep 2012 16:12:19 +0000 (18:12 +0200)]
memory.stp: do_mmap was replaced by vm_mmap.
do_mmap was completely replaced by vm_mmap, so if either is a good match
for the vm.mmap probe alias. See kernel commits 6be5ceb and dc98250.
(Note, do_mmap2 is a special case just for powerpc.)
do_munmap was partially replaced by vm_munmap, but vm_munmap calls
though do_munmap, so for the vm.munmap probe alias do_munmap is the
best match. See kernel commits a46ef99 and 17d1587.
The same is true for do_brk and vm_brk, vm_brk calls through do_brk,
so for the vm.brk probe alias do_brk is the function to probe. See
kernel commit e4eb1ff.
This used to resolve partly before because when CONFIG_COMPAT = "y"
there would still be a compat_sys_nfsservctl. But that was slightly
bogus because without CONFIG_NFSD that would just be:
Mark Wielaard [Sat, 29 Sep 2012 17:13:47 +0000 (19:13 +0200)]
parser::parse_global(): break after seeing a termination token.
The code would swallow the terminating token and then inspect whether it
was a ','. Which it obviously wasn't since we had just seen that it was
a ';'.
Mark Wielaard [Fri, 28 Sep 2012 21:16:18 +0000 (23:16 +0200)]
Add get_self_path() as workaround for running under valgrind.
get_base_hash() wants to get some stats of the main binary. But when
running under valgind a stat call on /proc/self/exe actually gives
the stats of the valgrind process binary. Using readlink before calling
stat works around that (readlink is intercepted by valgrind, stat isn't).
Add get_self_path to util.cxx.
Josh Stone [Sat, 29 Sep 2012 01:29:40 +0000 (18:29 -0700)]
stapdyn: Clean up error/warning/log messages
* stapdyn/stapdyn.cxx (staplog, stapwarn, staperror): New ostream
functions that allow common prefixes, log levels, and suppressed
warnings. All appropriate clog's are updated to these.
* buildrun.cxx (make_dyninst_run_command): Set -v and -w options.
Josh Stone [Fri, 28 Sep 2012 21:28:13 +0000 (14:28 -0700)]
stapdyn: Check and report the child exit status
* stapdyn/dynutil.cxx (check_dyninst_exit): New, check how the given
BPatch_process exited, and report failures.
* stapdyn/stapdyn.cxx (main): Use check_dyninst_exit.
* stapdyn/dynsdt.cxx (main): Use check_dyninst_exit.
PR14364, PR14630: Use set_fs and pagefault_disable/enable around more accesses
It turns out there are a bunch of conceptually overlapping
functions/macros throughout the runtime, each of which attempts to
dereference untrustworthy kernel- or user-space pointers, in slightly
different ways.
When deliberately invoked with bad pointer values, some lockdep
kernels (e.g. 2.6.32-279.9.1.el6.x86_64.debug) would emit errors about
page-fault handling paths being triggered in inappropriate contexts
for some of these lookup functions. It turns out a more robust
control of address space checking and fault suppression is necessary.
* runtime/linux/autoconf-pagefault_disable.c: New test.
* buildrun.cxx (compile_pass): Run it.
* runtime/linux/copy.c (_stp_read_address): Add pagefault_{disable,enable}.
Note duplication with loc2c-runtime.h
(_stp_strncpy_from_user): Add set_fs & pagefault_{disable,enable}.
Note duplication with loc2c-runtime.h
* runtime/stp_string.h (__stp_get_user): Wrap in pagefault_{disable,enable}.
Note duplication with loc2c-runtime.h
* tapset/uconversions.stp (__STP_GET_USER): Instead of __stp_get_user,
zap duplication with loc2c-runtime.h and just call loc2c-runtime.h.
* runtime/loc2c-runtime.h (STAPCONF_PAGEFAULT_DISABLE): Add dummy
macros for pre-rhel5 kernels.
(_stp_deref, _stp_store_deref): Revamped arch-specific macros, setting
segments and disabling pagefaults.
(uderef,ustore_deref,kderef,kstore_deref): Revamped macros to call the
above. These should become the standard throughout the runtime/tapset.
runtime: add noinline to *printf fns to limit frame-size errors
During the debugging work for PR14630, it turned out to trigger these
warning->errors. Some noinline's, and one or two static[]'s,
<south park>.... and it's gone!</>
Josh Stone [Fri, 28 Sep 2012 00:55:05 +0000 (17:55 -0700)]
PR14489: Revamp stapdyn probe metadata
Rather than having a fixed data structure for stapdyn to read from the
module, now stapdyn queries the module dynamically for its data. Thus,
we dlopen the module within stapdyn, then dlsym a few query functions
and use those to enumerate all of the probe data.
* runtime/dyninst/stapdyn.h: Add functions for metadata.
* runtime/dyninst/uprobes.h: New, define module internal datastructures.
* runtime/dyninst/uprobes.c: Implement the metadata functions.
* tapsets.cxx: Generate probe metadat in the new datastructures.
* stapdyn/stapdyn.cxx: Query probes using the new functions.
* stapdyn/Makefile.am: No longer need -lelf for stapdyn.
* stapdyn/Makefile.in: Regenerate.
Josh Stone [Thu, 27 Sep 2012 20:25:30 +0000 (13:25 -0700)]
PR14574: Let stapdyn run without a target command
For easier testing, let begin/end/error probes run directly in stapdyn
when there's no -c option given.
* runtime/dyninst/stapdyn.h: New, declare functions defined in the
module and used by stapdyn.
* runtime/dyninst/runtime.h: Include stapdyn.h so definitions match.
* stapdyn/stapdyn.cxx: When no command is given, dlsym the init/exit
functions and run them directly.
* (autoreconf...)
Josh Stone [Thu, 27 Sep 2012 18:42:03 +0000 (11:42 -0700)]
hash: Include several subdirectories of the runtime
One advantage of having the runtime path in the hash is its mtime also
tells us when any files were modified (at least with most editors that
write a temp file then atomically rename). For stap hackers like
myself, it would be nice to get this benefit for subdirectories of the
runtime too, so let's also add those to the hash.
* hash.cxx (get_base_hash): Add /transport, /unwind, and either /dyninst
or /linux depending on the current runtime mode.
(find_uprobes_hash): Update the uprobes paths since PR14179's move.
Josh Stone [Thu, 27 Sep 2012 16:19:39 +0000 (09:19 -0700)]
PR13486: Always output a frame_base if needed
Previously, loc2c only emitted code for frame_base if it was on the
first loc of the chain. But in some cases, the first piece may not
reference the frame_base while later parts do.
In the cases found in the bug, the first piece was GNU_implicit_pointer,
which doesn't even emit code of its own. But its target DIE did need
the frame_base to compute its value. Since loc2c didn't realize that,
we ended up emitting code using frame_base without ever declaring it.
* loc2c.c (c_emit_location): Loop over the whole loc chain to determine
if a frame_base is needed, and output the first one found.
Josh Stone [Wed, 26 Sep 2012 22:17:10 +0000 (15:17 -0700)]
stapdyn: Resolve the target executable from the PATH
BPatch::processCreate needs a full path for the process argument, so we
need to walk the PATH to figure that out. We already did this in
dynsdt, but using a private function. Now both use find_executable from
util.h, in a new form that doesn't care about sysroots.
* util.cxx (find_executable): Add a name-only version that just wraps
the full sysroot version, for convenience. Also let a few more things
be const in the implementation.
* hash.cxx (get_base_hash): Use the wrapper function for finding gcc,
instead of having a "dummy" sysenv itself.
* stapdyn/stapdyn.cxx (main): Resolve the target with find_executable.
* stapdyn/dynsdt.cxx (resolve_path): Removed.
(main): Use find_executable instead.
For helping diagnose crashes that may occur during a testsuite,
set $SYSTEMTAP_SYNC, which is handled by staprun, just before it
does the module-insertion. This will make tests slower, sorry.
Josh Stone [Mon, 24 Sep 2012 23:15:55 +0000 (16:15 -0700)]
stapdyn: Enable end/error probes
The dyninst exit hook runs too late for us to still call anything in the
mutatee, so the systemtap_module_exit() call which should run all of the
end/error probes wasn't happening.
Now we use a destructor function in the mutatee, so our exit path always
runs after main() returns or after an exit() call. Functions like
_exit() are still problematic though.
This now also makes a distinction between initializing process-local vs.
session resources, so we are more ready for operating with multiple
mutatees at once. See dyninst/runtime.h for design comments.
David Smith [Fri, 21 Sep 2012 20:13:28 +0000 (15:13 -0500)]
(PR14571 partial fix) For dyninst, use TLS for map and stat data.
* runtime/dyninst/tls_data.c: New file.
* runtime/stat.c (struct _Stat): Add a tls_data_container_t structure.
(_stp_stat_tls_object_init): New function.
(_stp_stat_tls_object_free): Ditto.
(_stp_stat_init): Instead of directly allocating percpu data, for
dyninst set up tls data to be created when accessed by calling
_stp_tls_data_container_init().
(_stp_stat_del): For dyninst, call _stp_tls_data_container_cleanup() to
remove all the tls data.
(_stp_stat_add): For dyninst, get the proper tls stat object.
(_stp_stat_get_cpu): Deleted unused function.
(_stp_stat_get): For dyninst, get the proper tls stat objects.
(_stp_stat_clear): For dyninst, clear the stat in each thread's tls data.
* runtime/stat.h (struct stat_data): Add a tls_data_object_t structure.
* runtime/map.c (_stp_map_tls_object_init): New function.
(_stp_map_tls_object_free): Ditto.
(_stp_pmap_new): Instead of directly allocating percpu data, for dyninst
set up tls data to be created when accessed by calling
_stp_tls_data_container_init().
(_stp_pmap_clear): For dyninst, clear the map in each thread's tls data.
(_stp_pmap_del): For dyninst, call _stp_tls_data_container_cleanup() to
remove all the tls data.
(_stp_pmap_agg): Add dyninst support.
* runtime/map.h (struct map_root): Add a tls_data_object_t structure.
(struct pmap): Add a tls_data_container_t structure.
* runtime/map-stat.c (_stp_hstat_tls_object_init): New function.
(_stp_pmap_new_hstat_linear): For dyninst, override the standard tls
data object init function with _stp_hstat_tls_object_init(), which knows
how to handle hstats.
(_stp_pmap_new_hstat_log): Ditto.
* runtime/pmap-gen.c (_stp_pmap_tls_object_init): New function.
(_stp_pmap_new): For dyninst, override the standard tls
data object init function with _stp_pmap_tls_object_init(), which knows
how to handle pmaps.
(_stp_pmap_set): For dyninst, get the proper tls pmap object.
(_stp_pmap_add): Ditto.
(_stp_pmap_get_cpu): Ditto.
(_stp_pmap_get): Ditto.
(_stp_pmap_del): Ditto.
* runtime/dyninst/linux_defs.h: Added container_of(), list_entry(),
list_for_each_entry(), and list_for_each_entry_safe().
Mark Wielaard [Wed, 19 Sep 2012 08:33:29 +0000 (10:33 +0200)]
parse.cxx swallow tokens we are definitely not using.
The tokens produced by expect_* () were immediately dropped on the floor
after inspection. And a lot of places in the parser called next () just
to get passed the current token without using it. Those tokens could
immediately be cleaned up saving ~3MB of "lost memory".
valgrind stap -v -k -p4 -e 'probe begin { log("Hello, World!"); exit(); }'
Before:
==12545== definitely lost: 2,470,128 bytes in 51,408 blocks
==12545== indirectly lost: 14,180,805 bytes in 319,624 blocks
After:
==14782== definitely lost: 18,856 bytes in 220 blocks
==14782== indirectly lost: 12,432,436 bytes in 264,176 blocks
Implements a cached unwinder, allowing most backtrace tapset functions
to be implemented in terms of stack() without loss of performance.
stack() calls can be made repeatedly and in any order, and they will
use the results of a single unwind. (Works only with the DWARF
unwinder).
_stp_stack_kernel_print et. al. retain their prior behaviour,
including a number of fallbacks not available to the incremental
unwind. These fallbacks only emit backtrace strings, which can be
tokenized on the tapset end as a last resort.
* runtime/unwind/unwind.h -- define struct unwind_cache to store PCs
obtained from unwinder.
* runtime/common_probe_context.h -- include two sets of unwinder
context and cache, one for user side, one for kernel.
* tapsets.cxx -- probe prologue includes a small thing to mark
the unwind caches as being in an uninitialized state.
* runtime/stack.c -- incremental unwinder implementation.
* runtime/stack-dwarf.c -- deleted. Code moved to stack.c since
this is now the preferred unwind method.
* tapset/linux/[u]context-symbols.stp -- change stack(), ustack()
to directly call incremental unwinder.
David Smith [Fri, 14 Sep 2012 15:13:19 +0000 (10:13 -0500)]
(PR14571 partial fix) For dyninst, use TLS for the context structure.
* translate.cxx (c_unparser::emit_common_header): Update to use thread
local storage in dyninst mode for context structure.
(c_unparser::emit_global_param): Since there are no module parameters
with dyninst, don't bother emitting code.
(c_unparser::emit_module_init): Don't bother emitting kernel-only code
in dyninst mode.
* runtime/vsprintf.c (_stp_vsprint_memory): For dyninst, use thread local
storage to get correct context structure.
* tapsets.cxx (common_probe_entryfn_prologue): Take a 'systemtap_sesion'
argument instead of a 'translator_output' argument. This allows access
to the s.is_usermode() function (so we can avoid outputting kernel code
when using dyninst).
(dwarf_derived_probe_group::emit_module_decls): Update
common_probe_entryfn_prologue() call.
(uprobe_derived_probe_group::emit_module_utrace_decls): Ditto.
(uprobe_derived_probe_group::emit_module_inode_decls): Ditto.
(uprobe_derived_probe_group::emit_module_dyninst_decls): Ditto.
(kprobe_derived_probe_group::emit_module_decls): Ditto.
(hwbkpt_derived_probe_group::emit_module_decls): Ditto.
(tracepoint_derived_probe_group::emit_module_decls): Ditto.
* tapset-been.cxx (be_derived_probe_group::emit_module_decls): Ditto.
* tapset-itrace.cxx (itrace_derived_probe_group::emit_module_decls): Ditto.
* tapset-mark.cxx (mark_derived_probe_group::emit_module_decls): Ditto.
* tapset-netfilter.cxx (netfilter_derived_probe_group::emit_module_decls):
Ditto.
* tapset-perfmon.cxx (perf_derived_probe_group::emit_module_decls): Ditto.
* tapset-procfs.cxx (procfs_derived_probe_group::emit_module_decls):
Ditto.
* tapset-timers.cxx (profile_derived_probe_group::emit_module_decls):
Ditto.
* tapset-utrace.cxx (utrace_derived_probe_group::emit_module_decls):
Ditto.
* tapsets.h: Updated common_probe_entryfn_prologue() declaration.
Mark Wielaard [Mon, 10 Sep 2012 08:29:15 +0000 (10:29 +0200)]
dflpp.cxx (find_variable_and_frame_base): Remove extra bracket in error.
Before:
semantic error: unable to find local 'task', dieoffset 0x111604 in nfs,
near pc 0x78863 in nfs_read_done fs/nfs/proc.c ( (alternatives: $data):
identifier '$task' at [...]
After:
semantic error: unable to find local 'task', dieoffset 0x111604 in nfs,
near pc 0x78863 in nfs_read_done fs/nfs/proc.c (alternatives: $data):
identifier '$task' at [...]
David Smith [Fri, 7 Sep 2012 19:57:18 +0000 (14:57 -0500)]
(PR14488 partial fix) Make memory reading more robust.
* runtime/dyninst/runtime.h (assert_is_myproc): New dummy macro.
(stp_dummy_init): Opens the /proc/self/mem file.
(stp_dummy_exit): Close the file.
* runtime/dyninst/linux_defs.h (user_mode): New dummy macro.
(__copy_from_user): Rewrote to read memory from /proc/self/mem, to be
able to catch errors.
* runtime/stp_string.h (__stp_get_user): Always use __get_user(), no
matter what arch we're on in dyninst mode.
* tapset/uconversions.stp: Moved from tapset/linux/uconversions.stp, since
the dyninst runtime can use the functions also.
Mark Wielaard [Wed, 5 Sep 2012 19:09:43 +0000 (21:09 +0200)]
PR14546 Only copy registers of initial state.
The DWARF unwinder was copying too much information into the cie_regs
state possibly overwriting ~18 bytes after the unwind_context. Just copy
the registers.
Serguei Makarov [Mon, 27 Aug 2012 18:13:39 +0000 (14:13 -0400)]
PR14524: support old embedded-locals notation on a per-function basis
Adding the pragma /* unmangled */ to an embedded-C function now enables
the use of the pre-1.8 local names, i.e. THIS->n instead of STAP_ARG_n.
This aids migration of pre-1.8 scripts and tapsets by allowing mixing
two kinds of embedded functions (pre- and post-1.8) in the same code.
Josh Stone [Wed, 22 Aug 2012 22:41:49 +0000 (15:41 -0700)]
langref: Mention 'delete' in the Arrays and Statistics sections
When reading just the sections on Arrays and Statistics, it was not
apparent that the 'delete' statement is applicable to each. These
sections now have a mention and back-reference to the delete statement.
Josh Stone [Tue, 21 Aug 2012 18:46:30 +0000 (11:46 -0700)]
runtime: Use typedef-typeof for safer function pointers
We use kallsyms_lookup_name to access several functions that are
publicly declared but not exported. We were manually recreating the
call signatures for function pointers though, which is begging for
trouble. This patch updates all those typedefs to use typeof on the
original function declaration, so we hopefully get compile-time breakage
if the call signature ever changes on us.
David Smith [Fri, 17 Aug 2012 18:31:24 +0000 (13:31 -0500)]
Fixed PR14463 by changing error handling of utrace_control().
* runtime/stp_utrace.c (utrace_barrier): Add a 'might_sleep()' call since
it calls schedule_timeout_interruptible(), which sleeps.
* runtime/linux/task_finder2.c (__stp_utrace_attach): Ignore
EINPROGRESS errors from utrace_control() when passed UTRACE_STOP, since
that just means the task hasn't stopped yet.
* runtime/linux/task_finder.c (__stp_utrace_attach_flags): Ditto.