Some DTrace actions are destructive in that they change the state of the system in some well-defined way. Destructive actions may not be used unless they have been explicitly enabled. When using dtrace(1M), you can enable destructive actions using the -w option. If an attempt is made to use destructive actions in dtrace(1M) without explicitly enabling them, dtrace will fail with a message similar to the following example:
dtrace: failed to enable 'syscall': destructive actions not allowed
Some destructive actions are destructive only to a particular process. These actions are available to users with the dtrace_proc or dtrace_user privileges. See Chapter 35, Security for details on DTrace security privileges.
void stop(void)
The stop action forces the process that fires the enabled probe to stop when it next leaves the kernel, as if stopped by a proc(4) action. The prun(1) utility may be used to resume a process that has been stopped by the stop action. The stop action can be used to stop a process at any DTrace probe point. This action can be used to capture a program in a particular state that would be difficult to achieve with a simple breakpoint, and then attach a traditional debugger like mdb(1) to the process. You can also use the gcore(1) utility to save the state of a stopped process in a core file for later analysis.
void raise(int signal)
The raise action sends the specified signal to the currently running process. This action is similar to using the kill(1) command to send a process a signal. The raise action can be used to send a signal at a precise point in a process's execution.
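When experimenting with raise, it can help to have a target process that records each signal delivery. The following C sketch is one possible target, not part of DTrace itself. The names usr1_count, on_usr1, and install_usr1_handler are hypothetical, and SIGUSR1 is an arbitrary choice of signal.

```c
#include <signal.h>

/* Hypothetical counter: number of SIGUSR1 deliveries seen so far. */
volatile sig_atomic_t usr1_count = 0;

/* Handler kept async-signal-safe: it only increments the counter. */
void
on_usr1(int sig)
{
	(void) sig;
	usr1_count++;
}

/* Install the handler; returns 0 on success, -1 on failure. */
int
install_usr1_handler(void)
{
	return (signal(SIGUSR1, on_usr1) == SIG_ERR ? -1 : 0);
}
```

With such a target running, a D clause whose predicate selects that process and whose body calls raise(SIGUSR1) should increment the counter each time the probe fires in that process.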
void copyout(void *buf, uintptr_t addr, size_t nbytes)
The copyout action copies nbytes from the buffer specified by buf to the address specified by addr in the address space of the process associated with the current thread. If the user-space address does not correspond to a valid, faulted-in page in the current address space, an error will be generated.
void copyoutstr(string str, uintptr_t addr, size_t maxlen)
The copyoutstr action copies the string specified by str to the address specified by addr in the address space of the process associated with the current thread. If the user-space address does not correspond to a valid, faulted-in page in the current address space, an error will be generated. The string length is limited to the value set by the strsize option. See Chapter 16, Options and Tunables for details.
void system(string program, ...)
The system action causes the program specified by program to be executed as if it were given to the shell as input. The program string may contain any of the printf/printa format conversions. Arguments must be specified that match the format conversions. Refer to Chapter 12, Output Formatting for details on valid format conversions.
The following example runs the date(1) command once per second:
# dtrace -wqn tick-1sec'{system("date")}'
Tue Jul 20 11:56:26 CDT 2004
Tue Jul 20 11:56:27 CDT 2004
Tue Jul 20 11:56:28 CDT 2004
Tue Jul 20 11:56:29 CDT 2004
Tue Jul 20 11:56:30 CDT 2004
The following example shows a more elaborate use of the action, using printf conversions in the program string along with traditional filtering tools like pipes:
#pragma D option destructive
#pragma D option quiet

proc:::signal-send
/args[2] == SIGINT/
{
	printf("SIGINT sent to %s by ", args[1]->pr_fname);
	system("getent passwd %d | cut -d: -f5", uid);
}
Running the above script results in output similar to the following example:
# ./whosend.d
SIGINT sent to MozillaFirebird- by Bryan Cantrill
SIGINT sent to run-mozilla.sh by Bryan Cantrill
^C
SIGINT sent to dtrace by Bryan Cantrill
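The way the format conversion in the script above turns into a shell command line can be pictured with an ordinary C analogue. This is only an illustrative sketch of the formatting step, not how DTrace implements it. The helper name format_getent_cmd is hypothetical, and uid is passed as a plain int so that the template matches the script.

```c
#include <stdio.h>

/*
 * Hypothetical helper: expand the command template from the script
 * for a given user ID. Returns the formatted length, as snprintf does.
 */
int
format_getent_cmd(char *buf, size_t len, int uid)
{
	return (snprintf(buf, len, "getent passwd %d | cut -d: -f5", uid));
}
```

For a uid of 1001, the buffer would hold the string getent passwd 1001 | cut -d: -f5. A user-level program could hand that string to system(3C), which passes it to the shell, much as the action's description says the program is run as if it were given to the shell as input.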
The execution of the specified command does not occur in the context of the firing probe; it occurs when the buffer containing the details of the system action is processed at user level. How and when this processing occurs depends on the buffering policy, described in Chapter 11, Buffers and Buffering. With the default buffering policy, the buffer processing rate is specified by the switchrate option. You can see the delay inherent in system if you explicitly set switchrate to an interval longer than its one-second default, as shown in the following example:
#pragma D option quiet
#pragma D option destructive
#pragma D option switchrate=5sec

tick-1sec
/n++ < 5/
{
	printf("walltime : %Y\n", walltimestamp);
	printf("date : ");
	system("date");
	printf("\n");
}

tick-1sec
/n == 5/
{
	exit(0);
}
Running the above script results in output similar to the following example:
# dtrace -s ./time.d
walltime : 2004 Jul 20 13:26:30
date : Tue Jul 20 13:26:35 CDT 2004
walltime : 2004 Jul 20 13:26:31
date : Tue Jul 20 13:26:35 CDT 2004
walltime : 2004 Jul 20 13:26:32
date : Tue Jul 20 13:26:35 CDT 2004
walltime : 2004 Jul 20 13:26:33
date : Tue Jul 20 13:26:35 CDT 2004
walltime : 2004 Jul 20 13:26:34
date : Tue Jul 20 13:26:35 CDT 2004
Notice that the walltime values differ, but the date values are identical. This result reflects the fact that the execution of the date(1) command occurred only when the buffer was processed, not when the system action was recorded.
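The split between recording and execution can be illustrated with a small C model. It is a deliberately simplified sketch and not the DTrace buffer implementation. The type sys_record_t and the functions record_system and process_buffer are hypothetical names, and time is reduced to whole seconds.

```c
#include <stddef.h>

/* One recorded system() request in a hypothetical trace buffer. */
typedef struct sys_record {
	int recorded_at;	/* second at which the probe fired */
	int executed_at;	/* second at which the command ran; -1 if pending */
} sys_record_t;

/* Probe context: only note that a command was requested. */
void
record_system(sys_record_t *rec, int now)
{
	rec->recorded_at = now;
	rec->executed_at = -1;
}

/* Consumer context: run every pending command at buffer-processing time. */
size_t
process_buffer(sys_record_t *recs, size_t nrecs, int now)
{
	size_t ran = 0;

	for (size_t i = 0; i < nrecs; i++) {
		if (recs[i].executed_at == -1) {
			recs[i].executed_at = now;
			ran++;
		}
	}
	return (ran);
}
```

Recording five requests at seconds 30 through 34 and then calling process_buffer at second 35 leaves five distinct recorded_at values but a single executed_at value of 35, which is the same pattern as the walltime and date columns in the output.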
Some destructive actions are destructive to the entire system. These actions must obviously be used extremely carefully, as they will affect every process on the system and any other system implicitly or explicitly depending upon the affected system's network services.
void breakpoint(void)
The breakpoint action induces a kernel breakpoint, causing the system to stop and transfer control to the kernel debugger. The kernel debugger will emit a string denoting the DTrace probe that triggered the action. For example, consider the following command:
# dtrace -w -n clock:entry'{breakpoint()}'
dtrace: allowing destructive actions
dtrace: description 'clock:entry' matched 1 probe
On Solaris running on SPARC, the following message might appear on the console:
dtrace: breakpoint action at probe fbt:genunix:clock:entry (ecb 30002765700)
Type 'go' to resume
ok
On Solaris running on x86, the following message might appear on the console:
dtrace: breakpoint action at probe fbt:genunix:clock:entry (ecb d2b97060)
stopped at int20+0xb: ret
kmdb[0]:
The address following the probe description is the address of the enabling control block (ECB) within DTrace. You can use this address to determine more details about the probe enabling that induced the breakpoint action.
A mistake with the breakpoint action may cause it to be called far more often than intended. This behavior might in turn prevent you from even terminating the DTrace consumer that is triggering the breakpoint actions. In this situation, set the kernel integer variable dtrace_destructive_disallow to 1. This setting will disallow all destructive actions on the machine. Apply this setting only in this particular situation.
The exact method for setting dtrace_destructive_disallow will depend on the kernel debugger that you are using. If using the OpenBoot PROM on a SPARC system, use w!:
ok 1 dtrace_destructive_disallow w!
ok
Confirm that the variable has been set using w?:
ok dtrace_destructive_disallow w?
1
ok
Continue by typing go:
ok go
If using kmdb(1) on x86 or SPARC systems, use the 4-byte write modifier (W) with the / formatting dcmd:
kmdb[0]: dtrace_destructive_disallow/W 1
dtrace_destructive_disallow: 0x0 = 0x1
kmdb[0]:
Continue using :c:
kmdb[0]: :c
To re-enable destructive actions after continuing, you will need to explicitly reset dtrace_destructive_disallow back to 0 using mdb(1):
# echo "dtrace_destructive_disallow/W 0" | mdb -kw
dtrace_destructive_disallow: 0x1 = 0x0
#
void panic(void)
The panic action causes a kernel panic when triggered. This action should be used to force a system crash dump at a time of interest. You can use this action together with ring buffering and postmortem analysis to understand a problem. For more information, see Chapter 11, Buffers and Buffering and Chapter 37, Postmortem Tracing respectively. When the panic action is used, a panic message appears that denotes the probe causing the panic. For example:
panic[cpu0]/thread=30001830b80: dtrace: panic action at probe syscall::mmap:entry (ecb 300000acfc8)

000002a10050b840 dtrace:dtrace_probe+518 (fffe, 0, 1830f88, 1830f88, 30002fb8040, 300000acfc8)
  %l0-3: 0000000000000000 00000300030e4d80 0000030003418000 00000300018c0800
  %l4-7: 000002a10050b980 0000000000000500 0000000000000000 0000000000000502
000002a10050ba30 genunix:dtrace_systrace_syscall32+44 (0, 2000, 5, 80000002, 3, 1898400)
  %l0-3: 00000300030de730 0000000002200008 00000000000000e0 000000000184d928
  %l4-7: 00000300030de000 0000000000000730 0000000000000073 0000000000000010

syncing file systems... 2 done
dumping to /dev/dsk/c0t0d0s1, offset 214827008, content: kernel
100% done: 11837 pages dumped, compression ratio 4.66, dump succeeded
rebooting...
syslogd(1M) will also emit a message upon reboot:
Jun 10 16:56:31 machine1 savecore: [ID 570001 auth.error] reboot after panic: dtrace: panic action at probe syscall::mmap:entry (ecb 300000acfc8)
The message buffer of the crash dump also contains the probe and ECB responsible for the panic action.
void chill(int nanoseconds)
The chill action causes DTrace to spin for the specified number of nanoseconds. chill is primarily useful for exploring problems that might be timing related. For example, you can use this action to open race condition windows, or to bring periodic events into or out of phase with one another. Because interrupts are disabled while in DTrace probe context, any use of chill will induce interrupt latency, scheduling latency, and dispatch latency. Therefore, chill can cause unexpected systemic effects and should not be used indiscriminately. Because system activity relies on periodic interrupt handling, DTrace will refuse to execute the chill action for more than 500 milliseconds out of each one-second interval on any given CPU. If the maximum chill interval is exceeded, DTrace will report an illegal operation error, as shown in the following example:
# dtrace -w -n syscall::open:entry'{chill(500000001)}'
dtrace: allowing destructive actions
dtrace: description 'syscall::open:entry' matched 1 probe
dtrace: 57 errors
CPU ID FUNCTION:NAME
dtrace: error on enabled probe ID 1 (ID 14: syscall::open:entry): \
illegal operation in action #1
This limit is enforced even if the time is spread across multiple calls to chill, or multiple DTrace consumers of a single probe. For example, the same error would be generated by the following command:
# dtrace -w -n syscall::open:entry'{chill(250000000); chill(250000001);}'
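The cumulative nature of the limit can be modeled with a short C sketch. This is a simplified illustration of the rule as stated above, and the kernel's actual accounting may differ in detail, for example in how interval boundaries are tracked. The names chill_state_t, chill_request, CHILL_INTERVAL_NS, and CHILL_MAX_NS are hypothetical.

```c
#include <stdint.h>

#define	CHILL_INTERVAL_NS	1000000000LL	/* one-second accounting interval */
#define	CHILL_MAX_NS		500000000LL	/* at most 500 ms of chill per interval */

/* Hypothetical per-CPU accounting state. */
typedef struct chill_state {
	int64_t interval_start;	/* start of the current interval, in ns */
	int64_t chilled;	/* ns already spent chilling in this interval */
} chill_state_t;

/*
 * Returns 0 and charges the budget if the request fits, or -1
 * (standing in for the illegal operation error) if it does not.
 */
int
chill_request(chill_state_t *cs, int64_t now, int64_t ns)
{
	/* Start a fresh budget once a full interval has elapsed. */
	if (now - cs->interval_start >= CHILL_INTERVAL_NS) {
		cs->interval_start = now;
		cs->chilled = 0;
	}
	if (ns < 0 || cs->chilled + ns > CHILL_MAX_NS)
		return (-1);
	cs->chilled += ns;
	return (0);
}
```

In this model, a single request for 500000001 nanoseconds is rejected. A request for 250000000 followed by one for 250000001 in the same interval is rejected on the second call, because the budget is shared across calls rather than applied to each call separately.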