mirror of
https://github.com/raspberrypi/linux.git
synced 2026-01-02 15:53:42 +00:00
Like trace subcommand, it should be able to pass some options to control
the tracing behavior for the function graph tracer.
But some options are limited in order to maintain the internal behavior.
For example, it can limit the function call depth like below:
# perf ftrace profile --graph-opts depth=5 -- myprog
Committer testing:
root@number:~# perf ftrace profile --graph-opts thresh=1000 -- sleep 1
# Total (us) Avg (us) Max (us) Count Function
1001419.301 500709.650 1000032.000 2 x64_sys_call
1000032.000 1000032.000 1000032.000 1 __x64_sys_clock_nanosleep
1000032.000 1000032.000 1000032.000 1 common_nsleep
1000031.000 1000031.000 1000031.000 1 do_nanosleep
1000031.000 1000031.000 1000031.000 1 hrtimer_nanosleep
1000024.000 1000024.000 1000024.000 1 schedule
1387.208 1387.208 1387.208 1 __x64_sys_execve
1386.691 1386.691 1386.691 1 do_execveat_common.isra.0
1334.170 1334.170 1334.170 1 bprm_execve
1258.413 1258.413 1258.413 1 load_elf_binary
1123.068 1123.068 1123.068 1 begin_new_exec
1113.550 1113.550 1113.550 1 mmput
1109.237 1109.237 1109.237 1 exit_mmap
root@number:~# perf ftrace profile --graph-opts thresh=1200 -- sleep 1
# Total (us) Avg (us) Max (us) Count Function
1001448.204 500724.102 1000018.000 2 x64_sys_call
1000017.000 1000017.000 1000017.000 1 __x64_sys_clock_nanosleep
1000017.000 1000017.000 1000017.000 1 common_nsleep
1000017.000 1000017.000 1000017.000 1 hrtimer_nanosleep
1000016.000 1000016.000 1000016.000 1 do_nanosleep
1000012.000 1000012.000 1000012.000 1 schedule
1430.112 1430.112 1430.112 1 __x64_sys_execve
1429.581 1429.581 1429.581 1 do_execveat_common.isra.0
1376.289 1376.289 1376.289 1 bprm_execve
1301.743 1301.743 1301.743 1 load_elf_binary
root@number:~#
Reviewed-by: James Clark <james.clark@linaro.org>
Signed-off-by: Namhyung Kim <namhyung@kernel.org>
Tested-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Cc: Adrian Hunter <adrian.hunter@intel.com>
Cc: Ian Rogers <irogers@google.com>
Cc: Ingo Molnar <mingo@kernel.org>
Cc: Jiri Olsa <jolsa@kernel.org>
Cc: Kan Liang <kan.liang@linux.intel.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Link: https://lore.kernel.org/r/20250107224352.1128669-2-namhyung@kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
216 lines
6.4 KiB
Plaintext
216 lines
6.4 KiB
Plaintext
perf-ftrace(1)
|
|
==============
|
|
|
|
NAME
|
|
----
|
|
perf-ftrace - simple wrapper for kernel's ftrace functionality
|
|
|
|
|
|
SYNOPSIS
|
|
--------
|
|
[verse]
|
|
'perf ftrace' {trace|latency|profile} <command>
|
|
|
|
DESCRIPTION
|
|
-----------
|
|
The 'perf ftrace' command provides a collection of subcommands which use
|
|
kernel's ftrace infrastructure.
|
|
|
|
'perf ftrace trace' is a simple wrapper of the ftrace. It only supports
|
|
single thread tracing currently and just reads trace_pipe in text and then
|
|
write it to stdout.
|
|
|
|
'perf ftrace latency' calculates execution latency of a given function
|
|
(optionally with BPF) and display it as a histogram.
|
|
|
|
'perf ftrace profile' show a execution profile for each function including
|
|
total, average, max time and the number of calls.
|
|
|
|
The following options apply to perf ftrace.
|
|
|
|
COMMON OPTIONS
|
|
--------------
|
|
|
|
-p::
|
|
--pid=::
|
|
Trace on existing process id (comma separated list).
|
|
|
|
--tid=::
|
|
Trace on existing thread id (comma separated list).
|
|
|
|
-a::
|
|
--all-cpus::
|
|
Force system-wide collection. Scripts run without a <command>
|
|
normally use -a by default, while scripts run with a <command>
|
|
normally don't - this option allows the latter to be run in
|
|
system-wide mode.
|
|
|
|
-C::
|
|
--cpu=::
|
|
Only trace for the list of CPUs provided. Multiple CPUs can
|
|
be provided as a comma separated list with no space like: 0,1.
|
|
Ranges of CPUs are specified with -: 0-2.
|
|
Default is to trace on all online CPUs.
|
|
|
|
-v::
|
|
--verbose::
|
|
Increase the verbosity level.
|
|
|
|
|
|
OPTIONS for 'perf ftrace trace'
|
|
-------------------------------
|
|
|
|
-t::
|
|
--tracer=::
|
|
Tracer to use when neither -G nor -F option is not
|
|
specified: function_graph or function.
|
|
|
|
-F::
|
|
--funcs::
|
|
List available functions to trace. It accepts a pattern to
|
|
only list interested functions.
|
|
|
|
-D::
|
|
--delay::
|
|
Time (ms) to wait before starting tracing after program start.
|
|
|
|
-m::
|
|
--buffer-size::
|
|
Set the size of per-cpu tracing buffer, <size> is expected to
|
|
be a number with appended unit character - B/K/M/G.
|
|
|
|
--inherit::
|
|
Trace children processes spawned by our target.
|
|
|
|
-T::
|
|
--trace-funcs=::
|
|
Select function tracer and set function filter on the given
|
|
function (or a glob pattern). Multiple functions can be given
|
|
by using this option more than once. The function argument also
|
|
can be a glob pattern. It will be passed to 'set_ftrace_filter'
|
|
in tracefs.
|
|
|
|
-N::
|
|
--notrace-funcs=::
|
|
Select function tracer and do not trace functions given by the
|
|
argument. Like -T option, this can be used more than once to
|
|
specify multiple functions (or glob patterns). It will be
|
|
passed to 'set_ftrace_notrace' in tracefs.
|
|
|
|
--func-opts::
|
|
List of options allowed to set:
|
|
|
|
- call-graph - Display kernel stack trace for function tracer.
|
|
- irq-info - Display irq context info for function tracer.
|
|
|
|
-G::
|
|
--graph-funcs=::
|
|
Select function_graph tracer and set graph filter on the given
|
|
function (or a glob pattern). This is useful to trace for
|
|
functions executed from the given function. This can be used more
|
|
than once to specify multiple functions. It will be passed to
|
|
'set_graph_function' in tracefs.
|
|
|
|
-g::
|
|
--nograph-funcs=::
|
|
Select function_graph tracer and set graph notrace filter on the
|
|
given function (or a glob pattern). Like -G option, this is useful
|
|
for the function_graph tracer only and disables tracing for function
|
|
executed from the given function. This can be used more than once to
|
|
specify multiple functions. It will be passed to 'set_graph_notrace'
|
|
in tracefs.
|
|
|
|
--graph-opts::
|
|
List of options allowed to set:
|
|
|
|
- nosleep-time - Measure on-CPU time only for function_graph tracer.
|
|
- noirqs - Ignore functions that happen inside interrupt.
|
|
- verbose - Show process names, PIDs, timestamps, etc.
|
|
- thresh=<n> - Setup trace duration threshold in microseconds.
|
|
- depth=<n> - Set max depth for function graph tracer to follow.
|
|
- tail - Print function name at the end.
|
|
|
|
|
|
OPTIONS for 'perf ftrace latency'
|
|
---------------------------------
|
|
|
|
-T::
|
|
--trace-funcs=::
|
|
Set the function name to get the histogram. Unlike perf ftrace trace,
|
|
it only allows single function to calculate the histogram.
|
|
|
|
-b::
|
|
--use-bpf::
|
|
Use BPF to measure function latency instead of using the ftrace (it
|
|
uses function_graph tracer internally).
|
|
|
|
-n::
|
|
--use-nsec::
|
|
Use nano-second instead of micro-second as a base unit of the histogram.
|
|
|
|
--bucket-range=::
|
|
Bucket range in ms or ns (according to -n/--use-nsec), default is log2() mode.
|
|
|
|
--min-latency=::
|
|
Minimum latency for the start of the first bucket, in ms or ns (according to
|
|
-n/--use-nsec).
|
|
|
|
--max-latency=::
|
|
Maximum latency for the start of the last bucket, in ms or ns (according to
|
|
-n/--use-nsec). The setting is ignored if the value results in more than
|
|
22 buckets.
|
|
|
|
OPTIONS for 'perf ftrace profile'
|
|
---------------------------------
|
|
|
|
-T::
|
|
--trace-funcs=::
|
|
Set function filter on the given function (or a glob pattern).
|
|
Multiple functions can be given by using this option more than once.
|
|
The function argument also can be a glob pattern. It will be passed
|
|
to 'set_ftrace_filter' in tracefs.
|
|
|
|
-N::
|
|
--notrace-funcs=::
|
|
Do not trace functions given by the argument. Like -T option, this
|
|
can be used more than once to specify multiple functions (or glob
|
|
patterns). It will be passed to 'set_ftrace_notrace' in tracefs.
|
|
|
|
-G::
|
|
--graph-funcs=::
|
|
Set graph filter on the given function (or a glob pattern). This is
|
|
useful to trace for functions executed from the given function. This
|
|
can be used more than once to specify multiple functions. It will be
|
|
passed to 'set_graph_function' in tracefs.
|
|
|
|
-g::
|
|
--nograph-funcs=::
|
|
Set graph notrace filter on the given function (or a glob pattern).
|
|
Like -G option, this is useful for the function_graph tracer only and
|
|
disables tracing for function executed from the given function. This
|
|
can be used more than once to specify multiple functions. It will be
|
|
passed to 'set_graph_notrace' in tracefs.
|
|
|
|
-m::
|
|
--buffer-size::
|
|
Set the size of per-cpu tracing buffer, <size> is expected to
|
|
be a number with appended unit character - B/K/M/G.
|
|
|
|
-s::
|
|
--sort=::
|
|
Sort the result by the given field. Available values are:
|
|
total, avg, max, count, name. Default is 'total'.
|
|
|
|
--graph-opts::
|
|
List of options allowed to set:
|
|
|
|
- nosleep-time - Measure on-CPU time only for function_graph tracer.
|
|
- noirqs - Ignore functions that happen inside interrupt.
|
|
- thresh=<n> - Setup trace duration threshold in microseconds.
|
|
- depth=<n> - Set max depth for function graph tracer to follow.
|
|
|
|
|
|
SEE ALSO
|
|
--------
|
|
linkperf:perf-record[1], linkperf:perf-trace[1]
|