Trampolines
Trampolines in computing has a number of meanings. In the context of Linux this refers to locations in memory containing addresses of logic to jump to. Trampolines are also referred to as "indirect jump vectors". It is a mechanism that has a number of use cases such as interrupt service routines or I/O routines. In these classic use cases the hardware hard-codes memory locations to which execution will jump when certain events such as interrupt happen. A trampoline typically jumps immediately to some other function where the actual handler lives, hence the term trampoline.
ftrace
Ftrace (function trace) is a mechanism in the kernel for observing function execution, traditionally for debugging purposes. Ftrace also uses trampolines, but in a slightly different way. It is enabled when compiled with the CONFIG_FUNCTION_TRACER
kernel configuration. When enabled, most files in the kernel are compiled with the -pg
and -mnop-mcount=...
flags. The -pg
flag enables profiling, which adds some additional CPU instructions to the start of most global functions. The -mnop-mcount=...
flag makes it so these CPU instructions are NOP
(No Operation) instructions as apposed to the normal profiling logic. So by default, when a function is called, the CPU encounters a few NOP
instructions which it will ignore and then continue the actual function. However, at runtime the kernel can replace these NOP
instructions with a trampoline to in the case of ftrace some tracing logic.
It should be noted that certain directories, files or functions in the kernel are excluded. For example the trace subsystem (kernel/trace) is excluded and any functions with the notrace
attribute. This is done to protect the user from infinite recursion or breaking assumptions in critical code sections of the kernel.
BPF trampolines
BPF programs can be attached to any function in the kernel that has enough NOP
instructions. When this is done, a "BPF trampoline" is created. This is a architecture specific function that boils down to:
- Allocate room on the stack for all function arguments + return value
- Copy all arguments to the stack, from the registers specified by the calling convention
- For each fentry program attached
- Call the fentry program with a pointer to the stack as context (if any is attached)
- Disable CPU migration (preemption on older kernels) before calling the program, re-enable after
- If stats tracking is enabled, start timer before execution and add run time to stats after execution
- Call the fentry program with a pointer to the stack as context (if any is attached)
- For each fmodify_return program attached
- Call a fmodify_return program with a pointer to the stack as context (if any is attached)
- Disable CPU migration (preemption on older kernels) before calling the program, re-enable after
- If stats tracking is enabled, start timer before execution and add run time to stats after execution
- If the return value is non zero, return it instead of continuing
- Call a fmodify_return program with a pointer to the stack as context (if any is attached)
- Call the original function (unless a fmodify_return program returned a non
0
value) - Copy return value from register onto the stack
- For each fexit program attached
- Call a fexit program with a pointer to the stack as context (if any is attached)
- Disable CPU migration (preemption on older kernels) before calling the program, re-enable after
- If stats tracking is enabled, start timer before execution and add run time to stats after execution
- Call a fexit program with a pointer to the stack as context (if any is attached)
- Return the return value from the stack
The NOP
s are replaced with a call to the generated trampoline and a return instruction.
Architecture support
Since BPF trampolines are architecture specific, support for a given architecture may be added later than the initial support for BPF trampolines. Here is a table of when support was added for each architecture:
Architecture | Support added |
---|---|
X86-64 | v5.5 |
ARM64 | v6.0 |
RISC-V | v6.3 |
S390 | v6.3 |
Other architectures currently lack support for BPF trampolines.
fentry/fexit/fmodify_return
The BPF_PROG_TYPE_TRACING
fentry/fexit/fmodify_return programs make use of these trampolines to attach and execute just before entering a function or right after it exits. Since the trampoline is essentially just a function call, the overhead is very minimal. So a way faster alternative to a kprobe which uses an interrupt, and thus a context switch which is way more expensive.
Not only native kernel functions can have these blank spots for trampolines. When BPF programs are JIT-ed, the kernel also gives them these spots which can be instrumented in the same manner. This allows fentry/fexit/fmodify_return programs to attach to other programs, for the purposes of observability.
Program replacement
Trampolines are also used to implement freplace programs, where one program replaces another. When a freplace program is attached, it installs a trampoline before the original program, jumps to the extension program and executes it instead, then returns without calling the original.
LSM
LSM programs also attach via trampolines very similar to fexit/fret_mod programs. The kernel defines placeholder functions for every hook, always starting with the prefix bpf_lsm_
. These placeholders simply return the default return value. When attached, the LSM program acts as fexit or fret_mod probe for the purposes of the BPF trampoline on these placeholder hooks.