mirror of https://github.com/bytecodealliance/wasm-micro-runtime.git synced 2025-07-11 06:53:16 +00:00

Enhance wasm loading with LoadArgs and support module names (#3265 )

- Add new API wasm_runtime_load_ex() in wasm_export.h
  and wasm_module_new_ex in wasm_c_api.h
- Put aot_create_perf_map() into a separated file aot_perf_map.c
- In perf.map, function names include user specified module name
- Enhance the script to help flamegraph generations

2024-04-07 15:04:35 +08:00

3.5 KiB

Raw Blame History

linux perf sample introduction

This is a sample to show how to use the Linux perf tool to profile the execution of a WebAssembly application. And how to use the Flamegraph tool to visualize the profiling result.

Build and run the sample

There are two Wasm modules and their instance will be created and run in the sample. The first module is a simple Wasm module that calculates the Fibonacci number. The second module is a simple Wasm module that execute the Ackermann function. The target is enable to profile the execution of both two modules separately.

$ cmake -S . -B build
$ cmake --build build

Profile the execution

$ cd build
$ perf record -k mono -g --output=perf.data -- ./linux_perf_sample

Enable to use perf report --stdio to do a quick analysis of the profiling result.

Visualize the profiling result

Need to download Flamegraph tool from Flamegraph firstly.

$ perf script > out.perf
$ ./FlameGraph/stackcollapse-perf.pl out.perf > out.folded
$ ./FlameGraph/flamegraph.pl out.folded > perf.svg

In this result, you'll see two modules's profiling result and all wasm functions are named as "aot_func#N" which is a little hard to distinguish.

Separate profiling result

process_folded_data.py is a script can a) translate "aot_func#N" into its original function name in name sections, b) separate the profiling result of different modules.

In this sample, we want to separate fib and ackermann profiling data from out.folded. In demo, we decide to name the module of fib1.wasm as fib2 and the module of ackermann1.wasm as ackermann2.

$ python process_folded_data.py --wabt_home /opt/wabt --wasm_names fib2=./fib1.wasm,ackermann2=./ackermann1.wasm out.folded
-> write into out.fib2.translated
-> write into out.ackermann2.translated
-> write into out.translated

More scenarios:

if only using one wasm during profiling, the script can be used like this:

$ python process_folded_data.py --wabt_home /opt/wabt --wasm <wasm_file> --folded <folded_file>

if only using one wasm during profiling and specify the module name via APIs, the script can be used like this:

$ python process_folded_data.py --wabt_home /opt/wabt --wasm_names <module name>=<wasm_file> --folded <folded_file>

if only using one wasm during profiling and specify the module name, which is same with the basename of wasm file, via APIs, the script can be used like this:

$ python process_folded_data.py --wabt_home /opt/wabt --wasm <wasm_file> --folded <folded_file>

if using multiple wasm during profiling and specify module names, which are same with basename of wasm files, via APIs, the script can be used like this:

$ python process_folded_data.py --wabt_home /opt/wabt --wasm <wasm_file> --wasm <wasm_file> --wasm <wasm_file> --folded <folded_file>

if using multiple wasm during profiling and specify module names via APIs, the script can be used like this:

$ python process_folded_data.py --wabt_home /opt/wabt --wasm_names <module_name>=<wasm_file>,<module_name>=<wasm_file>,<module_name>=<wasm_file> --folded <folded_file>

Now we have two flame-graphs for two wasm modules:

Reference

perf_tune

3.5 KiB Raw Blame History