Profiling Native Code

We use cargo bench to run benchmarks to measure the performance of individual operators and expressions and cargo-flamegraph for profiling.

Running micro benchmarks with cargo bench

When implementing a new operator or expression, it is good practice to add a new microbenchmark under core/benches.

It is often easiest to copy an existing benchmark and modify it for the new operator or expression. It is also necessary to add a new section to the Cargo.toml file, such as:

[[bench]]
name = "shuffle_writer"
harness = false

These benchmarks are useful when for comparing performance between releases or between feature branches and the main branch to help prevent regressions in performance when adding new features or fixing bugs.

Individual benchmarks can be run by name with the following command.

cargo bench shuffle_writer

Here is some sample output from running this command.

     Running benches/shuffle_writer.rs (target/release/deps/shuffle_writer-e37b59e37879cce7)
Gnuplot not found, using plotters backend
shuffle_writer/shuffle_writer
                        time:   [2.0880 ms 2.0989 ms 2.1118 ms]
Found 9 outliers among 100 measurements (9.00%)
  3 (3.00%) high mild
  6 (6.00%) high severe

Profiling with cargo-flamegraph

Install cargo-flamegraph:

cargo install flamegraph

Follow the instructions in cargo-flamegraph for your platform for running flamegraph.

Here is a sample command for running cargo-flamegraph on MacOS.

cargo flamegraph --root --bench shuffle_writer

This will produce output similar to the following.

dtrace: system integrity protection is on, some features will not be available

dtrace: description 'profile-997 ' matched 1 probe
Gnuplot not found, using plotters backend
Testing shuffle_writer/shuffle_writer
Success

dtrace: pid 66402 has exited
writing flamegraph to "flamegraph.svg"

The generated flamegraph can now be opened in a browser that supports svg format.

Here is the flamegraph for this example:

flamegraph