Comet Benchmarking Guide

To track progress on performance, we regularly run benchmarks derived from TPC-H and TPC-DS.

The benchmarking scripts are located in the apache/datafusion-comet repository.

Data generation scripts are available in the DataFusion Benchmarks GitHub repository.

Current Benchmark Results

The published benchmarks are performed on a Linux workstation with PCIe 5, an AMD 7950X CPU (16 cores), 128 GB RAM, and data stored locally in Parquet format on NVMe storage. Performance characteristics will vary between environments, so we encourage you to run these benchmarks in your own environment.

The operating system used was Ubuntu 22.04.5 LTS.
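
When you reproduce the benchmarks on your own hardware, a single timed run can be misleading because of caching and warm-up effects. The following is a minimal, generic sketch of timing a query over several iterations and summarizing the results. It is not the project's benchmark script: the `time_query` helper and the `run_query` callable are hypothetical names introduced here for illustration, and the placeholder workload stands in for whatever actually executes a TPC-H or TPC-DS query in your setup.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_query(run_query: Callable[[], object], iterations: int = 3) -> Dict[str, float]:
    """Run a query callable several times and summarize wall-clock time in seconds.

    `run_query` is a hypothetical zero-argument callable that executes one
    benchmark query to completion (for example, by collecting its results).
    """
    if iterations < 1:
        raise ValueError("iterations must be at least 1")

    timings: List[float] = []
    for _ in range(iterations):
        start = time.perf_counter()
        run_query()
        timings.append(time.perf_counter() - start)

    return {
        "min": min(timings),
        "median": statistics.median(timings),
        "max": max(timings),
    }


if __name__ == "__main__":
    # Placeholder workload; replace with a call that runs a real query.
    summary = time_query(lambda: sum(range(100_000)), iterations=5)
    print(summary)
```

Reporting the minimum, median, and maximum together makes it easier to see whether a difference between two environments is larger than the run-to-run variation within each one.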

Benchmarking Guides

Available benchmarking guides:

We also have many micro benchmarks, located here, that can be run from an IDE.