Apache DataFusion Blog

Articles by Andrew Lamb, Achraf B, and Sean Smith

tpchgen-rs World’s fastest open source TPC-H data generator, written in Rust

TLDR: TPC-H SF=100 in 1min using tpchgen-rs vs 30min+ with dbgen.

3 members of the Apache DataFusion community used Rust and open source development to build tpchgen-rs, a fully open TPC-H data generator over 20x faster than any other implementation we know of.

It is now possible to create …