Introduction

The library user guide explains how to use the DataFusion library as a dependency in your Rust project and customize its behavior using its extension APIs.

To get started with DataFusion’s SQL and DataFrame APIs, see the user guide. For details on how to contribute to DataFusion, see the contributor guide.

If you haven’t reviewed the architecture section in the docs, it’s a useful place to get the lay of the land before starting down a specific path.

DataFusion is designed to be extensible at all points, including:

  • User Defined Functions (UDFs)

  • User Defined Aggregate Functions (UDAFs)

  • User Defined Table Sources (TableProvider)

  • User Defined Optimizer passes (plan rewrites)

  • User Defined LogicalPlan nodes

  • User Defined ExecutionPlan nodes
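
Most of the extension points above share a general shape: the engine defines a trait, and your code supplies an implementation that is registered with the engine. The sketch below is a toy model of that pattern using only the Rust standard library. It is not DataFusion's API; the names `ScalarFunction`, `Registry`, and `AddOne` are hypothetical and chosen only for illustration.

```rust
use std::collections::HashMap;
use std::sync::Arc;

/// Hypothetical extension trait (not a DataFusion type): anything that
/// implements it can be plugged into the toy engine below.
trait ScalarFunction {
    fn name(&self) -> &str;
    fn invoke(&self, args: &[i64]) -> Result<i64, String>;
}

/// A user-defined function supplied by the application, not the engine.
struct AddOne;

impl ScalarFunction for AddOne {
    fn name(&self) -> &str {
        "add_one"
    }

    fn invoke(&self, args: &[i64]) -> Result<i64, String> {
        match args {
            [x] => Ok(x + 1),
            _ => Err(format!("add_one expects 1 argument, got {}", args.len())),
        }
    }
}

/// Hypothetical registry standing in for an engine's function lookup.
#[derive(Default)]
struct Registry {
    functions: HashMap<String, Arc<dyn ScalarFunction>>,
}

impl Registry {
    /// Store the function under the name it reports for itself.
    fn register(&mut self, f: Arc<dyn ScalarFunction>) {
        self.functions.insert(f.name().to_string(), f);
    }

    /// Look a function up by name and call it through the trait object.
    fn call(&self, name: &str, args: &[i64]) -> Result<i64, String> {
        let f = self
            .functions
            .get(name)
            .ok_or_else(|| format!("unknown function: {name}"))?;
        f.invoke(args)
    }
}

fn main() {
    let mut registry = Registry::default();
    registry.register(Arc::new(AddOne));
    println!("{:?}", registry.call("add_one", &[41])); // prints Ok(42)
}
```

The registry only ever sees `Arc<dyn ScalarFunction>`, so new functions can be added without changing the engine code. Real extension points such as TableProvider have their own traits and method signatures, which this sketch does not attempt to reproduce.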