Extensions List
DataFusion tries to provide a good set of features “out of the box” so you can
quickly get a working system running, but it can’t include every useful feature
(e.g. TableProviders for all data formats).
Thankfully, one of the core features of DataFusion is a flexible extension API that allows users to extend its behavior at all points. This page lists some community-maintained extensions available for DataFusion. These extensions are not part of the core DataFusion project and are not under Apache Software Foundation governance, but we list them here because they may be useful to others in the community.
If you know of an available extension that is not listed below, please open a PR
to add it to this page. If there is a feature you would like to see in
DataFusion, please consider creating a new extension in the datafusion-contrib
project (see below). Contact us via GitHub issue, Slack, or Discord and
we’ll gladly set up a new repository for your extension.
| Name | Type | Description |
|---|---|---|
|  |  | Support for |
|  | Framework | Allows DataFusion to execute (part of) a query plan by a remote execution engine. |
|  |  | Apache ORC file format |
|  | Functions | Scalar functions for querying JSON strings |
datafusion-contrib
The datafusion-contrib project contains a collection of community-maintained
extensions that are not part of the core DataFusion project and are not under
Apache Software Foundation governance, but may be useful to others in the
community. If you are interested in adding a feature to DataFusion, a new
extension in datafusion-contrib is likely a good place to start. Contact us via
GitHub issue, Slack, or Discord and we’ll gladly set up a new repository for
your extension.