All Classes and Interfaces
Class
Description
Configuration knobs for Arrow IPC sources passed to
SessionContext.registerArrow(String, String, ArrowReadOptions) and SessionContext.readArrow(String, ArrowReadOptions).Configuration knobs for Avro sources passed to
SessionContext.registerAvro(String, String, AvroReadOptions) and SessionContext.readAvro(String, AvroReadOptions).Configuration for DataFusion's built-in
CacheManager.Builder for
CacheManagerOptions.The value of a scalar UDF argument or result: either a per-row
ColumnarValue.Array of length
rowCount, or a ColumnarValue.Scalar (length-1 vector) that the framework broadcasts.Per-row Arrow vector of length equal to the batch row count.
Length-1 Arrow vector representing a single value broadcast across all rows.
Invalid or unrecognised DataFusion configuration option.
Configuration knobs for CSV sources passed to
SessionContext.registerCsv(String, String, CsvReadOptions) and SessionContext.readCsv(String, CsvReadOptions).Configuration knobs for writing CSV, passed to
DataFrame.writeCsv(String, CsvWriteOptions).A lazy representation of a query plan, mirroring the Rust DataFusion
DataFrame.Base unchecked exception for every error surfaced from the native DataFusion side.
Runtime execution failure: a UDF threw, a join task panicked, an external (non-DataFusion) error
propagated up, or an FFI-level failure surfaced.
Compression of a file-format source.
IO-shaped failure: a local filesystem read failed, an object store request failed, or a parquet /
arrow / avro decoder reported a malformed file.
Internal trampoline invoked from native code on every UDF call.
Configuration knobs for writing JSON, passed to
DataFrame.writeJson(String, JsonWriteOptions).Snapshot of session-wide memory usage from
SessionContext.memoryUsage().Loads the
datafusion_jni native library on demand.Configuration knobs for newline-delimited JSON sources passed to
SessionContext.registerJson(String, String, NdJsonReadOptions) and SessionContext.readJson(String, NdJsonReadOptions).The requested feature is recognised by DataFusion but not implemented yet.
Google Cloud Storage.
Listing-capable HTTP / WebDAV store.
Amazon S3 (and any S3-compatible endpoint such as MinIO, Cloudflare R2, Wasabi).
Configuration knobs for parquet sources passed to
SessionContext.registerParquet(String, String, ParquetReadOptions) and SessionContext.readParquet(String, ParquetReadOptions).Configuration knobs for writing parquet, passed to
DataFrame.writeParquet(String, ParquetWriteOptions).SQL parsing, logical planning, or schema-resolution failure.
The DataFusion runtime exhausted a configured resource budget — typically the memory pool, but
applies to any guard upstream surfaces as
DataFusionError::ResourcesExhausted.Snapshot of the underlying Tokio runtime from
SessionContext.runtimeStats().A Java-implemented scalar SQL function.
Bundle of inputs passed to
ScalarFunction.evaluate(org.apache.arrow.memory.BufferAllocator, org.apache.datafusion.ScalarFunctionArgs): the per-arg ColumnarValues
(in declared order) and the batch row count DataFusion is driving.A scalar UDF registration handle: pairs a
ScalarFunction implementation with the metadata
DataFusion needs to dispatch SQL calls to it.Convert between Arrow Java
Schema and the datafusion_common.Schema protobuf shape
used by DataFusion plan messages such as ListingTableScanNode.schema.A DataFusion session context.
Builder for a configured
SessionContext.A
TableProvider that pairs a fixed Schema with a function that opens a fresh
ArrowReader for each scan.A single sort key passed to
DataFrame.sort(SortExpr...), mirroring DataFusion's
expr::Sort{ expr, asc, nulls_first }.A Java-implemented table that can be registered with a
SessionContext via SessionContext.registerTable(String, TableProvider).Configuration knobs for
DataFrame.unnestColumns(UnnestOptions, String...), mirroring
DataFusion's UnnestOptions.Volatility classification for a UDF.