Uses of Class
org.apache.datafusion.DataFrame
Packages that use DataFrame
-
Uses of DataFrame in org.apache.datafusion
Methods in org.apache.datafusion that return DataFrameModifier and TypeMethodDescriptionDataFrame.cache()Materialise this DataFrame into an in-memory table and return a new DataFrame that scans it.DataFrame.describe()Compute summary statistics (count, null_count, mean, std, min, max, median) over this DataFrame's columns and return them as a new DataFrame.DataFrame.distinct()Deduplicate rows across all columns.DataFrame.dropColumns(String... columnNames) Drop the named columns.DataFrame.explain(boolean verbose, boolean analyze) Return a new DataFrame whose rows describe the plan that would execute this DataFrame.Apply a SQL predicate to produce a filtered DataFrame.SessionContext.fromProto(byte[] planBytes) Decode a DataFusion-ProtoLogicalPlanNodeand return a lazyDataFrame.Equi-join this DataFrame withrighton the named columns, using the givenJoinType.DataFrame.join(DataFrame right, JoinType type, String[] leftCols, String[] rightCols, String filter) Equi-join this DataFrame withright, restricting the result with a residual SQL filter parsed against the combined schema (left columns followed by right columns; columns may be qualified with the relation alias when ambiguous).Join this DataFrame withrightusing arbitrary SQL predicates parsed against the combined schema.DataFrame.limit(int fetch) Take the firstfetchrows.DataFrame.limit(int skip, int fetch) Skipskiprows, then take the nextfetchrows.Read an Arrow IPC file as aDataFramewithout registering it.SessionContext.readArrow(String path, ArrowReadOptions options) Read an Arrow IPC file as aDataFramewith the suppliedArrowReadOptions.Read an Avro file as aDataFramewithout registering it.SessionContext.readAvro(String path, AvroReadOptions options) Read an Avro file as aDataFramewith the suppliedAvroReadOptions.Read a CSV file as aDataFramewithout registering it.SessionContext.readCsv(String path, CsvReadOptions options) Read a CSV file as aDataFramewith the suppliedCsvReadOptions.Read a newline-delimited JSON file as aDataFramewithout registering it.SessionContext.readJson(String path, NdJsonReadOptions options) Read a newline-delimited JSON file as aDataFramewith the suppliedNdJsonReadOptions.SessionContext.readParquet(String path) Read a parquet file as aDataFramewithout registering it.SessionContext.readParquet(String path, ParquetReadOptions options) Read a parquet file as aDataFramewith the suppliedParquetReadOptions.Project the listed columns into a new DataFrame.Parse and planquery, returning a lazyDataFrame.DataFrame.unnestColumns(String... columns) Expand list or struct columns into rows or fields, with defaultUnnestOptions(i.e.DataFrame.unnestColumns(UnnestOptions options, String... columns) Expand list or struct columns into rows or fields with the suppliedUnnestOptions.DataFrame.withColumn(String name, String expr) Add a column to this DataFrame computed from a SQL expression.DataFrame.withColumnRenamed(String oldName, String newName) Rename a column.Methods in org.apache.datafusion with parameters of type DataFrameModifier and TypeMethodDescriptionEqui-join this DataFrame withrighton the named columns, using the givenJoinType.DataFrame.join(DataFrame right, JoinType type, String[] leftCols, String[] rightCols, String filter) Equi-join this DataFrame withright, restricting the result with a residual SQL filter parsed against the combined schema (left columns followed by right columns; columns may be qualified with the relation alias when ambiguous).Join this DataFrame withrightusing arbitrary SQL predicates parsed against the combined schema.